Available on 1 platform
This 5.5 KB dataset contains tuned TF-IDF and LSA parameters for modeling discussion initiation and growth on Reddit. It was created by Cara Lynch and last updated on 2026-05-14. The data is derived from reconstructed threads from r/politics and r/CryptoCurrency, from which textual, semantic, temporal, domain, and author features were extracted.
The file is in XLS format and requires software that can read Excel files.