Edlira Gugu's supplementary materials for a COMPTEXT 2026 paper include six appendices supporting a computational text analysis of 15,154 empirical research articles. The corpus spans Linguistics, Social Sciences, and Computer Science, partitioned into pre-LLM (2020-2022) and post-LLM (2023-2024) periods. Appendices contain corpus metadata, preprocessing code, extended statistical results, topic model documentation, a rhetorical template catalogue, and analysis code.
Files are provided as PDF documents and Python (.py) scripts. At 2.2 MB, the deposit holds supplementary documentation and code only, not the full text of the 15,154 articles.
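As a rough illustration of the period split described above, the sketch below assigns an article to the pre-LLM or post-LLM period from its publication year. It is not taken from the deposited code: the function name `period_for_year` is hypothetical, and it assumes the partition is made on publication year alone.

```python
def period_for_year(year: int) -> str:
    """Map a publication year to a corpus period.

    Hypothetical helper: assumes the split is based only on
    publication year, using the ranges given in the description.
    """
    if 2020 <= year <= 2022:
        return "pre-LLM"
    if 2023 <= year <= 2024:
        return "post-LLM"
    # Years outside 2020-2024 are not part of the described corpus.
    raise ValueError(f"year {year} is outside the 2020-2024 corpus window")


if __name__ == "__main__":
    for y in (2020, 2022, 2023, 2024):
        print(y, period_for_year(y))
```

Raising an error for out-of-range years, rather than returning a default label, keeps articles outside the described window from being silently assigned to either period.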