Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
2,414 dark web pages were collected via a custom crawler from the Torch search engine for a comparative analysis of text mining techniques. Jin Gyeong Kim published this dataset on figshare in May 2026, which contains the top 20 keywords extracted using the TF-IDF method. The work evaluates TF-IDF, Eigenvector Centrality, and Word2Vec for extracting investigative keywords related to child sexual abuse materials.
File format is XLS (Excel), requiring compatible software to open. License is CC-BY-4.0.