Reddit Cybersecurity Discussions with Themes and Sentiment, 2020-2025
by S. M. Mahedy Hasan · Updated 29d ago
43.0 MB · 1 file
Available on 1 platform
Description
Inside the Hive Mind is a text dataset of Reddit comments on cybersecurity topics from 2020 to 2025, created by S. M. Mahedy Hasan. It identifies frequently discussed themes, including password security, phishing, ransomware, AI-driven threats, and zero-trust security. The dataset also explores the distribution and evolution of sentiments and emotions across these topics over time.
Use Cases
Track how public concern over emerging cybersecurity threats such as ransomware and AI-driven attacks changes between 2020 and 2025.
Analyze sentiment and emotional tone (e.g., fear, concern) expressed in public discussions about traditional topics like password security and phishing.
Identify the most prominent discussion themes in cybersecurity discourse on Reddit.
Model shifts in public perception and thematic focus within cybersecurity from 2020 to 2025.
Strengths
Covers a multi-year time range from 2020 to 2025, allowing for longitudinal analysis.
Covers multiple dimensions: discussion themes, sentiment, and emotions.
Released under a permissive CC-BY-4.0 license for broad reuse.
Limitations
Row count and specific column structure are unknown, limiting suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Data may reflect the bias inherent to discussions on the Reddit platform.
Provenance
Source
figshare, author S. M. Mahedy Hasan
Collection Method
Mined from Reddit users' comments on cybersecurity-related posts.
Time Range
2020 to 2025
Freshness
Last updated 2026-05-07 20:55:52; freshness should be verified.
The dataset is packaged in a 43.0 MB ZIP file; actual internal file formats and structure are not specified.
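Because the internal file formats are not specified, a reasonable first step after download is to list the archive's contents without extracting anything. The following is a minimal sketch using Python's standard `zipfile` module. The function name `summarize_archive` and the archive path in the usage comment are illustrative assumptions, not part of the dataset's documentation.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(zip_path):
    """List the files in a ZIP archive and count them by extension.

    Returns a tuple (members, extension_counts), where members is a list of
    (filename, uncompressed_size_in_bytes) pairs. Nothing is extracted.
    """
    members = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries, keep only real files
            members.append((info.filename, info.file_size))

    # ZIP member names always use forward slashes, hence PurePosixPath.
    extension_counts = Counter(
        PurePosixPath(name).suffix.lower() or "(none)" for name, _ in members
    )
    return members, extension_counts


# Hypothetical usage; replace the path with wherever the archive was saved:
# members, counts = summarize_archive("path/to/downloaded_archive.zip")
# for name, size in members:
#     print(f"{size:>12,d}  {name}")
# print(dict(counts))
```

The extension counts give a quick indication of whether the archive holds tabular files, JSON, or something else, which in turn determines how to load it and infer the column semantics.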