Hacker News stories, comments, and polls collected via the Algolia API. The content is user-generated and comes from a discussion forum focused on technology and startups. The author, organization, and last update date are unknown.
Use Cases
- Analyze discussion trends in story and comment text.
- Study community voting patterns using the poll records that the description says are included.
- Build search or recommendation models on forum content gathered at scale.
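As a rough illustration of the first use case, the sketch below counts how many records per month mention a keyword. The field names `created_at` and `text` are assumptions, since the dataset's actual columns are not documented, and the sample rows are made up; adjust both after inspecting the real files.

```python
from collections import Counter
from datetime import datetime


def monthly_mentions(records, keyword, date_field="created_at", text_field="text"):
    """Count records per YYYY-MM whose text contains keyword (case-insensitive).

    date_field and text_field are hypothetical column names.
    """
    counts = Counter()
    needle = keyword.lower()
    for rec in records:
        text = rec.get(text_field) or ""  # treat missing or None text as empty
        if needle not in text.lower():
            continue
        # Assumes ISO 8601 timestamps such as "2023-05-01T12:00:00Z".
        stamp = datetime.strptime(rec[date_field][:10], "%Y-%m-%d")
        counts[stamp.strftime("%Y-%m")] += 1
    return dict(sorted(counts.items()))


# Made-up rows for illustration only.
sample = [
    {"created_at": "2023-05-01T12:00:00Z", "text": "Rust is great"},
    {"created_at": "2023-05-09T08:30:00Z", "text": "Learning rust this week"},
    {"created_at": "2023-06-02T10:00:00Z", "text": "Go vs Rust"},
    {"created_at": "2023-06-03T10:00:00Z", "text": None},
]
print(monthly_mentions(sample, "rust"))
```

Plain substring matching is used here for simplicity; a real analysis would probably want tokenization so that "rust" does not also match "trust".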
Strengths
- Data is gathered via the Algolia-powered Hacker News Search API, which returns structured JSON, suggesting a structured collection method.
- The description mentions collection at scale, implying a potentially large volume of records.
Limitations
- Row count, column definitions, and file formats are unknown, limiting suitability assessment.
- Last update date is unknown; freshness unverified.
- Description metadata is limited; actual data quality requires manual inspection after download.
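Because row count and column definitions are unknown, a first inspection pass might look like the sketch below. It assumes the downloaded file has already been loaded into a list of dictionaries (for example with `json.load` or `csv.DictReader`); the function name `summarize` and the sample rows are hypothetical.

```python
from collections import Counter


def summarize(records):
    """Return row count, observed column names, and per-column missing counts."""
    columns = []
    for rec in records:
        for key in rec:
            if key not in columns:
                columns.append(key)  # keep first-seen order
    missing = Counter()
    for rec in records:
        for col in columns:
            # Absent keys, None, and empty strings all count as missing.
            if rec.get(col) in (None, ""):
                missing[col] += 1
    return {"rows": len(records), "columns": columns, "missing": dict(missing)}


# Made-up rows; the real column names are not documented.
rows = [
    {"id": 1, "title": "Show HN: example", "text": ""},
    {"id": 2, "title": None, "text": "a comment"},
]
report = summarize(rows)
print(report)
```

A report like this answers the most basic suitability questions (how many rows, which fields, how sparse) before any deeper quality checks.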
Provenance
- Source: Hacker News via Algolia API
- Collection Method: Scraped via API
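The dataset does not document how the API was called. For orientation only, the sketch below builds request URLs for the public Hacker News Search API hosted by Algolia, using its `search_by_date` endpoint with the `tags`, `page`, and `hitsPerPage` parameters. The helper name `build_search_url` and the chosen page size are assumptions, not the collector's actual settings, and no network request is made.

```python
from urllib.parse import urlencode

BASE_URL = "https://hn.algolia.com/api/v1/search_by_date"


def build_search_url(tag, page=0, hits_per_page=100):
    """Build a query URL for one item type (e.g. story, comment, poll)."""
    params = {"tags": tag, "page": page, "hitsPerPage": hits_per_page}
    return f"{BASE_URL}?{urlencode(params)}"


# One URL per item type named in the description.
for tag in ("story", "comment", "poll"):
    print(build_search_url(tag))
```

A scraper built on this would fetch each URL, read the JSON response, and advance `page` until no more results are returned.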