News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,999 datasets
Longitudinal survey results from a cluster randomized controlled trial across 120 villages in rural Senegal. Created by the International Food Policy Research Institute (IFPRI) and updated in 2026, it evaluates the impact of the 'C’est la Vie!' television series on women aged 14–34. The records include baseline, midline, and endline data covering health, violence, and behavioral outcomes.
A dataset titled 'Trending Movie History-2025' published on Kaggle. The title suggests it contains information about movies and their historical trends, likely focusing on the year 2025. The dataset's specific contents, size, and origin are not detailed in the available metadata.
Trending movies sourced from The Movie Database (TMDB) API. The dataset includes popularity scores, ratings, and complete metadata for films. The author, organization, and specific temporal coverage are not provided in the available metadata.
400,000+ cleaned Indonesian user reviews for the Telegram application, sourced from the Google Play Store. The dataset is hosted on Kaggle, but the author, organization, and specific collection date are unknown. Column-level documentation and file format details are not provided in the available metadata.
An archive of PDF reports and listings for Ethics Compliance Reviews. The data includes columns for Year, Department / Agency / Authority, and File Type. It is published by data.nj.gov and was last updated in February 2026.
A dataset of review checkpoints hosted on Kaggle, likely containing text data for analysis. The dataset's specific content, size, and origin are not detailed in the available metadata. Users must download the data to verify its scope and structure.
This community-driven repository features 3D-scanned climbing holds and walls in GLB format, developed by setrsoft and updated through March 2026. The project currently provides raw scans intended for structural establishment and collaborative brand identification.
A news dataset published on Kaggle. The dataset's specific content, size, and origin are unknown from the provided metadata. Columns and sample data are unavailable, requiring verification after download.
A dataset for detecting fake news, likely containing text samples in both English and Bangla languages. It is hosted on Kaggle, but the specific volume, creation date, and authorship are not detailed in the available metadata. The dataset's content and structure must be verified after download.
Movie ranking data published on the Kaggle platform. The dataset's specific contents, such as ranking criteria, titles, or user scores, are not detailed in the available metadata. Its scale, origin, and recency are unknown and require verification after download.
A dataset concerning Kurdish cultural heritage, published on Kaggle. The dataset's specific content, size, and creation details are not provided in the metadata. Users must download the data to verify its scope, structure, and potential applications.
Depression Detection Reddit Dataset is a text corpus sourced from the Reddit platform. The dataset likely contains user posts or comments for the purpose of detecting depression-related language. It is published on Kaggle, but specific details about its size, creation date, and author are unknown.
Social media posts from the York Chapter of Moms for Liberty were collected from their account between January 1, 2022 and July 30, 2023. The dataset includes post text and any attached documents, newspaper articles, or referenced legislation. Jennifer D McGrew of the York County Chapter compiled this collection.
Amazon Echo Dot 2 reviews posted on Amazon during September and October 2017. The dataset includes fields such as review text, rating, user verification status, and review date. PromptCloud created this dataset via its web data extraction solution.
Social media posts from the Brevard Chapter of Moms for Liberty, collected between January 1, 2022 and July 30, 2023. The dataset includes post text and any attachments, referenced articles, documents, or legislation. Jennifer D McGrew of the Brevard County Chapter compiled this collection.
Movie data is a dataset hosted on Kaggle. The dataset's specific contents, size, and origin are not detailed in the available metadata. Its scope and structure require verification after download.
Movies dataset published on Kaggle. The dataset likely contains information related to films, but specific columns and content are unknown. Metadata is minimal; actual data quality and scope require verification after download.
HackerNews-Posts is a dataset of posts from the HackerNews community platform, sourced from Kaggle. The dataset likely contains user-submitted stories, comments, and associated metadata. Metadata is minimal; actual content requires verification after download.
Autonomous seawater measurements of partial pressure of carbon dioxide (pCO2) and pH from 40 surface buoys, collected from 2004 to 2017. The dataset includes sea surface temperature and salinity time series from 17 open-ocean, 13 coastal, and 10 coral reef sites. It was produced by the National Oceanic and Atmospheric Administration and provides a synthesis product for evaluating natural and anthropogenic trends in ocean carbonate chemistry.
An experimental study by Natalie Taylor and Jacqueline Joudo investigated the impact of testimony delivery methods on juror decisions in sexual assault cases. The project involved 210 public participants acting as jurors across 18 mock trials held in a mock courtroom. The report details the methodology, key findings, and juror feedback on reaching unanimous verdicts.