Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,035 datasets
This dataset supports research on the impact of the electric telegraph network expansion in America from 1840 to 1852 on national elections. It was created by Tianyi Wang to analyze how access to telegraphed news from Washington affected voter turnout and newspaper coverage of national politics.
Aggregated information about all Facebook content reshared 100 or more times from July 1, 2020 through February 1, 2021. Each row corresponds to an individual reshare tree, capturing its size and depth at specific hours and days from the initial post. The data originates from the U.S. 2020 Facebook and Instagram Election Study, a partnership between Meta and academic researchers.
Diffusion of Facebook Posts with 100 or More Reshares
Data from the 2020 U.S. Facebook and Instagram Election Study, a partnership between Meta and academic researchers. It contains aggregated log data from Instagram user accounts of participants in a deactivation experiment, focusing on political attitudes and behaviors.
Platform data from Facebook participants in the 2020 U.S. election deactivation experiment. The dataset contains aggregated log data for each participant's user account over specific time periods. It was produced by Meta Platforms, Inc. in partnership with academic researchers for the U.S. 2020 Facebook and Instagram Election Study.
ReviewBooks2018 is a dataset of book reviews published on Kaggle. The dataset likely contains textual reviews and associated metadata from the year 2018. Its specific content, scale, and authorship require verification after download.
IMDb Movie Metadata & Reviews Dataset provides data and user reviews for understanding movies. The dataset likely contains movie metadata such as titles, genres, and ratings, alongside user-generated reviews. Its source is Kaggle, but specific details about its size, author, and update date are unknown.
Replication data for a study on engagement, incivility, and constructive language on Twitter by Kevin Li. The dataset is hosted by Harvard Dataverse and was last updated on March 16, 2026. It likely contains metrics and text related to social media interactions.
A dataset containing cultural text and associated temporal engagement metrics, published on Kaggle. The raw description indicates it focuses on regional narratives. Specific details regarding size, authorship, and temporal coverage are not provided in the available metadata.
A dataset titled 'review-chekpoints--2026-05-02--13241-13241' was published on the Kaggle platform. The title suggests it may contain data related to model checkpoints or evaluation metrics. Metadata is minimal; the actual content, scale, and authorship require verification after download.
Kaggle hosts a dataset titled 'Movies Dataset'. The dataset's specific content, size, and creator are not detailed in the provided metadata. Its last update date and licensing information are also unknown.
A dataset titled 'mes final review' published on Kaggle. The title suggests it may relate to student assessments or final reviews, potentially within an educational context. No further metadata is available to confirm its specific contents, size, or origin.
Kaggle hosts a dataset of top-rated movies sourced from The Movie Database (TMDB). The dataset likely contains movie titles, user ratings, and other metadata. The specific number of records, columns, and time period covered are unknown from the provided metadata.
A subset of the SynVA Procedural Dataset, published on Kaggle for review purposes. The dataset's full scope, creation method, and temporal coverage are not specified in the available metadata. Content and structure require verification after download.
cultural-events-fr is a dataset published on Kaggle. The title suggests it contains information about cultural events in France. The dataset's specific content, size, and origin are unknown from the provided metadata.
A dataset titled 'cultural-events-fr-short' is hosted on Kaggle. The dataset likely contains information about cultural events in France, but its specific content, size, and origin are unconfirmed. Metadata is minimal; actual data requires verification after download.
Twitter_Data_30K is a dataset of social media posts sourced from the Twitter platform. The dataset likely contains 30,000 text entries, though the specific content, time range, and collection method are not detailed in the provided metadata. It was published on Kaggle, but the author, organization, and license information are unknown.
MN_DS News is a dataset published on Kaggle. Its title suggests it contains news-related content, likely text articles. The dataset's specific scope, size, and collection details are not provided in the available metadata.
Hotel reviews from a leading travel site, containing user-provided text and metadata. The dataset includes columns for a unique User_ID, the review Description, Browser_Used, Device_Used, and a target variable Is_Response. It is published under a CC0-1.0 license on the OpenML platform.
A dataset of car reviews from Edmunds. No information is available regarding its size, features, or creation details.