Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,020 datasets
A collection of movie-related datasets published on the Kaggle platform. The specific contents, scale, and creation details are not provided in the metadata. Users must inspect the data after download to determine its suitability for their projects.
From July 1, 2020 to February 1, 2021, this dataset aggregates statistics for content reshared between 1 and 99 times. It was created by Meta Platforms, Inc. as part of the U.S. 2020 Facebook and Instagram Election Study, providing aggregated metrics per tree size and content category.
From March to November 2020, this dataset provides tweet IDs related to COVID-19 discourse, collected over eight months in the first year of the pandemic. It tracks keywords including generic terms like 'corona virus' and non-pharmaceutical interventions such as 'lockdown' and 'social distancing', based on a list from prior research.
ReviewToys2018 is a dataset of product reviews for toys, published on Kaggle. The dataset likely contains user-generated text and potentially associated metadata from the year 2018. The specific author, organization, and data collection method are unknown.
A dataset titled 'ReviewBeauty2018' suggests a collection of product reviews from the year 2018. The dataset is hosted on Kaggle, but no further details about its creator, size, or specific content are provided. Its title indicates a focus on beauty-related products.
ReviewMovies2018 is a dataset of movie reviews published on Kaggle. The dataset likely contains user-generated text and associated metadata from the year 2018. The author, organization, and specific collection method are unknown.
MetaMovies2018 is a dataset published on Kaggle. Its title suggests it contains metadata related to movies, likely from the year 2018. The dataset's specific content, size, and origin are not detailed in the available metadata.
ReviewMagazine2018 likely contains textual reviews of magazines from the year 2018. Published on Kaggle, the dataset's specific content, size, and origin are not detailed in the provided metadata. Its structure and potential applications must be verified after download.
Kaggle hosts a dataset titled ReviewVideo2018. The dataset likely contains video review content from the year 2018. Its specific size, origin, and detailed contents are not documented in the provided metadata.
Replication data for a forthcoming Review of Economics and Statistics article by Hamid Firooz. The dataset supports analysis of cross-country heterogeneous responses to competition in international trade.
A dataset titled 'review-chekpoints--2026-05-09--13248-13248' was published on the Kaggle platform. The dataset's specific content, such as the number of rows or columns, is not described in the available metadata. Its title suggests it may contain data related to evaluating or reviewing model checkpoints in a machine learning workflow.
Twitter Customer Service Interaction Summarization is a dataset hosted on Kaggle. It likely contains text data from public interactions between customers and brands on the Twitter platform. The dataset's specific size, authorship, and update history are not provided in the available metadata.
A text dataset documenting Saudi dialects and cultural traditions. The data appears to cover all provinces of Saudi Arabia, providing a representation of regional linguistic and cultural diversity. The author, organization, and specific collection details are unknown.
Three simulation models for molten carbonate fuel cells, Fischer-Tropsch synthesis, and CO2 compression, developed by the British Geological Survey. The models enable investigation of fuel composition, design parameters, and process conditions on system performance and carbon capture efficiency. Key findings include the relationship between CO2 concentration in flue gas and the carbon capture factor.
Top News Dataset is a collection of news articles published on Kaggle. The dataset's specific size, source, and time period are unknown. Metadata is minimal; actual content requires verification after download.
Telugu movies is a dataset published on Kaggle. The dataset likely contains information related to films from the Telugu-language cinema industry. Metadata is minimal; actual content requires verification after download.
A collection of highly-rated movies sourced from The Movie Database (TMDB). The dataset is hosted on Kaggle, but its specific size, update history, and creator are unknown. Its content likely includes movie titles, ratings, and other metadata from the TMDB platform.
2026-03-25 updated boundaries represent subdivision review cases for the City of Austin Planning and Development Review Department. The dataset supports departmental business processes and is not the final recorded subdivision boundaries. Data formats include CSV, XML, RDF, and JSON.
52-dimensional MediaPipe blendshape features extracted from web-crawled images of Asian facial expressions. The dataset is structured for seven distinct emotion categories. Its origin and specific collection details are not provided in the available metadata.
A list of 100 top-rated movies from IMDb. The dataset includes genre, release year, and a description for each film. It was cleaned via an API, but the original author and update date are unknown.