Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,040 datasets
100-foot buffers of stream centerlines with drainage areas of 64 acres or greater, created by the City of Austin. The dataset includes locations of structures, trees, and other features of concern threatened by erosion.
Moroccan discussions and public opinion about artificial intelligence were collected from Reddit. The dataset appears to contain real-world text data from social media posts and comments. The author, organization, and specific collection date are unknown.
This replication package provides the data and code documenting academic competition and collaboration trends in political science from 2003 to 2023. Authored by Carolina Torreblanca for Perspectives on Politics, it contains the necessary materials to reproduce all figures and tables from the associated study.
Peter Temin's integrated analysis of the Great Depression covers the economic experience in Britain, France, Germany, and the United States. The work describes the causes, widespread and prolonged nature, and eventual recovery from the depression. It also draws parallels to more recent economic policies, such as those of the early 1980s and the Reagan administration.
Edward C. Stewart designed training to increase cultural awareness through simulated role-playing exercises. The report describes the development of paper-and-pencil tests as interim estimates of training objectives and presents preliminary data on efficacy. Possible variations in training format are suggested.
A national survey dataset on arts and cultural participation, likely containing responses on attendance, engagement, and demographics. The data originates from a 1997 initiative by the Wallace-Reader's Digest Funds to encourage community foundations to invest in broadening cultural engagement. The dataset author is Chris Walker and it is hosted on the paperswithcode platform.
Urban Institute's Arts and Culture Indicators in Community Building Project presents a conceptual framework for measurement. The work, authored by Maria-Rosario Jackson, outlines guiding principles for quantifying culture's role in communities. It is hosted on the paperswithcode platform, which aggregates resources for data science and AI practitioners.
AG News New Non is a text dataset hosted on Kaggle. The dataset likely contains news articles, as suggested by its title. Its specific content, size, and origin require verification after download due to minimal provided metadata.
Roy G. D’Andrade's study compares personal and cultural values across American, Japanese, and Vietnamese populations. The dataset likely contains questionnaire responses organized around a conceptual framework for analyzing institutionalized values. The data source is paperswithcode, but the license is closed.
Kaggle hosts a dataset titled 'TMDB 5000 Movies - Cleaned Dataset'. The dataset likely contains metadata for approximately 5,000 films sourced from The Movie Database (TMDB). The cleaning process suggests it has been processed for consistency, but the specific columns, time range, and original author are unknown.
Movie data from The Movie Database (TMDB) intended for data cleaning practice. The dataset is hosted on Kaggle, but its specific size, row count, and column details are unknown. The author, organization, and last update date are also unspecified.
TMDB user ratings provide a list of top-rated movies. The dataset likely contains movie titles, ratings, and other metadata. Its origin and specific contents are not detailed in the provided metadata.
A dataset containing physiological and motion signals for modeling mental fatigue. It includes EEG, ECG, and IMU data, likely collected from participants during physical or cognitive tasks. The dataset is intended for use in federated learning models, suggesting a structure designed for distributed training across multiple clients.
Twitter data sourced from Kaggle. The dataset likely contains text-based content from the social media platform. Metadata is minimal; specifics such as size, columns, and license are unknown.
IMDB Popular Movies, TV Shows, Web Series - 2026 is a dataset from Kaggle. It contains information on movies, TV shows, and web series, including ratings, votes, and genre data. The author, organization, and specific temporal coverage are not provided in the input metadata.
Abdullah ALDEMİR published this social science dataset in 2026 to examine the non-linear relationships between Fear of Missing Out (FoMO), cyberloafing, and social media addiction. The data focuses on identifying behavioral threshold effects and digital dependency tipping points specifically within professional environments.
Indian-News-Articles is a collection of news content created due to a lack of available Indian news data on Kaggle. The dataset includes article text along with metadata such as author, link, and publish date, scraped from firstpost.com using Python and BeautifulSoup. The license is Database: Open Database, Contents: Original Authors.
ReviewBeauty2018 is a dataset of user reviews for beauty products, sourced from Kaggle. The dataset likely contains textual feedback and ratings from consumers. Its specific content and scale require verification after download.
News_df is a dataset of news articles published on Kaggle. The dataset's specific source, size, and time period are not detailed in the available metadata. Further inspection after download is required to determine its exact scope and content.
A dataset titled 'review-chekpoints--2026-04-28--13237-13237' published on Kaggle. The platform tags suggest it relates to model evaluation and review checkpoints, likely containing tabular data for assessing machine learning models. The author, organization, and specific data characteristics are unknown.