Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,984 datasets
Date from a textile analysis of linen archaeological textiles. The dataset is authored by Payton Becker and was last updated in March 2026. It is a small dataset of 17.8 KB with an unknown number of rows and columns.
December 2025 slide decks summarising sector-level evidence for the 2025–26 refresh of London's Local Skills Improvement Plan (LSIP). These early resources reflect data and stakeholder input available at the time, with a fully reviewed evidence base to follow. The materials were produced by the Greater London Authority.
Ofcom annual reports detail the performance of the UK's fixed broadband, mobile, WiFi, digital television, and radio networks. The data is gathered from major fixed broadband Internet Service Providers, including BT, KCOM, Sky, TalkTalk, and Virgin Media, covering both retail and wholesale services. The dataset was last updated on March 25, 2026.
Tech News Daily is a dataset published on HuggingFace by author shaurya03. The title suggests it likely contains daily technology news articles or headlines. The dataset was last updated on 2026-04-18 17:29:00.
review-chekpoints--2026-06-05--13275-13275 is a dataset hosted on Kaggle. The title suggests it contains information related to checkpoints, likely for evaluating or reviewing machine learning models. No further metadata, such as author, size, or column descriptions, is available.
Daniel C. Hallin's study analyzes the complete New York Times corpus from 1961 to 1965 and a sample of hundreds of television reports from 1965 to 1973. The work draws on interviews with journalists and Defense Department footage to critique conventional wisdom about media influence. It provides a detailed account of what Americans read and watched about the Vietnam War.
134.6 hours of manually curated Khmer speech-text pairs focusing on Cambodian cultural topics, produced by DDD-Cambodia. The collection features recordings from 8 native speakers with an average duration of 8.54 seconds per utterance and includes detailed speaker metadata.
TMDB.MOVIES.CSV is a dataset from Kaggle, likely containing information about movies sourced from The Movie Database. The specific number of records, columns, and data collection timeframe are unknown. The dataset's author and organization are not provided.
Movies is a dataset published on Kaggle. Its specific contents, size, and origin are not detailed in the provided metadata. The dataset likely contains information related to films, such as titles, genres, or ratings.
You Liu's dataset on figshare analyzes the allocation and impact of great coaches on Olympic medal predictions. The 5.5 KB XLS file contains methodological analysis related to coaching systems like the Brazilian Swimming Team and Chinese Diving Team. It was last updated in March 2026.
You Liu's research dataset analyzes the allocation and impact of great coaches on Olympic performance. The dataset is 5.5 KB in size and is stored in an XLS file format. It employs methods like the Improved Cusum Algorithm and Decay Model for analysis.
Most Popular Movies is a dataset hosted on Kaggle. Its specific contents, such as the number of records, columns, and time period covered, are not detailed in the available metadata. The dataset likely contains information related to movie popularity metrics.
Kaggle hosts a dataset of wine reviews. The dataset's specific size, origin, and content are not detailed in the provided metadata. Its actual scope and quality require verification after download.
Vincent Barker's 182 KB dataset, shared under a CC-BY-4.0 license, supports a paper currently under review. Its title and tags suggest it contains metrics related to Btof and family firms, likely in a tabular format. Metadata is minimal; actual content requires verification after download.
Movies_5000 is a dataset published on Kaggle. The title suggests it contains information related to films, likely including titles and other attributes. The dataset's specific content, size, and origin require verification after download.
A dataset titled 'movies' published on the Kaggle platform. The specific content, size, and origin are unknown from the provided metadata. Further details such as columns, sample data, and license require verification after download.
The DigiFakeAV dataset contains synthetic and real video clips, with video frames stored as raw numpy arrays and audio provided in uncompressed WAV format. It was created by author 'cambrain' and last updated on March 12, 2026. The dataset is part of a project focused on audio-visual media.
Kaggle hosts a dataset focused on movie recommendations. The dataset likely contains user-movie interaction data, such as ratings or watch history, suitable for building recommendation systems. Its specific scale, origin, and creation date are not detailed in the provided metadata.
A dataset hosted on Kaggle by user Harshit76. The dataset likely contains text data related to depression counseling. Its specific contents, size, and creation details are not provided in the available metadata.
Movies dataset is a collection of data related to films, published on the Kaggle platform. The specific contents, scale, and creation details are unknown from the provided metadata. Users must download the dataset to verify its actual scope and features.