Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,992 datasets
A collection of TikTok-processed audio tracks intended for AI audio detection research. The dataset includes music generated by AI tools Suno and Udio alongside real music. Its author, organization, and specific size are unknown.
News reports from Old Time Radio Programs, primarily focused on World War II events. The dataset was compiled by Andemand11 from the Internet Archive and was last updated in April 2026. Specific details on the number of files, duration, and exact temporal coverage are not provided.
Overwatch-competitions-data-7-seasons contains cleaned match records from the Overwatch video game. The data was originally collected by a user on the Overwatch subreddit and later shared on Kaggle by Myles O'Neill. The dataset has been processed for easier visualization and analysis.
A dataset related to the TikTok platform, published on Kaggle. The specific content, size, and origin are not detailed in the provided metadata. Its recency and scope are unknown.
Sports is a dataset hosted on the Kaggle platform. The dataset's specific content, size, and structure are not described in the provided metadata. Its origin and creation date are unknown.
Komentar TikTok Koperasi Desa Merah Putih contains public comments from the social media platform TikTok regarding the KDMP program. The dataset was sourced from Kaggle, but its author, organization, and last update date are unknown. The description indicates the data consists of textual comments, but the total number of rows and specific column structure are not provided.
WatchEQ Index is a preview dataset providing market intelligence scores that rank U.S. cities for luxury watch retail expansion. The dataset appears to be a tabular ranking of cities based on factors relevant to the luxury watch market. The author, organization, and specific data collection method are unknown.
Kaggle hosts a dataset titled '[movie1m] recsys-lightgcnpp'. The title suggests it contains data for training and evaluating a LightGCN++ model for movie recommendation systems. The dataset's specific content, size, and origin are not detailed in the provided metadata.
South Korean movies dataset published on Kaggle. The dataset's specific contents, size, and creation details are not provided in the available metadata. Further details such as the author, number of rows, and time period covered require verification after download.
NASA's Parker Solar Probe WISPR instrument provides data cube files in IDL/SolarSoft MVI formats. A single file contains all information for a time-lapsed movie, including a file header, image headers, and byte-scaled image arrays. The dataset was last updated on March 13, 2026.
2013-2014 lab experiments conducted at the Cefas laboratory in Lowestoft, UK. Multiple marine phytoplankton taxa were preserved in different Lugol's Iodine solutions and enumerated over 8 months to investigate degradation rates. The data are the results of these preservation experiments.
Salcido, Alexis published this dataset on Harvard Dataverse in April 2026. It contains histological analysis results for cFos and pPDH expression in the Lateral Habenula brain region. The data likely compares expression levels across saline control and multiple IBU concentration doses (0.5x, 1x, 2x, and 10x).
An ESRI Shapefile vector line dataset showing the approximate designated route of the Pony Express National Historic Trail. The dataset is provided by the Department of the Interior and was last updated on March 4, 2026.
A hand-curated directory of approximately 15,900 online newspapers and magazines. The dataset is hosted on Kaggle and appears to be a structured list of global media sources. Details on the creator, specific columns, and last update date are unknown.
Review checkpoints data published on Kaggle. The dataset likely contains evaluation metrics or model states from a training process. Its specific content and scale require verification after download.
Trending movie releases data published on Kaggle. The dataset likely contains information on films considered popular or gaining attention. Its temporal coverage extends up to December 2031.
Water temperature, salinity, and pressure measurements collected by a CTD instrument from the vessel Kapitan Dranitsyn. The data were gathered in the Laptev Sea and Arctic Ocean during a cruise from August 18 to September 3, 2009. The dataset is provided by the National Oceanic and Atmospheric Administration.
London's first Cultural Infrastructure Map plots the location of cultural facilities and assets. Data sets were collected from summer 2024 to summer 2025 and published by the Greater London Authority (GLA) in 2024 and 2025. The map enables viewing this infrastructure alongside contextual data to inform policy and investment decisions.
Public comments collected from the social media platform TikTok regarding the Makan Bergizi Gratis (MBG) or Free Nutritious Meals program. The dataset likely contains user-generated text reflecting public opinion and discussion. The author, organization, and specific collection details are not provided.
Andrew Watson of the Ames Research Center reviews seven published formulas and proposes a new unified formula for pupil size. The formula incorporates the effects of luminance, adapting field size, observer age, and monocular or binocular adaptation. The work includes interactive demonstrations and software implementations.