Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,957 datasets
37,455 video clips extracted from the EPIC-KITCHENS-100 dataset, comprising the extension not included in EPIC-KITCHENS-55. The clips are stored as MP4 files, organized by participant and narration ID, and have been downscaled and compressed. The dataset was created by lightly-ai and last updated on Hugging Face in March 2026.
A review of internationally accepted standard atomic weights for the elements, published by the IUPAC Commission on Atomic Weights and Isotopic Abundances. The work summarizes the development of atomic-weight determinations in the 20th century and variations recognized in the latter part of the century. It reflects the increasing scientific interest in source-specific atomic weight differences and their impact on fundamental constants.
CTD data from 16 oceanographic stations in Mar de la Flota, Antarctica, collected aboard the icebreaker Almirante Irizar during the 1995/96 Summer Antarctic Campaign. The data was gathered by the Argentine Antarctic Institute using a Neil Brown Ins. Sys. Model MKIIIb CTD profiler. The dataset was last updated in May 1996.
TEMPO satellite observations provide cloud information on a regular grid covering its field of regard. Level 3 files are derived by combining Level 2 scan cycles and contain effective cloud fraction and cloud optical centroid pressure. The near real-time product is optimized for low-latency applications, with processing completed within 2-3 hours after observation.
Supplying percentage distributions of selected covariates associated with depression levels. It focuses on older adults across four districts, with data covering factors like social isolation, financial stress, and perceived health status. The dataset is 13.5 KB in size.
Film genre statistics likely contain data on movie classifications and their associated metrics. The dataset is published on Kaggle, but its specific contents, size, and creation details are unknown. Its recency and the identity of its author are also unspecified.
Kaggle hosts a dataset of user reviews for the ChatGPT application from the Google Play Store. The dataset likely contains textual feedback, ratings, and metadata submitted by users. Its exact size, column structure, and collection period are unspecified.
TMDB Movies Dataset is a collection of movie-related data from The Movie Database (TMDB). It was published on Kaggle, but the specific number of records, columns, and creation details are unknown. The dataset's content and structure require verification after download.
Fake News is a dataset hosted on Kaggle. The dataset likely contains text articles or social media posts labeled for veracity. Metadata is minimal; actual content and labeling methodology require verification after download.
A list of movies curated as 'best' and published on Kaggle. The dataset's specific contents, such as titles, ratings, genres, or release years, are not detailed in the available metadata. The author, organization, and temporal coverage are unknown.
A Bangla Fake News Detection Dataset containing both real and fake news articles. The dataset likely contains text articles labeled for authenticity. Its author, organization, and specific size are unknown.
Materials for the EGU26 SC2.5 session, published by climet-eu on GitHub under a CC-BY-4.0 license. The repository was last updated on 2026-05-12. The specific content and scale of the materials are not detailed in the available metadata.
The Milford Microalgal Culture Collection from NOAA's Department of Commerce holds over 200 live cultures representing 13 classes of algae. Cultures are maintained in three different media and undergo routine transfers with bacterial/mold contaminant testing four times a year. Transfer dates and test results are recorded, with the dataset last updated in March 2026.
Replication data for the paper 'Nodeless superconducting gap and electron-boson coupling in (La,Pr,Sm)3Ni2O7 films'. Jianchang Shen deposited this dataset on Harvard Dataverse in May 2026. The data likely contains experimental measurements supporting the findings on superconductivity in rare-earth nickelate films.
32.7 KB of expression ratios for candidate regulatory sequences (CRS) in two human liver cell lines. The data, authored by Kyle Leix and last updated in March 2026, quantifies activity by comparing cDNA to genomic DNA barcode counts across five independent replicates. Ratios are normalized to negative control sequences for each sample.
A full dataset of identified lipids from E. coli bacteria expressing either the wild-type Tox-Act1 protein or its C151A mutant variant. The data was generated using LC-MS/MS methods and includes phospholipids and lysophospholipids analyzed in negative mode. Author Gianlucca G. Nicastro published the dataset on figshare under a CC-BY-4.0 license, with a last update in March 2026.
Sandeerah Choudhary published this dataset on figshare in March 2026. It contains 14 input features related to the mix design and aggregate properties of recycled aggregate concrete. The dataset is stored in an XLSX file sized at 46.3 KB.
The World Wilderness Areas dataset is a reconnaissance-level inventory of undeveloped land shaped primarily by natural forces. It was created by the Sierra Club and the World Bank's Center for Earth Resource Analysis, integrated by UNEP/GRID, and published in 1989. The data covers approximately 50 million square kilometers, representing about one-third of the Earth's land surface, and includes 1089 polygons in ARC/INFO vector format.
Northern Hemisphere maps document the trajectories of all atmospheric pressure centers from January 1899 to June 1939. The data was encoded from a 40-year map series and archived by the National Climatic Data Center. It includes separate decade-by-decade track maps for low and high pressure systems.
Russian-language podcast recordings generated by Google NotebookLM. The dataset contains 417 episodes totaling 77 hours, 23 minutes, and 22 seconds of audio in WAV format at 24 kHz. It was uploaded by AigizK to Hugging Face and last updated on March 26, 2026.