News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,984 datasets
Several hundred TIFF scans contain black and white infrared images from the Geosynchronous Very High Resolution Radiometer on the ATS-6 satellite. Each scan file contains 2 or 3 pictures, originally produced on 70mm film from digital tapes. William E. Shenk of NASA Goddard Space Flight Center was the principal investigator for data collected from launch until August 15, 1974.
Daily brightness temperature images for 1975 to 1976 from the Nimbus-6 Electrically Scanning Microwave Radiometer (ESMR), saved as TIFF files. The dataset contains images from 70-mm photofacsimile film strips, with about 5-10 months of data archived per ZIP file. It was produced by the GES DISC; the instrument operated until September 15, 1976.
Nimbus-5 satellite data provides daily brightness temperature images captured by the Electrically Scanning Microwave Radiometer (ESMR) instrument. The GES DISC archives this collection, which contains imagery from December 11, 1972, to May 16, 1977. Images are stored as JPEG 2000 files, with approximately two weeks of data bundled into each TAR archive.
Six species of Antarctic marine microalgae were examined for gas exchange and inorganic carbon uptake, revealing evidence of a CO2-concentrating mechanism. The dataset also includes analyses of functional age and vitality in macroalgal populations, such as the brown alga Durvillaea. Metadata originates from ASAC Project 102, managed by the Australian Antarctic Data Centre, with a last update recorded in March 1995.
Labour force estimates expressed in thousands with annual frequency from 2011 to 2025. The dataset likely contains counts disaggregated by sex and age groups. Its author, organization, and specific geographic scope are unknown.
A dataset about the nature and culture of Kurdistan. The dataset is hosted on Kaggle. The author, organization, and specific data collection details are unknown.
Authentic e-commerce customer reviews, ratings, and sentiment data from Lazada Vietnam. The dataset is hosted on Kaggle, but specific details like row count, author, and license are not provided. The description indicates the data is authentic, suggesting it was collected from the live platform.
A national poll conducted by CBS News and ORC, harvested into the Roper dataverse. The dataset was last updated on April 23, 2026. It likely contains survey responses on political and social topics.
Movie-related data sourced from The Movie Database (TMDB). The dataset's specific size, columns, and time range are not detailed in the provided metadata. It is hosted on Kaggle, a platform for data science projects.
Aya Shoshan's dataset contains bibliometric records for 6,701 studies on social media and social movements published between 2005 and 2023. The data supports analysis of canonical literature, research approaches, and methods used to study social movements in the age of datafication. It was created to map the evolution of scholarship and identify gaps, such as a scarcity of experimental research isolating causal mechanisms.
A daily-updated list of all active entertainment permits granted by the San Francisco Entertainment Commission. The dataset includes seven permit types, such as Place of Entertainment and Dance Hall, and is geocoded with latitude and longitude. It is maintained by the City of San Francisco and updated automatically by DataSF.
Four sand pack samples provide P-wave velocity and attenuation (1/Q) measurements across a spectrum of water saturation states, from dry to fully saturated. Data were collected in a 4.5-meter acoustic pulse tube at the National Oceanography Centre between March 2022 and January 2023, under 10 MPa effective pressure. The experiment, funded by NERC and led by Hanif Sutiyoso, fills a gap in sonic frequency (1-20 kHz) measurements for sediment analysis.
Tweet metadata for Mexican president Andrés Manuel López Obrador from October 2009 to March 2026, covering 6,780 original posts and replies. The corpus is organized into three political periods: pre-presidency, presidency, and post-presidency. It was collected by Julio César Chavelas for academic research on democratic erosion and populist communication.
review-chekpoints--2026-06-06--13276-13276 is a dataset published on Kaggle. The title suggests it contains review-related data, possibly with checkpoints indicating stages or versions. The dataset's specific content, size, and origin are not detailed in the available metadata.
A benchmark dataset for evaluating the robustness of image detectors against compression artifacts from major social media platforms. The description indicates it tests detector performance under compression from Instagram, Facebook, TikTok, and X. It is licensed under CC BY-NC 4.0.
Movies.Rating is a dataset hosted on Kaggle, likely containing user-submitted ratings for films. The dataset's author, organization, and specific details like row count and column definitions are not provided. Its content and structure must be verified after download.
Data for a project investigating the self-effects of AI-mediated communication. The dataset likely contains variables related to human and AI authorship, relational meta-perceptions, message evaluations, and linguistic markers of expressive writing. It was authored by Jorge Pena and published on the Harvard Dataverse platform, with a last update recorded on 2026-04-28.
A 54.6 KB dataset containing studies on Green Human Resource Management that use Structural Equation Modeling. The collection was compiled by Renata Skýpalová and last updated in March 2026.
OACIRR is a benchmark dataset for referential anchoring in composed image retrieval, released by HaHaJun1101. It supports research on tasks beyond semantic search, as presented in a CVPR 2026 paper. The dataset was officially released on March 24, 2026.
This dataset lists community solar projects identified from various sources as of Spring 2018. It includes project attributes such as State, Service Territory, and System Capacity. The database is maintained by the Department of Energy's National Renewable Energy Laboratory (NREL).