Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,932 datasets
Mars Express spacecraft raw measurements from the SPICAM ultraviolet spectrometer. The data was collected by NASA during the extension 3 mission phases on Mars. The dataset is a level 0A product, indicating raw, unprocessed instrument data.
Geoscience Australia Data provides a mineral systems framework review for high-purity silica exploration. The report evaluates five components of silica mineral systems, including sources, energy drivers, and fluid pathways, to identify key formation criteria. It aims to de-risk exploration and support future mineral potential assessments in Australia.
Eleven horses were studied in an experiment comparing invasive and oscillometric blood pressure measurements. Under isoflurane anaesthesia, blood was removed and returned in 5 ml/kg increments up to 25 ml/kg, with paired measurements taken before and during the procedure. The raw data was published by Colleen Hickey on figshare in March 2026.
Heard Island tidal measurements collected in March 1997 using a portable pressure transducer, yielding a 22-hour record. The data was connected to a local geodetic benchmark via leveling and a 2-hour GPS observation. The dataset also references historical tidal data from the National Tidal Centre for 1948, 1980, and 1986-1987.
Supplementary data from a cross-sectional content analysis of Bilibili and TikTok videos as sources of HPV-related medical information. The dataset, published on figshare by Zhaohui Jiang, is an 11.2 KB Excel file last updated in April 2026. Its specific content and row count are unknown from the provided metadata.
Supplementary material from a cross-sectional content analysis of HPV-related medical information on Bilibili and TikTok video platforms. The dataset is a 96.8 KB Excel file published on figshare by author Zhaohui Jiang under a CC-BY-4.0 license, last updated in April 2026. Its specific content likely contains coded or extracted information from the analyzed videos.
TalTechNLP provides transcriptions for approximately 40,000 video news stories from Estonian National Broadcasting (ERR), totaling around 4,000 hours of audio. The transcriptions were generated automatically using the Gemini-3-Flash-Preview speech recognition model, with contextual biasing applied using related textual news to improve quality. The dataset was last updated on March 31, 2026.
Townsville City Council provides data on interments at its four municipal cemeteries: Belgian Gardens, Magnetic Island, Manton, and Townsville War Cemetery. The dataset includes attributes for cemetery, section, plot, monument, and interment type. It is published by the Townsville City Council and was last updated in March 2026.
DeepWiki Public Repo Reviews is a single-turn instruction dataset of full technical summaries for GitHub repositories, sourced from the AI-generated wiki platform DeepWiki. The dataset contains 6,920 archived repository summaries, each providing a long-form answer covering architecture, components, data flows, APIs, and implementation details. It was created by author 'nisten' and was last updated on April 15, 2026.
Histological analysis for cFos, pPDH, and MOR expression in striatum across saline, 0.5x, 1x, 2x, and 10x IBU. The dataset was authored by Alexis Salcido and last updated on May 19, 2026. It is hosted on the Harvard Dataverse platform.
Hai-Hu Wen deposited all source data for the main text and supplementary information of a study on enhanced superconductivity. The dataset is 705.1 KB in size and was last updated on April 10, 2026. It is available under a CC-BY-4.0 license.
Western Australia's Department of Planning, Lands and Heritage maintains a list of places with cultural heritage significance established under local planning schemes. The dataset identifies properties requiring special planning approval for demolition, additions, and other works.
A 2013 compilation of short animations describes the key processes involved in tsunami generation. The video content is hosted by the Australian Ocean Data Network and is tagged for topics including earthquakes, coasts, and marine hazards. Its primary use is for geoscience education and public awareness of natural hazards.
A dataset from the Government of Alberta, last updated on 2026-03-18, presenting age- and sex-specific dispensation rates for antidepressant and antianxiety medications. The data is expressed per 1000 population and covers Alberta, nine health regions, and five Alberta Health Services Continuum Zones. It likely contains rates derived from community pharmacy prescription fills, categorized by Anatomical Therapeutic Chemical (ATC) names.
SporTabSet is a collection of sports commentary data for basketball and cricket, packaged by author ritup3 for Hugging Face. The dataset includes finalized basketball commentary, temporal partitions for basketball, and ODI and T20 cricket variants with temporal subsets. It was last updated on April 20, 2026.
A press kit document for the International Space Station's Expedition 04 mission from December 2001 to June 2002. The document was published by the National Aeronautics and Space Administration (NASA) and contains information about the mission overview, crew, timeline, benefits, and media contacts. The dataset is stored in XML format and was last updated on the platform in March 2026.
Polish articles from various media outlets, judged for subjectivity using four large language models. The dataset includes individual model reviews with reasoning and scores, as well as an aggregated average. It was created by Jakub Legutko and last updated on February 23, 2026.
270 high-resolution conductivity, temperature, and depth records were collected from 9 oceanographic casts during a 1994 NOAA cruise. The National Ocean Service submitted the data, which includes profiles of salinity, density, and dissolved oxygen at fine depth intervals. Measurements span a one-month period from May 20 to June 23, 1994.
Hydrocarbon, trace metal, and grain size data were collected from the Gulf of Alaska, specifically Cook Inlet, during two research cruises. The dataset includes analyses for water and sediment samples, submitted by Dr. Douglas Segar of the University of Alaska, Anchorage. Measurements were taken between June 22 and August 25, 1993.
IMDb's Top 250 list provides a curated ranking of the highest-rated films and television series. The dataset is described as clean and structured, containing features for each of the 250 titles. Its origin and specific features are not detailed in the provided metadata.