Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,944 datasets
A review synthesizes 51 comparative studies evaluating the performance of 62 spatial interpolation methods and sub-methods. It quantifies impacts on estimations using data from 77 cases and classifies 26 methods based on their features to aid selection.
GISTBench contains 4.2 million content engagements from approximately 1,000 anonymized users for evaluating large language models. Created by Facebook, this benchmark tests an LLM's ability to infer user interests from interaction history and ground those inferences in specific evidence. The dataset was last updated in April 2026.
Steam Game Reviews Dataset contains 561,843 English-language user reviews for games on the Steam platform. The dataset likely contains textual feedback and ratings from users. Its author, organization, and last update date are unknown.
250 Hollywood movie scripts compiled by Pratik Kalamkar. The dataset is described as balanced and includes age rating information. The specific source, time period, and script content details are not provided.
Temperature, salinity, pressure, and transmissivity profile data originate from two research cruises aboard the R/V Thomas G. Thompson off northern Oregon and southern Washington. The National Oceanic and Atmospheric Administration collected this data as part of a Department of Energy program investigating pollutant transport and sediment dispersal. This multidisciplinary study focused on coastal upwelling dynamics and phytoplankton productivity distribution.
April to October 1981 data from the Transient Tracers in the Ocean-North Atlantic study, collected aboard the RV Knorr. The dataset covers legs 1-7 of the study, including stations 1-250, and was submitted by R.T. Williams, F. Delahoyde, and K.M. Sanborn of the Scripps Physical and Chemical Oceanographic Data Facility. Data is stored in a non-NODC format across seven original tapes.
Southern Ocean and tropical Pacific data contains surface wave and sea surface temperature measurements collected from the R/V Shiyan #3 research vessel. The dataset was submitted by the South China Sea Institute of Oceanology as part of the TOGA COARE international project. Observations were recorded during a specific voyage from November 9, 1992, to February 24, 1993.
Fifty-four full-depth hydrographic casts were collected along Line W from the New England shelf toward Bermuda. The dataset includes calibrated CTD measurements of pressure, temperature, salinity, and dissolved oxygen, plus water sample analyses for chlorofluorocarbons (CFCs 11, 12, 113). Data was gathered by NOAA during a 2004 research cruise on the R/V OCEANUS.
High-resolution CTD/STD data from the RV Alpha Helix in the Gulf of Alaska from 1986-12-15 to 1986-12-19. The data, processed to the NODC F022 standard, likely contains nearly continuous profiles of temperature, salinity, density, and possibly dissolved oxygen or transmissivity at depth intervals as fine as 1 meter. Cruise information, position, date, time, and environmental conditions are reported for each station.
Bering and Chukchi Sea data from July 1986 contains high-resolution vertical profiles of oceanographic parameters. The dataset, processed to the NODC F022 standard, likely contains measurements of temperature, salinity, density, and possibly dissolved oxygen or transmissivity. Cruise information, station positions, and environmental conditions at the time of each cast are also reported.
North Pacific Ocean data from NOAA Ship Oceanographer's 1973 cruise provides high-resolution vertical profiles of ocean properties. The dataset, processed to the NODC F022 standard, likely contains nearly continuous measurements of temperature, salinity, density, and possibly dissolved oxygen or transmissivity at depth intervals as fine as 1 meter. Cruise information, station positions, date, time, and environmental conditions at the time of each cast are also reported.
High-resolution conductivity-temperature-depth data from the USNS Thomas Washington provides nearly continuous ocean profiles at intervals as fine as 1 meter. Collected in the North Pacific Ocean during a 26-day cruise in early 1980, the dataset includes temperature, salinity, density, and potentially dissolved oxygen or transmissivity. Each station record contains cruise metadata, position, date, time, and may include concurrent meteorological and sea surface conditions.
Australian waters were surveyed by the Commonwealth Scientific and Industrial Research Organization (CSIRO) aboard the R/V Franklin. The dataset contains Conductivity, Temperature, Depth (CTD) and expendable bathythermograph (XBT) measurements collected between July 6, 1990 and October 18, 1992. Data is structured by cruise station and has been processed and distributed by NOAA's National Centers for Environmental Information.
High-resolution CTD/STD data from NOAA Ship Miller Freeman provides near-continuous vertical profiles of oceanographic parameters. The dataset covers the Bering Sea and Gulf of Alaska during a cruise from January to April 1980. Data records likely contain temperature, salinity, density, and possibly dissolved oxygen or transmissivity at depth intervals as fine as one meter.
NCEI Accession 8400102 contains CTD (Conductivity-Temperature-Depth) and STD (Salinity-Temperature-Depth) data from the R/V SHIJIAN cruise EC-2, a joint U.S./China research effort from November 18-23, 1981. The data, released by Dr. Bob Beardsley of Woods Hole Oceanographic Institution, has been processed by NODC into the standard High-Resolution F022 format. It provides nearly continuous vertical profiles of temperature, salinity, density, and potentially dissolved oxygen or transmissivity at depth intervals as fine as one meter.
High-resolution CTD/STD data from the R/V WECOMA cruise in the North Pacific Ocean provides nearly continuous vertical profiles of ocean properties. The data, processed to the NODC F022 standard, likely contains temperature, salinity, density, and possibly dissolved oxygen or transmissivity at depth intervals as fine as one meter. Cruise information, station positions, and environmental conditions at the time of each cast are also reported.
NorwegianNewsTopics is a tabular dataset containing metadata and topic distributions for online news articles published in 2023. It includes articles from 22 Norwegian news outlets, with topic outputs derived from a 28-topic Latent Dirichlet Allocation model. The dataset was created by Steen Steensen to analyze topic diversity in journalism and enable comparisons across editorial types and platforms.
A public-only research snapshot of events from the TradeCat / btcusdt-um-amplitude-top100 event study chain. The dataset is a filtered subset containing only source_origin=public and review_status in {keep, promote}. It was authored by 123olp and last updated on April 12, 2026.
Greater London Authority provides data on London's Creative Enterprise Zones and other creative areas. The repository contains estimated job counts (2015-2023), business counts (2015-2024), and turnover figures (2015-2023) for creative sectors defined by DCMS SIC codes. Data is aggregated to middle super output areas (MSOAs) and sourced from ONS surveys and business registers.
Numerical simulation data from NCAR's High Altitude Observatory models the effects of ionization on compressible convection in a hydrogen fluid. The experiments study supersonic downdrafts and convective motions, with conditions intended to be similar to the solar convection zone. The dataset likely contains results from 256^3 numerical simulations exploring ionization fractions from 0 to 0.5.