Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
41,487 datasets
One neutrally buoyant float collected continuous temperature and salinity profiles in the eastern tropical Pacific for 3.5 months during the SPURS-2 field campaign. The float, deployed in August 2016 and recovered in December 2016, drifted approximately 1800 km east of the central mooring at 10N,125W. Data from this NASA-funded oceanographic study aims to elucidate mechanisms behind near-surface salinity variations in rainfall-dominated ocean regions.
802 participants performed online and in-person restless bandit tasks across five experiments to study human foraging behavior in reinforcement learning contexts. The dataset, created by Meriam Zid and last updated in May 2026, includes behavioral data and model fits for tasks varying in volatility, option linkage, and the number of alternatives. Data is provided in formats for Python and MATLAB analysis.
A 22.5 KB PDF document authored by Jamie Davis presents a formal mechanical framework describing the universe as a Static Solid-State Matrix. The dataset details concepts like Isotropic Continuum Saturation and Velocity-Phase Solidification, proposing a bridge between theoretical physics and human biophysics. It was last updated on May 1, 2026, and is licensed under CC-BY-4.0.
Zonas de Protección para la Producción de Alimentos (ZPPA) delineate reference polygons for identifying and declaring food production protection areas across the 21 municipalities in the Córdoba department. The dataset includes columns for department, municipality, area in hectares, and geometry, generated in the MAGNA-SIRGAS 2018 coordinate system. It was last updated on 2026-05-18 and is hosted by www.datos.gov.co.
4.2 MB of supplementary PDF files for a 2026 computational chemistry article. The data supports a theoretical investigation of linear, all-trans polyenes using multireference configuration interaction calculations. It includes results for hexatriene, octatetraene, and decapentaene, focusing on their ground and three low-lying excited states.
Data from a study investigating copper-induced oxidative stress on the legume-rhizobia partnership in Lotus japonicus. The dataset, authored by Kathryn Lamoureux and last updated in May 2026, likely contains measurements of plant biomass, copper concentrations, malondialdehyde levels, nitrogen fixation, and ascorbate peroxidase activity under control and copper-treated conditions.
66 English compositions from students at Chinese application-oriented universities form a corpus for analyzing linguistic complexity. Jinhua Zhang collected and analyzed these texts hierarchically at lexical, syntactic, and textual levels. The dataset was last updated on 2026-05-21 and is shared under a CC-BY-4.0 license.
MSR data contains behavioral observations from mirror self-recognition tests conducted on a social group of four beluga whales at the New York Aquarium. The dataset, created by Alexander Mildener and shared on figshare, includes responses to mirror and control conditions. Two whales exhibited self-directed behaviors, with one adult female showing evidence of passing a mark test.
A 10.1 MB research dataset from figshare, last updated April 28, 2026. The dataset supports a proposed method for domain generalization using domain-specific regression with neural network approximation. It includes simulated and real data used to demonstrate the method's performance against existing approaches.
A pilot study by Young Woo Park, uploaded on 2026-05-14, investigates neurochemical biomarkers in mice exposed to high-altitude hypobaric hypoxia and mild traumatic brain injury. The dataset likely contains longitudinal metabolite measurements from the frontal cortex, hippocampus, and cerebellum, collected via 1H-MRS over a 14-week experimental period. It includes results on metabolites such as myo-inositol, total choline, total N-acetylaspartate, and total creatine.
Wanxue Wang authored a review document titled 'Table 1_Immunoengineering in the field of tendon and bone regeneration: immunomodulatory biomaterials, delivery platforms, and preclinical models for chronic diseases.docx'. The document was uploaded to figshare on 2026-05-21 and is licensed under CC-BY-4.0. It systematically summarizes recent paradigm shifts in bone immunoengineering for tendon-bone interface regeneration.
Geoscience Australia's dataset catalogs active Australian hydrogen projects in development, construction, or operation. It includes location data, proponent details, energy sources, production methods, and annual production amounts. The dataset underpins the Australia Hydrogen Opportunities Tool (AusH2) and was produced for the COAG Energy Council’s Hydrogen Working Group in 2019, with updates coordinated via HyResource.
Geoscience Australia collected 126 seabed sediment samples in Jervis Bay, New South Wales, across marine surveys from 2007 to 2009. Data includes bathymetric mapping, biogeochemical sediment analysis, infauna sampling, and underwater video observations concentrated in a 3x5 km grid. Samples were acquired using the MV Kimbla and stored in refrigerated plastic bags.
Buenos Aires Metropolitan Area data from 2021 and 2023 surveys analyzes patterns of intergenerational social mobility by family migratory origin. The dataset, created by Pablo Dalle and last updated in May 2026, applies multinomial logistic regression and social network analysis to compare class attainment among descendants of European, internal, and Latin American migrants. Results show cumulative disadvantages for subaltern ethnic groups but slightly greater short-range upward mobility for children of Latin American immigrants.
A dataset analyzing intergenerational social mobility by family migratory origin in the Buenos Aires Metropolitan Area. It uses pooled data from two probabilistic surveys (2021 and 2023) and applies multinomial logistic regression and social network analysis. The research was authored by Pablo Dalle and last updated in May 2026.
Germany-based qualitative research evaluating an online self-help program for family caregivers of individuals of Turkish descent living with dementia. The dataset includes interview data from eight program participants and eight non-participants, analyzed via structuring qualitative content analysis. The work was authored by Rona Bird and is available under a CC-BY-4.0 license.
Qualitative interview data from 16 participants explores the subjective costs and benefits of an online self-help program for family caregivers of individuals of Turkish descent living with dementia in Germany. The dataset includes interviews with eight program participants at two time points and eight individuals who declined participation. It was created by Rona Bird and published under a CC-BY-4.0 license on figshare in May 2026.
Two controlled heating experiments on Comiso limestone samples, conducted at University College London in February 2016. The dataset includes raw time-series measurements of temperature, acoustic emission amplitude, and counts. Data collection was led by Drs A. Castagna and J. Browning and published in the Journal of Geophysical Research: Solid Earth.
Cross-sectional survey data collected between late 2020 and early 2021 from the NYC Longitudinal Survey of Wellbeing. The dataset explores associations between living with a spouse or partner, material hardships, and mental distress among New York City residents during the COVID-19 pandemic. It was authored by Ao Shen and shared under a CC-BY-4.0 license.
Statistics Canada provides the percentage of sales made directly to clients or customers in the United States over the last 12 months. The data is broken down by the North American Industry Classification System (NAICS), business employment size, type of business, business activity, and majority ownership. The dataset was last updated on June 2, 2026.