Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,939 datasets
Survey data from 22,887 U.S. adults aged 20+ from the National Health and Nutrition Examination Survey (NHANES) 2007β2018, used to examine associations between physical activity patterns and arthritis/osteoarthritis prevalence. The dataset was created by Weibao Zhang and last updated on 2026-06-03. It categorizes participants into four activity groups and uses survey-weighted logistic regression to analyze odds ratios.
Approximately 100,000 sample records of seabed and sub-seabed sediments from Australia's marine jurisdiction, including the Australian Antarctic Territory. The MARS database, managed by the Australian Ocean Data Network, contains analytical properties like grain size, carbonate content, and geochemistry. New data are added as they become available.
A 2026 study by Seunghyong Ryu provides transcripts from 249 patients with schizophrenia and 159 healthy controls across eight speech tasks. The dataset includes token-by-token surprisal metrics derived from Korean language models, analyzing the dynamic breakdown of contextual coherence. The data is hosted on figshare under a CC-BY-4.0 license.
A psychometric validation dataset for the Ukrainian version of the Dissociative Subtype of PTSD Scale (DSPS). The data includes responses from 1,119 trauma-exposed Ukrainian participants, collected via an online study using convenience and snowball sampling. The dataset was created by Anton Kurapov and last updated in May 2026.
Experimental data from a study investigating the role of the palmitoyltransferase DHHC7 in sperm function. The dataset includes results from immunofluorescent staining, sperm motility analysis, and measurements of protein palmitoylation, calcium levels, and reactive oxygen species in mouse and human sperm. It was authored by Haixia Zheng and last updated on June 1, 2026.
The NSW Department of Planning and Environment acquired bathymetry and backscatter data for the Solitary Islands Gumbaynggirr Yaegl Marine Park between 31 August 2022 and 31 July 2023. This dataset contains 5-meter resolution 32-bit floating point geotiff files derived from multibeam sonar surveys conducted onboard the RV Bombora. The data was processed as part of the SeabedNSW program to provide a baseline and map seabed types.
3.8 GB of dielectric function data generated for a materials screening study. The data includes traces of dielectric functions computed with PBE and TASK methods, sampled at 0.1 eV intervals from 0 to 20 eV. The dataset was created by Pedro Borlido and colleagues, with funding from FCT, and was last updated in May 2026.
A bathymetry survey conducted from 27 Feb 2019 to 14 Oct 2020 by the NSW government's Department of Planning and Environment. It provides 5m resolution geotiff files of seabed depth and backscatter for the Forster, Cape Hawke to Black Head area, processed from multibeam sonar data. The dataset was created as part of the SeabedNSW program funded by NSW Coastal Reforms and the HabMap Program.
5.5 KB of parameters for steel plate hoop-bolted connections in prefabricated structures. The dataset, authored by Zhiyuan Gao and last updated in May 2026, supports a comparative fragility analysis of six-story prefabricated and cast-in-place frame structures. It was used to validate finite element models developed in SAP2000 against quasi-static test results.
Concrete properties data supports a comparative fragility analysis of prefabricated and cast-in-place frame structures. The dataset was authored by Zhiyuan Gao and last updated on 2026-05-27. It is used to compare a two-parameter damage model with traditional drift-based seismic assessment methods.
Zhiyuan Gao's dataset contains detailed parameters for specimens used in a seismic fragility study of prefabricated structures. The data supports a comparative analysis between a two-parameter damage model and a traditional drift-based index for six-story frame structures. The dataset was last updated on 2026-05-27.
A 5.5 KB Excel file contains data from a study comparing seismic fragility assessment methods for prefabricated structures with steel plate hoop-bolted connections. The dataset, authored by Zhiyuan Gao and last updated in May 2026, likely includes results from incremental dynamic analysis (IDA) and finite element models developed in SAP2000. The analysis compares a traditional drift-based index with a two-parameter damage model for six-story prefabricated and cast-in-place frame structures.
Zhiyuan Gao published a dataset of 20 basic ground motion information records on figshare in May 2026. The data supports a comparative fragility analysis of prefabricated and cast-in-place frame structures using incremental dynamic analysis. The dataset is stored in an XLS file with a size of 9.5 KB.
101 representative texts of user notes and comments about AI presenters, collected from the Xiaohongshu platform. Yuan Wang curated this dataset, applying rigorous cleaning to remove promotional and non-substantive content. The final sample, updated in May 2026, supports a critical discourse analysis grounded in hyperreality theory.
An updated network meta-analysis of 20 randomized controlled trials, including 127,267 patients, published on figshare in May 2026. The study compares the risk of intracranial hemorrhage between direct oral anticoagulants and Vitamin K Antagonists. The supplementary document contains the detailed results and methodology for this systematic review.
NASA's Crustal Dynamics Data Information System provides a derived product set of Global Navigation Satellite System final orbit and reference frame data. Analysis Centers of the International GNSS Service produce daily satellite and ground receiver clock values, orbits, and Earth rotation parameters, considered the most consistent and highest quality IGS solutions. The dataset includes data from GPS, GLONASS, and, since 2011, other systems like Galileo, Beidou, QZSS, IRNSS, and SBAS.
Alpha-Fe2O3, the most common form of iron(III) oxide, has a rhombohedral corundum structure and occurs naturally as the mineral hematite. The dataset describes its antiferromagnetic properties below ~260 K and weak ferromagnetism up to 950 K, with magnetic properties dependent on pressure, particle size, and magnetic field. Authored by Green house and shared under a CC-BY-4.0 license, this 483.2 KB PDF was last updated in June 2026.
Atilla Topcu authored a research article investigating the protective effects of Longjing tea against gastric ulcers. The study, published on figshare under CC-BY-4.0, examines oxidative stress, inflammation, and apoptosis in an animal model. The dataset consists of a 132.4 KB PDF file last updated in May 2026.
A 2018 study presents a method for generating continental-scale pixel-based surface reflectance composites in coastal regions, addressing the challenge of tidal influences on satellite imagery. The approach uses a multi-resolution tidal model and a Voronoi mesh to capture spatial tidal variation and preserve spectral band relationships. The resulting composites, including mosaics of the Australian coastline at high and low tide, are designed for coastal change detection and environmental monitoring applications.
The Australiaβs Future Energy Resources project investigated energy resource potential in the Pedirka and western Eromanga basins region. It provides geological interpretations based on new biostratigraphic and reprocessed seismic data, leading to a review of basin definitions. The dataset, published by the Australian Ocean Data Network, was last updated on 2026-06-05.