Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,925 datasets
Preliminary records of fatal external cause injuries occurring in Colombian territory from January 2025 to March 2026, as registered by the National Institute of Legal Medicine and Forensic Sciences. The dataset includes 30 columns detailing victim demographics, incident circumstances, and forensic classifications. It was last updated on May 18, 2026, on the datos.gov.co platform.
A metadata record describes the structure of "notary fees" for used real estate purchases in France. The dataset likely contains parameters and rates for the four main cost components: transfer tax, notary fees, disbursements, and a security contribution. The data is sourced from French legal decrees and tax codes and was last updated in May 2026.
The Moderate Resolution Imaging Spectroradiometer (MODIS) Near Real Time (NRT) collection comprises over 100 distinct Level-1 to Level-3 geophysical data products from NASA's Terra and Aqua satellites. These products provide daily, global observations of surface reflectance, land/sea surface temperature, aerosol optical depth, cloud properties, vegetation indices, snow/ice cover, thermal anomalies/fire, and atmospheric profiles. Data are delivered in HDF-EOS format at spatial resolutions ranging from 250 meters to 10 kilometers, with a temporal latency suitable for monitoring and rapid response applications.
A retrospective two-center study from 2010 to 2022 includes 143 patients aged ≤1 year undergoing biventricular aortic arch reconstruction. The dataset compares short- and mid-term clinical and computational morphological outcomes between autologous pericardial patches (n=81) and pulmonary artery patches (n=62). It was authored by Qi Jiang and published on figshare under a CC-BY-4.0 license.
A 4.5 MB PDF dataset published on figshare by Alejandro Salgado on 2026-05-28. It describes BlinkFusion, an open-source Python platform for quantifying labeling efficiency and photophysical properties in fluorescence microscopy, including super-resolution STORM. The platform provides metrics for experimental optimization.
An integrated clinicopathologic dataset of 229 thoracic and head-and-neck NUT carcinoma cases with definitive fusion annotations, compiled from 109 studies. The data includes lineage, fusion partner, and immune biomarker status such as PD-L1, MSI, and TMB. It was created by Shuang Xiang and last updated in May 2026.
4.6 KB of data from a study reporting the design and characterization of targeted degraders for testis-specific kinases TSSK1 and TSSK2. The dataset includes quantitative cellular profiling results, such as a DC50 of 10 nM for compound 5.1, and ex vivo sperm function assays showing up to an 80% reduction in TSSK2 and a 97% reduction in motility. The data was authored by Jerrett A. Holdaway and last updated on 2026-06-02.
Experimental data from a soft pneumatic robotic system designed to replicate clinically inspired tactile stimulation for neonatal therapy. The dataset includes force measurements from a neonatology specialist, actuator pressure relationships, and performance results for two control strategies. It was authored by Yarilenka Benites-Mozo and last updated on 2026-05-28.
Supplementary file 1 contains experimental data for a soft pneumatic robotic system designed for preterm infant tactile therapy. The dataset likely includes force measurements, pressure readings, and control performance metrics from a 3x3 actuator matrix. Yarilenka Benites-Mozo published the data on figshare in May 2026.
Between 2015 and 2023, 17,191 adults from the Kailuan study underwent pulmonary function testing. The dataset contains cross-sectional analysis results linking Life's Essential 8 cardiovascular health scores to pulmonary function impairment, authored by Yanhui Deng and published on figshare in 2026.
Data from the Urban Development Program dashboards covers the Greater Sydney region, which includes 33 local government areas such as Bayside, Blacktown, and Sydney. The dataset is provided by the NSW Department of Planning, Housing and Infrastructure and was last updated on 2026-06-03. Data is also available for other regions including Illawarra-Shoalhaven, Central Coast, Greater Newcastle, Upper Hunter, and the North Coast.
Preliminary data from January 2025 to March 2026 on non-fatal external injuries recorded by Colombia's National Institute of Legal Medicine and Forensic Sciences. The dataset contains case-level information for incidents known to the forensic medical system. It is published by the Colombian government via datos.gov.co to inform public policy and decision-making.
Indonesian children aged 9–12 years were surveyed to develop a culturally relevant psychometric instrument. The dataset contains responses from 514 elementary school children, split into independent samples for exploratory and confirmatory factor analysis. The instrument was developed and validated by Indri Utami Sumaryanti, with data last updated in May 2026.
A 2026 psychometric instrument developed and validated for assessing psychological readiness for puberty in Indonesian children aged 9-12 years. Created by Indri Utami Sumaryanti, the dataset includes responses from 514 elementary school children, split into independent samples for exploratory and confirmatory factor analysis. The instrument demonstrates a three-factor structure with high internal consistency and cross-gender applicability.
A dataset of 148 annotated images of Chinese timber architectural elements, used for AI-generated content research. The dataset was created by Qianru Yang and last updated on June 1, 2026. It focuses on the Qiqushan Great Temple in Sichuan and includes four element categories: eave decorations, arch structures, interior beams, and pattern motifs.
A dataset from figshare authored by Ziyu Zhang, last updated in June 2026, describes novel photosensitizer compounds for cancer photodynamic therapy. The data likely contains measurements for two selenium-incorporated azaBODIPY derivatives, including their near-infrared absorption properties and reactive oxygen species generation efficiency. In vivo results from a mouse tumor model are also referenced.
A seamless topographic colour map covering all of Australia, its outer islands, external territories, and the Australian Antarctic Territory. The service integrates data from Geoscience Australia, the Australian Antarctic Division, OpenStreetMap, and other sources, with some data checked in 2008-2009. It portrays cultural, hydrography, marine, transport, vegetation, and relief themes without labels.
Simulation code and supplementary materials for a 2026 paper on distributed estimation methods. Xirui Liu authored this research, which proposes two CPOU methods for handling non-randomly distributed and incomplete data across local machines. The 248.4 KB archive includes R code, PDF documentation, and text files to replicate the study's empirical results.
Data from 24 healthy young adults performing bench press and squat exercises under acoustic biofeedback and generic music conditions. Physiological, kinetic, and kinematic signals were collected at 1000 Hz using wireless biosensors, and perceived enjoyment was rated via questionnaire. The dataset was created by Dania Furk and published on figshare under a CC-BY-4.0 license in May 2026.
73 participants from a nursing home were assessed for presbyphagia and urinary incontinence. The dataset includes results from the 100ml water swallowing test, EAT-10, ICIQ-SF, and King's Health Questionnaire, collected by researcher Ziya Yıldız and last updated in June 2026. It is a small dataset of 5.5 KB stored in an XLS file.