Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,842 datasets
Research data collected in support of a study to determine whether exposure to welding fumes results in changes in metals and metabolites found in urine samples from welders. The dataset is provided by the Government of Alberta and was last updated on April 17, 2026. It includes files in XLSX and HTML formats.
Annual statistics on the distribution of physicians by specialty based on service events submitted under Alberta's Alternative Relationship Plan (ARP). The table is published by Alberta Health as part of the Alberta Health Care Insurance Plan Statistical Supplement report. The Excel version was last updated on April 17, 2026.
Usman Ali published a dataset on figshare in April 2026 comparing model performance across four locations. The dataset, stored in a 9.5 KB XLS file, presents R-squared and Mean Absolute Error metrics as mean ± standard deviation for different train-test split ratios. The standard deviation indicates the internal consistency of predictions within each split ratio.
A 9.5 KB Excel file details the GermVarX workflow for joint germline variant discovery in whole-exome sequencing cohort studies. Thao Thi Phuong Nguyen authored this documentation, which was last updated in April 2026. It describes a modular Nextflow-based pipeline integrating GATK HaplotypeCaller and DeepVariant for variant calling.
New York City polygon-based spatial data represents the shape, location, and identity of Tax Lots from the Department of Finance Digital Tax Map. The dataset is extracted from DOF's internal system on the last Friday of each month and refreshed on ArcGIS Online on the 1st. It is provided by the New York City Department of Finance via data.cityofnewyork.us.
A 9.5 KB XLSX file containing mudstone water-induced softening test results. The data includes water absorption rate variation over time and corresponds to a figure in a published research article. The dataset was authored by Chi Li and last updated on April 30, 2026.
Pierre Darlu's dataset on figshare tracks the distribution of 60 surnames and their variants. It contains counts of births per surname for two periods: before the plague (1689–1720) and after (1721–1789). The data includes annualized figures, differences, and ratios between the periods, and notes surnames associated with plague deaths.
1689–1789 baptism records from Martigues, France, listing Type 1 geohapax surnames. The dataset includes counts of baptized individuals in two periods (1689–1720 and 1721–1789) and their occurrences in municipalities within Bouches-du-Rhône. Author Pierre Darlu published the data on figshare under a CC-BY-4.0 license.
11.6 KB of raw data supporting a figure in a published research article. The dataset, authored by Xuanying Shen and last updated in April 2026, contains reflectance outlier percentages categorized by slope classes. It is shared under a CC-BY-4.0 license on the figshare platform.
Raw data supporting a correlation analysis between reflectance and the cosine of the incidence angle for November, as shown in a specific research figure. The dataset is a 3.0 MB XLSX file authored by Xuanying Shen and shared under a CC-BY-4.0 license. It was last updated on April 30, 2026.
Raw data for the April reflectance–cos(i) correlation analysis shown in Figure 3 of a PLOS ONE article. The dataset is a 3.0 MB XLSX file authored by Xuanying Shen and shared under a CC-BY-4.0 license on figshare. It was last updated on April 30, 2026.
A genome-wide analysis identified 89 NBS-LRR genes in the sponge gourd (Luffa cylindrica) plant. The research, conducted by Xiaolin Yang and published in 2026, classifies these genes into seven subfamilies and analyzes their roles in resistance to Fusarium wilt and ToLCNDV. The work provides foundational data for molecular breeding.
Australian geoscience research is compiled in this volume of the AGSO Journal, containing six peer-reviewed articles. The studies cover diverse geological topics, including biostratigraphy of the Marion Plateau, tectonics of the Queensland Trough, and geochemical surveys in the Northern Territory. This collection likely contains detailed findings, maps, and data from specific field studies across Australia.
Replication data for a study on the relationship between college majors and earnings growth, accepted by the Journal of Labor Economics in 2025. The data package was authored by Woosuk Choi and last updated in June 2026. Its specific geographic scope is not detailed in the provided metadata.
Species belonging to the superfamily Orthotetacea form an important part of the Permian marine faunas of Western Australia. The dataset describes four genera and at least 23 species, many of which are useful for stratigraphical correlation. It is hosted by the Australian Ocean Data Network and was last updated in April 2026.
Paired RNA-sequencing and liquid chromatography-tandem mass spectrometry (LC-MS/MS) data from spatially delineated intra-tumour subpopulations in three human pancreatic ductal adenocarcinoma (PDAC) tumours. The dataset, created by Yong Chiang Tan and last updated on 2026-05-18, was generated using intact-protein MALDI imaging mass spectrometry to isolate subpopulations for downstream paired proteotranscriptomic profiling. It aims to address limitations of conventional bulk proteomics and transcriptomics in capturing post-transcriptional regulation within distinct tumour microenvironments.
Billboard Hot Latin Songs weekly chart data from 1986 to 2024, compiled by researcher Diego Olivares and hosted on Harvard Dataverse. The dataset includes chart performance metrics and has been enriched with artist gender, ensemble type, country of origin, and collaboration type variables. A complementary dataset provides genre classifications for individual songs sourced from iTunes.
Additional file 1 contains supplementary tables S1-S8 for a genome-wide association study on tuberculosis susceptibility. The dataset, published by Xuling Chang on figshare under a CC-BY-4.0 license, was last updated on 2026-05-27. Its 28.2 KB size suggests a limited scope, likely containing summary statistics or detailed results from the associated research paper.
Primers and probes used in experiments to study the role of ZmSKIP and ZmBAG8 genes in drought tolerance in maize (Zea mays L.). The dataset is shared by author Yao Wang on figshare under a CC-BY-4.0 license and was last updated in April 2026. The 197.5 KB XLS file contains the molecular tools for the described genetic assays.
87.8 MB of bimolecular fluorescence complementation (BiFC) images supporting a study on the ZmSKIP protein's role in drought tolerance in maize. The dataset, published by Yao Wang on figshare, includes images from experiments showing protein-protein interactions and phosphorylation events under drought stress. It was last updated on 2026-04-13.