Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,724 datasets
A 2026 scoping review by Areeba Shahid analyzes 60 sources from 2000 to 2025 on ethical, regulatory, and implementation barriers to AI in healthcare for low- and middle-income countries. The review, following PRISMA-ScR guidelines, maps literature from PubMed, Scopus, and Cochrane Library alongside global health policy reports. It reports that 7.4% of LMICs have national AI strategies and over 60% of AI models rely on non-representative datasets.
Evaluation reports from Global Affairs Canada's Maternal, Newborn and Child Health Initiative covering the period 2010–11 to 2017–18. The reports serve as a practical management tool for reviewing program performance and improving future initiatives. The dataset is published under the OGL-CA-2.0 license.
859,651 CpG methylation sites were measured using the Illumina EPIC V2 platform for 594 blood samples from 297 Norwegian adults. Jon Bohlin authored this longitudinal study comparing pre-pandemic (2020) and pandemic-era (2023) samples from Long-COVID, COVID-19, and uninfected groups. No significant persistent epigenetic differences related to infection or Long-COVID were detected in this cohort.
34 dereplicated metagenome-assembled genomes (MAGs) derived from anaerobic enriched consortia, authored by Georgia Vayena, Ginevra Giangeri, Marie Karen Tracy Hong Lin, Raphaëlle Péguilhan, Antonio Grimalt-Alemany, and Irini Angelidaki. The MAGs describe a low-diversity anaerobic microbial community adapted to butyrate, with each genome estimated to be at least 70% complete and have less than 10% contamination.
De-identified clinical patient tables and processed TCGA data support a study on PLCG2 lipid-metabolic epithelial cells in HER2-positive gastric cancer. The dataset includes PLCG2-related statistical worksheets and protein expression scoring analysis tables used for figure plotting. Huang Qi'ao'qiao published this supplementary raw and analytical data on figshare in June 2026 under a CC0-1.0 license.
Australia's Fundamental Gravity Network (AFGN) is being modernized to support resource exploration, geodesy, and environmental studies. The work involves cleaning station data, expanding coverage in under-served regions, and investigating new gravimeter technologies. Presented at the 2025 Australasian Exploration Geoscience Conference (AEGC), the data is managed by Geoscience Australia.
Quebec's geodetic network contains approximately 112,000 landmarks grouped under six themes, including permanent GNSS stations and planimetric networks. The vector data includes fields for original number, service number, state of the point, and its theme. A separate network of permanent calibration bases for electronic rangefinders is established across eight main municipalities in the region.
Evaluation reports for the Innovation Platform for Maternal, Newborn and Child Health (IP4MNCH) program. The reports are generated by Global Affairs Canada to review program performance and improve future initiatives. The dataset is published under the OGL-CA-2.0 license and was last updated on 2026-05-26.
Acoustic Doppler Current Profiler data from a 27-day voyage monitoring the East Australian Current. The CSIRO National Collections and Marine Infrastructure processed the data collected by the RV Investigator between Hobart and Brisbane in May-June 2021. Both OS75 and OS150 ADCPs operated in narrowband mode, with transducers located approximately 8.0m below the water line.
Supplementary files to run predictors for a genomics dataset. The dataset likely contains data on the impact of half a million mutations on the alternative splicing of 600 human exons. It was authored by Gioia Quarantani and last updated on May 23, 2026.
Southern Ocean southwest of Tasmania is the region for this Acoustic Doppler Current Profiler (ADCP) dataset from the RV Investigator voyage IN2024_V02. The data, collected between March 31 and April 19, 2024, measures ocean currents using RDI Ocean Surveyor instruments in narrowband and broadband modes. It was processed with the UHDAS and CODAS systems and archived by CSIRO's National Collections and Marine Infrastructure.
1983-2017 biological data includes 20,047 length records, 8,915 weight records, and 2,212 age records for American eels. The collection also contains 5,814 electrofishing session records from New Brunswick rivers (1952-2019) and 1,838 otolith images. Data was compiled by D.K. Cairns (2020) for a potential range-wide stock assessment, with a focus on Canada's Atlantic Provinces.
A cohort of 18 Italian children with MIS-C or Kawasaki-like disease was analyzed for genetic risk factors. The study compares ABO blood group allele frequencies and rare variants in 207 immune-related genes against control groups of 79 children and 2,848 adults. The data was published by Luisa Ronzoni on figshare in April 2026.
18 individuals were enrolled in a study investigating genetic predisposition to multisystem inflammatory syndrome in children (MIS-C). The dataset likely contains results comparing the frequency of ABO tagging SNPs and rare variants in immune-related genes between patients and controls. The data was published by Luisa Ronzoni on figshare in April 2026.
Vladimir M. Vakhtinskii's research dataset compares expression efficiency and protective activity of circular and linear mRNA vectors. The data includes results from luciferase reporter assays, target protein expression, and both active and passive immunization studies against SARS-CoV-2. The dataset was last updated on April 24, 2026.
Confidential Classified Information was the classification level for these topographic maps, which were primarily intended for military purposes. The data covers the Bernau near Berlin area at a 1:25,000 scale and was produced between 1981 and 1989. It was created by the Bundesamt für Kartographie und Geodäsie and includes two editions: topographic maps (TK) and topographic city maps (TSP).
A retrospective cohort study of 44,609 singleton pregnant women from three medical centers between January 2018 and June 2024. The data likely contains serum uric acid levels measured before 20 weeks of gestation and outcomes for gestational diabetes mellitus (GDM), GDM requiring insulin, and GDM with pre-eclampsia. The study was authored by Qiong Li and published on figshare under a CC-BY-4.0 license.
Genomic analysis data supports the causal association of the fungus Ceratobasidium theobromae with Cassava Witches' Broom Disease (CWBD) in the Philippines. The dataset includes results from field surveys showing disease incidence above 50% and PCR assays with 91.79% sensitivity and 95.24% specificity. It was authored by Cris Q. Cortaga and last updated on April 17, 2026.
Philippine field surveys show cassava witches' broom disease (CWBD) present in all cassava-growing regions, with field incidence above 50%. The dataset contains genome analysis results supporting the causal association of the fungus Ceratobasidium theobromae with CWBD, based on PCR assays with 91.79% sensitivity and 95.24% specificity. It was authored by Cris Q. Cortaga and last updated on 2026-04-17.
53.2 KB of genomic analysis data supporting the causal association of the fungus Ceratobasidium theobromae with Cassava Witches' Broom Disease (CWBD) in the Philippines. The dataset includes results from field surveys showing disease incidence above 50% and PCR assays with 91.79% sensitivity and 95.24% specificity. It was authored by Cris Q. Cortaga and last updated on 2026-04-17.