DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,804 datasets
A machine learning model for predicting Alzheimer's disease risk developed using data from 666 participants from the Alzheimer's Disease Neuroimaging Initiative (ADNI). The FLAME scorecard was externally validated on 4,876 participants from the National Alzheimer's Coordinating Center (NACC) and integrates cognitive measures, daily functioning, and demographics. The research was authored by Yumiko Wiranto and published on figshare in April 2026.
226.5 KB of research data from a study investigating the effect of exogenous γ-aminobutyric acid (GABA) on 2-acetyl-1-pyrroline (2-AP) biosynthesis in pumpkin seedlings. The dataset, authored by Jingjing Chang and last updated in April 2026, likely contains measurements of metabolite contents, enzyme activities, and gene expression levels. The study reports a 72.4% increase in 2-AP content and a 7.8-fold increase in glutamate following GABA treatment.
Ke Li's 2026 study provides a detailed economic analysis of intensive Dictyophora rubrovolvata cultivation in China. The dataset includes a total unit production cost of 204.82 CNY/kg, with cost breakdowns for capital depreciation and labor. It was created using Environmental Life Cycle Costing and Monte Carlo simulation based on data from a commercial facility.
City of York Council (CYC) Internal Audit Reports for 2023, as presented to the Audit and Governance Committee and published on the York Open Data platform. The reports are grouped by calendar year and published as PDF files. The dataset is licensed under the UK Open Government Licence.
Student enrollment data from Colombia's national open data portal, www.datos.gov.co, last updated on 2026-05-18. The dataset tracks counts of students across categories such as Inscrito (registered), Admitido (admitted), Primiparo (first-time enrolled), and Matriculado (total enrolled). It is structured by academic period, program, modality, methodology, and schedule.
Colombian higher education data on student counts across admission and enrollment stages. The dataset includes figures for students who applied, were admitted, were first-time enrollees, and total enrollees, broken down by academic program, modality, methodology, and schedule. It is sourced from the Colombian open data portal, datos.gov.co, and was last updated on May 18, 2026.
24 CPP genes identified in the peanut genome show tissue- and stress-specific expression patterns. The dataset, created by Hongzhan Liu and last updated in April 2026, includes results from bioinformatics identification, synteny comparisons, and transcriptomic analysis. It provides a foundation for functional investigations into drought and salt stress responses in this crucial industrial crop.
A geospatial dataset from the U.S. Environmental Protection Agency's Facility Registry Service, last updated in April 2026. It contains location and identification information for facilities that have submitted Risk Management Plans for handling flammable or toxic substances, as mandated by the Clean Air Act. The data is centrally managed by the EPA, integrating information from national program systems and other federal and state sources.
A subset of the EPA's Facility Registry System containing location and identification information for facilities that link to the Risk Management Plan database. The RMP database stores plans from companies handling flammable or toxic substances, as required by the Clean Air Act. This dataset is integrated from EPA national programs, other federal agencies, and state and tribal records, and was last updated on 2026-04-12.
More than 650 toxic chemicals are tracked in this subset of the EPA's Facility Registry System, linking facilities to the Toxic Release Inventory. The data provides location and identification information for industrial and federal facilities that report on chemical use, manufacturing, treatment, transportation, and environmental releases. It is maintained by the U.S. Environmental Protection Agency and was last updated in April 2026.
A subset of the EPA's Facility Registry System containing location and identification data for facilities subject to oil spill prevention and response regulations. The data is integrated from EPA national programs, other federal agencies, and state and tribal records, providing a centrally managed source. These facilities are designated 'substantial harm' facilities because of the quantities of oil they store and their specific characteristics.
University of Quindio data on incoming and outgoing mobility, both nationally and internationally. The dataset includes columns for Institution of Origin, Type, Position, Period, Faculty/Dependency, Origin, Mobility, City of Origin, Program/Office, and Destination. It was last updated on 2026-05-18 and is hosted by the Colombian open data portal www.datos.gov.co.
Opra (Operational Risk Appraisals) scores categorized the environmental risk of UK industrial installations and waste operations. The scheme assessed risk based on Complexity, Emissions, Location, Operator Performance, and Compliance, each graded A-E (or A-F for Compliance). The Environment Agency published full profiles from 2014 to 2017, after which the scheme was withdrawn.
CYC Internal Audit Reports 2021 are documents presented to the Audit and Governance Committee of City of York Council. The reports are published as PDF files on the York Open Data platform and are grouped by calendar year. The dataset is made available under the UK Open Government Licence.
507.1 KB of underlying numerical values for figures in a PLOS Biology study. The data, provided by Qing-Xue Sun in an XLSX file, contains individual biological replicate measurements used to calculate summary statistics and error bars for figures 2 through 5 and supplementary figures S3, S6, S8, and S10.
KEGG pathway enrichment results derived from a multi-omic analysis of human brain samples by the PsychENCODE consortium. The 5.5 KB XLS file contains results from three analytical approaches applied to transcriptomic data from individuals with schizophrenia and healthy controls. The dataset was authored by Costas Bampos and last updated on April 15, 2026.
Costas Bampos published GO enrichment results on 2026-04-15. The dataset contains functional enrichment analysis outputs from a multi-method study of PsychENCODE transcriptomic data focused on schizophrenia. The 5.5 KB XLS file likely contains Gene Ontology and KEGG pathway enrichment scores for genes identified through co-expression network and dimensionality reduction analyses.
The PsychENCODE consortium generated transcriptomic data from human brain samples. This dataset contains Gene Ontology functional annotation results from a multi-method analysis of schizophrenia-associated gene modules. The 9.5 KB XLS file was authored by Costas Bampos and last updated on April 15, 2026.
An Excel file containing transcriptomic data from a study comparing neonatal and adult immune responses to Bordetella pertussis. The 725.8 KB dataset, authored by Soraya Matczak and last updated in April 2026, profiles an ex vivo whole-blood infection model. It likely contains gene expression data revealing a hyperinflammatory cytokine signature and B cell remodeling in cord blood.
Excel files containing cytokine and multi-omic data from an ex vivo whole-blood infection model comparing neonatal and adult immune responses to Bordetella pertussis. The dataset, authored by Soraya Matczak and last updated in April 2026, is shared under a CC-BY-4.0 license on figshare. Its 20.8 KB size indicates a focused, rather than large-scale, experimental dataset.