DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
22,526 datasets
Experimental data from a study engineering the algal carbon-concentrating mechanism protein LCIB into tobacco plants. The dataset likely contains measurements of carbon assimilation rates, biomass accumulation, metabolite levels, and photosynthetic performance comparing engineered lines to wild-type plants. It was authored by Mirna Barsoum and last updated on June 4, 2026.
Drake H. Harbert published a transcriptomic analysis comparing sigma-1 (SIGMAR1) and sigma-2 (TMEM97) receptor co-expression architectures. The dataset includes genome-wide Spearman correlation results derived from the GTEx v8 dataset, covering five brain regions with 209 samples in the primary region and 16,225 expressed genes. It was last updated on June 4, 2026.
GTEx v8 data from five brain regions, with 209 samples in the primary region, underpin this genome-wide co-expression analysis of the SIGMAR1 and TMEM97 receptors. Drake H. Harbert computed Spearman correlations across 16,225 expressed genes, revealing divergent top networks despite a shared global architecture. The dataset, last updated in 2026, includes Weighted Jaccard and Cosine similarity metrics.
Over 80% of colorectal cancer cases are linked to mutations in the APC gene. Alfred J. Simmons used bioinformatics to identify shared neoantigen epitopes from this gene's mutational cluster region and tested a self-amplifying RNA vaccine candidate in a mouse model. The dataset, last updated in June 2026, serves as proof-of-concept for an APC-targeted cancer vaccine.
A 2026 study by Alfred J. Simmons presents bioinformatics and in vivo data supporting a cancer vaccine concept. The research identifies shared neoantigen epitopes from the APC gene in colorectal cancer patients and tests a self-amplifying RNA vaccine delivered via virus-like nanoparticles. The dataset includes in vitro T-cell activation results and in vivo immune response data from mouse serum samples.
A research paper detailing the identification and in vivo testing of a shared neoantigen for colorectal cancer vaccination. The study utilized bioinformatics to identify epitopes from the mutational cluster region of the APC gene in multiple patients. It describes the development of a virus-like particle to deliver a self-amplifying RNA replicon encoding the neoantigen and presents results from in vitro T-cell assays and in vivo mouse model experiments.
Over 80% of colorectal cancer cases are linked to mutations in the APC gene. Alfred J. Simmons authored this research data, which details the identification of shared neoantigen epitopes from the APC gene's mutational cluster region and the development of a self-amplifying RNA vaccine candidate. The dataset, last updated on 2026-06-04, provides proof-of-concept for a vaccine approach against APC-associated colorectal cancer.
Over 80% of colorectal cancer cases are linked to mutations in the APC gene. Alfred J. Simmons used bioinformatics to identify shared neoantigen epitopes from the mutational cluster region of the APC gene in multiple patients. The dataset, last updated in June 2026, serves as proof-of-concept for a saRNA-expressed cancer vaccine against APC-associated colorectal cancer.
Longitudinal data from 248 mothers in the GUSTO cohort at 3 weeks (n=205) and 3 months (n=114) postpartum, with 71 matched cases. The dataset relates concentrations of 19 human milk oligosaccharides to maternal sociodemographic, genetic, and obstetric characteristics. It was authored by Han Zhang and shared under a CC-BY-4.0 license on figshare.
Amaya Lopez-Pascual's dataset contains transcriptomic profiling data for epigenetic and metabolic genes in cholangiocarcinoma (CCA). It includes expression data for 257 epigenetic genes, 96 metabolic genes, and 189 rate-limiting enzymes from human iCCA, eCCA, normal bile ducts, organoids, and tumoroids, alongside CRISPR-Cas9 DepMap data. The dataset was last updated on June 4, 2026.
257 epigenetic genes, 96 metabolic genes, and 189 rate-limiting enzymes were examined in transcriptomic data from intrahepatic and extrahepatic cholangiocarcinoma (CCA), normal bile ducts, organoids, and tumoroids. The dataset, created by Amaya Lopez-Pascual and last updated in June 2026, integrates CRISPR-Cas9 screening data and multi-omic profiling from mouse models. It highlights genes linked to poor prognosis and tumor microenvironment subtypes.
A transcriptomic dataset profiling 257 epigenetic genes, 96 metabolic genes, and 189 rate-limiting enzymes in human cholangiocarcinoma (iCCA, eCCA) and normal tissues. The data, authored by Amaya Lopez-Pascual and shared on figshare under CC-BY-4.0, includes CRISPR-Cas9 DepMap viability screens and analyses of tumor microenvironment subtypes. It was last updated on June 4, 2026.
746 mass spectrometry- and spectroscopy-based metabolomics tools across 37 categories are aggregated in this curated database. It tracks structural shifts in the field from 2021 to 2025, including a 2.4-fold increase in machine learning adoption and the rise of Python as the dominant programming language. The dataset was created by Daniel Domingo-Fernández and last updated in May 2026.
Sheela S. Sinharoy presents the ARISE scales, a set of 16 psychometric scales measuring women's empowerment in urban sanitation. The dataset contains cross-sectional survey data from 5,586 women across eight cities in Bangladesh, India, Senegal, Uganda, and Zambia, collected between August 2021 and June 2022. It was developed to provide reliable and valid measures for prioritizing, designing, and evaluating sanitation programs and policies.
A 2026 study by Sebastian Hacker measured serum 25-hydroxyvitamin D levels in 473 German national squad athletes. The research analyzed genetic and non-genetic determinants using a polygenic score and covariates like age, sex, UVB exposure, and supplementation. The full model explained 31.7% of the variance in vitamin D status.
Gargeda State Forest in western Ethiopia is the focus of a study assessing Participatory Forest Management. The research, conducted by Zelalem Telila, used surveys, interviews, and document reviews to evaluate governance performance, socio-economic outcomes, and barriers among Forest User Groups. Reported results include community awareness at 85%, women's participation at 15.6%, and willingness to receive conservation training at 83%.
Pascal Mohamed Mounchid's dataset examines child marriage rates in India and Zambia during the COVID-19 pandemic. It contains quantitative and qualitative data collected from 3,049 adolescent girls aged 13–18 in India and 1,615 aged 15–19 in Zambia between February and September 2022. The analysis focuses on socio-cultural and economic influences, including dowry and bride price practices.
9.5 KB of experimental data from a study optimizing a constructed wetland-microbial fuel cell (CW-MFC) for rural domestic wastewater treatment under low-temperature winter conditions. The dataset, authored by Tuodi Zhang and last updated in May 2026, contains results from a response surface methodology (RSM) experiment varying electrode plate projection coefficient, inter-electrode distance, and external resistance to maximize COD removal efficiency.
Daisuke Tsugama's repository contains processed datasets and code for analyzing codon-mediated gene expression regulation. The 2.2 GB collection includes RNA-seq and Ribo-seq data, predicted expression indices, and observed TPM, mRNA half-life, and protein abundance for Arabidopsis thaliana, Oryza sativa, Homo sapiens, and Mus musculus. It was last updated on May 27, 2026.
City of Melbourne Open Data provides historical readings for soil sensors monitoring salinity, temperature, and moisture in city parks. The dataset contains a large number of records and can be joined to a separate sensor locations dataset using a site ID. An attachment file contains the complete historical data for 2023.
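The join mentioned in the soil sensor entry above could look roughly like the following minimal sketch. The field names (`site_id`, `soil_moisture`, `site_name`) and the sample rows are hypothetical placeholders chosen for illustration; the actual column names and values in the City of Melbourne datasets may differ.

```python
# Illustrative rows only: field names and values are hypothetical,
# not taken from the City of Melbourne datasets.
readings = [
    {"site_id": 101, "soil_moisture": 31.5},
    {"site_id": 102, "soil_moisture": 27.0},
    {"site_id": 999, "soil_moisture": 12.3},  # no matching location row
]
locations = [
    {"site_id": 101, "site_name": "Park A"},
    {"site_id": 102, "site_name": "Park B"},
]


def join_on_site_id(readings, locations):
    """Left-join sensor readings to sensor locations on the shared site ID."""
    # Index locations by site ID so each reading is matched in constant time.
    by_site = {loc["site_id"]: loc for loc in locations}
    joined = []
    for row in readings:
        loc = by_site.get(row["site_id"], {})
        # Readings without a known location keep site_name=None.
        joined.append({**row, "site_name": loc.get("site_name")})
    return joined


joined = join_on_site_id(readings, locations)
for row in joined:
    print(row)
```

A left join is used here so that readings from sites missing in the locations file are kept rather than silently dropped; an inner join would be an equally reasonable choice if only located sensors matter.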