Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,807 datasets
A pan-cancer multi-omics analysis from figshare investigates the oncogenic role of the TRIP13 gene. The dataset, authored by Yuanqiao Zhao and last updated in April 2026, likely contains expression data, prognostic associations, and immune infiltration correlations across various cancer types. It is a 12.1 MB PDF file licensed under CC-BY-4.0.
298 cassava accessions were genotyped to identify genomic regions linked to post-harvest physiological deterioration. The study used both human visual scoring and an AI-powered phenotyping method, identifying 6 significant SNPs from each approach. This dataset, authored by Kwame Obeng Dankwa and last updated in April 2026, provides genetic targets for breeding PPD-tolerant cassava varieties.
25 synthetic Illumina datasets and a real-world wastewater dataset form a case study for benchmarking bioinformatics tools. The data, created by Gabriele Leoni and last updated in April 2026, simulates shotgun and amplicon sequencing to assess variables like mutation profiles and aligner choice. Results revealed substantial performance variability, highlighting the need for systematic benchmarking to inform workflow selection in public health contexts.
Melon (Cucumis melo L.) genome-wide identification of the ABCB gene family. The dataset contains 39 identified CmABCB genes, classified into five phylogenetic subgroups, with expression data from RNA-seq and qRT-PCR under IAA treatment. Author Jun Lai published the data on figshare in April 2026.
A genome-wide gene expression dataset from 126 minor salivary gland samples, comprising 92 patients with primary Sjögren's syndrome and 34 controls. The data was generated by Zhongshan Li via deep stranded total transcriptome sequencing and last updated in April 2026. It includes a constructed competing endogenous RNA network of 3,035 lncRNAs and 10,838 coding genes.
Deep stranded total transcriptome sequencing was performed on minor salivary gland samples from 92 patients with primary Sjögren's syndrome (pSS) and 34 non-SS controls. The dataset, authored by Zhongshan Li and last updated in April 2026, contains results from a genome-wide gene expression analysis, including a constructed competing endogenous RNA (ceRNA) network.
92 patient and 34 control samples from minor salivary glands were analyzed via deep stranded total transcriptome sequencing. The dataset contains results from a genome-wide competing endogenous RNA network analysis, identifying 3,035 lncRNAs and 10,838 coding genes. It was authored by Zhongshan Li and last updated on 2026-04 15.
A genome-wide gene expression dataset from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 from non-Sjögren's controls. The data was generated by Zhongshan Li using deep stranded total transcriptome sequencing and was last updated in April 2026. It includes a constructed competing endogenous RNA network comprising 3,035 lncRNAs and 10,838 coding genes.
126 salivary gland samples from patients and controls were sequenced for a genome-wide transcriptome analysis. The dataset likely contains expression data for coding and noncoding genes, used to construct a competing endogenous RNA network with 3,035 lncRNAs and 10,838 coding genes. Author Zhongshan Li published this research on figshare in April 2026 under a CC-BY-4.0 license.
Deep stranded total transcriptome sequencing data from 126 minor salivary gland samples, comprising 92 patients with primary Sjögren's syndrome and 34 non-SS controls. The dataset includes genome-wide coding and noncoding gene expression, a constructed competing endogenous RNA network, and in vitro validation results. It was authored by Zhongshan Li, shared under a CC-BY-4.0 license, and last updated in April 2026.
A transcriptomics dataset from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 controls. The data was generated by Zhongshan Li using deep stranded total transcriptome sequencing and last updated in April 2026. It includes a genome-wide competing endogenous RNA network constructed from 3,035 lncRNAs and 10,838 coding genes.
A 2026 study by Zhongshan Li presents gene expression data from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 from non-SS controls. The dataset includes results from deep stranded total transcriptome sequencing, differential expression analysis, and a constructed genome-wide competing endogenous RNA network. The research identifies and validates regulatory roles for specific long non-coding RNAs in the disease.
126 minor salivary gland samples from patients with primary Sjögren's syndrome and controls were analyzed via deep stranded total transcriptome sequencing. Zhongshan Li published this dataset under a CC-BY-4.0 license in April 2026. The data supports the construction of a genome-wide competing endogenous RNA network comprising 3,035 lncRNAs and 10,838 coding genes.
43 healthy male participants completed a low-contrast visual search task under rhythmic and arrhythmic flicker conditions. The dataset includes behavioral metrics like reaction time and accuracy, plus high-density electroencephalographic data analyzed for power spectral density across delta, theta, alpha, and beta bands. The data was authored by Hongwei Wang and last updated on 2026-04-15.
50 concession contracts and 11 dividend distribution policies from Brazilian state sanitation companies, collected between July 2020 and August 2025. The dataset was created by Danilo Tavares da Silva to analyze the incorporation of legal restrictions on profit distribution following the 2020 sanitation law update. It includes variables on contract specifications, dividend limitation clauses, monitoring mechanisms, sanctions, and corporate guidelines.
A 2026 dataset from 162 pigs provides genomic, transcriptomic, and metabolomic data for integrative analysis of carcass yield and meat quality traits. Genomic data from whole-genome sequencing, RNA-seq from three tissues, and LC–MS/MS metabolomics from loin muscle are included. The data, shared by Patrick Tecku under CC-BY-4.0, supports GWAS, eQTL mapping, and network analyses to investigate genetic and metabolic mechanisms.
Australia's marine jurisdiction covers over 10 million square kilometres, with less than 25% of its seafloor mapped at high-resolution. The AusSeabed program is a national coordination consortium facilitated by Geoscience Australia, aiming to reduce duplication and improve data consistency. It focuses on developing a cloud-based data sharing infrastructure, common mapping tools, and standardized geomorphic mapping approaches.
Bassam Abomoelak's study provides metabolomic and lipidomic analysis of serum samples from 43 participants. The data captures metabolite and lipid concentration levels before and after an 8-week Preksha Dhyana meditation intervention. The dataset was last updated on 2026-04-15 and is licensed under CC-BY-4.0.
An inventory of information assets managed by the Governorate of Amazonas, Colombia. The dataset includes columns for asset description, responsible offices, availability, confidentiality, and custodians. It was published on the Colombian open data portal and last updated on May 18, 2026.
Alejandro Leyva's 5.5 KB Excel file contains performance metrics for a cell-level DNA damage response prediction model. The data, last updated in May 2026, reports results from a five-fold cross-validation procedure. Metrics likely include Pearson correlation, Spearman correlation, mean absolute error, mean squared error, and coefficient of determination.