DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Genomics & Bioinformatics Datasets | DataSalon

All Categories

🧬

Genomics & Bioinformatics

DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing

23,807 datasets

TRIP13 Pan-Cancer Multi-Omics Analysis Supporting Oncogenic Role

A pan-cancer multi-omics analysis from figshare investigates the oncogenic role of the TRIP13 gene. The dataset, authored by Yuanqiao Zhao and last updated in April 2026, likely contains expression data, prognostic associations, and immune infiltration correlations across various cancer types. It is a 12.1 MB PDF file licensed under CC-BY-4.0.

TabularGene ExpressionMulti OmicsPan CancerHealthcarePrognostic BiomarkerImmune Infiltration+1

0 views

Genomics & Bioinformatics

GWAS of Post-Harvest Deterioration in Cassava Using Visual and AI Phenotyping

298 cassava accessions were genotyped to identify genomic regions linked to post-harvest physiological deterioration. The study used both human visual scoring and an AI-powered phenotyping method, identifying 6 significant SNPs from each approach. This dataset, authored by Kwame Obeng Dankwa and last updated in April 2026, provides genetic targets for breeding PPD-tolerant cassava varieties.

TabularCassavaPlant PhenotypingHealthcareLarge ScalePost Harvest DeteriorationAi PhenotypingGenome-wide association studySynthetic+1

0 views

Genomics & Bioinformatics

Benchmarking Data for SARS-CoV-2 Subgenomic RNA Detection Workflows

25 synthetic Illumina datasets and a real-world wastewater dataset form a case study for benchmarking bioinformatics tools. The data, created by Gabriele Leoni and last updated in April 2026, simulates shotgun and amplicon sequencing to assess variables like mutation profiles and aligner choice. Results revealed substantial performance variability, highlighting the need for systematic benchmarking to inform workflow selection in public health contexts.

TextSubgenomic RnaSars Cov 2Workflow SelectionHealthcareGenomic SurveillanceBioinformatics BenchmarkingSynthetic+1

0 views

Genomics & Bioinformatics

Melon ABCB Gene Family: 39 Identified Genes and Expression During Bud Development

Melon (Cucumis melo L.) genome-wide identification of the ABCB gene family. The dataset contains 39 identified CmABCB genes, classified into five phylogenetic subgroups, with expression data from RNA-seq and qRT-PCR under IAA treatment. Author Jun Lai published the data on figshare in April 2026.

TabularExcelPlant genomicsGene FamilyFinanceAxillary BudAbcb ProteinsMelon+1

0 views

Genomics & Bioinformatics

Gene Expression Analysis of lncRNAs in Primary Sjögren's Syndrome

A genome-wide gene expression dataset from 126 minor salivary gland samples, comprising 92 patients with primary Sjögren's syndrome and 34 controls. The data was generated by Zhongshan Li via deep stranded total transcriptome sequencing and last updated in April 2026. It includes a constructed competing endogenous RNA network of 3,035 lncRNAs and 10,838 coding genes.

TabularExcelGene ExpressionAutoimmune DiseaseBioinformaticsLncrnaHealthcareGenomics+1

0 views

Genomics & Bioinformatics

Primary Sjögren's Syndrome Gene Expression Data with 126 Patient Samples

Deep stranded total transcriptome sequencing was performed on minor salivary gland samples from 92 patients with primary Sjögren's syndrome (pSS) and 34 non-SS controls. The dataset, authored by Zhongshan Li and last updated in April 2026, contains results from a genome-wide gene expression analysis, including a constructed competing endogenous RNA (ceRNA) network.

TabularExcelGene ExpressionAutoimmune DiseaseBioinformaticsLncrnaHealthcareGenomics+1

0 views

Genomics & Bioinformatics

Gene Expression Analysis of lncRNAs in Primary Sjögren's Syndrome

92 patient and 34 control samples from minor salivary glands were analyzed via deep stranded total transcriptome sequencing. The dataset contains results from a genome-wide competing endogenous RNA network analysis, identifying 3,035 lncRNAs and 10,838 coding genes. It was authored by Zhongshan Li and last updated on 2026-04 15.

TabularExcelGene ExpressionAutoimmune DiseaseBioinformaticsLncrnaHealthcareGenomics+1

0 views

Genomics & Bioinformatics

Gene Expression Analysis of lncRNAs in Primary Sjögren's Syndrome

A genome-wide gene expression dataset from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 from non-Sjögren's controls. The data was generated by Zhongshan Li using deep stranded total transcriptome sequencing and was last updated in April 2026. It includes a constructed competing endogenous RNA network comprising 3,035 lncRNAs and 10,838 coding genes.

TabularExcelGene ExpressionAutoimmune DiseaseLncrnaHealthcareGenomicsSalivary Gland+1

0 views

Genomics & Bioinformatics

Gene Expression and lncRNA Network in Primary Sjögren's Syndrome

126 salivary gland samples from patients and controls were sequenced for a genome-wide transcriptome analysis. The dataset likely contains expression data for coding and noncoding genes, used to construct a competing endogenous RNA network with 3,035 lncRNAs and 10,838 coding genes. Author Zhongshan Li published this research on figshare in April 2026 under a CC-BY-4.0 license.

TabularGene ExpressionAutoimmune DiseaseLncrnaHealthcareGenomicsSalivary Gland+1

0 views

Genomics & Bioinformatics

Primary Sjögren's Syndrome Gene Expression Data from 126 Salivary Gland Samples

Deep stranded total transcriptome sequencing data from 126 minor salivary gland samples, comprising 92 patients with primary Sjögren's syndrome and 34 non-SS controls. The dataset includes genome-wide coding and noncoding gene expression, a constructed competing endogenous RNA network, and in vitro validation results. It was authored by Zhongshan Li, shared under a CC-BY-4.0 license, and last updated in April 2026.

TabularGene ExpressionAutoimmune DiseaseLncrnaHealthcareGenomicsSalivary Gland+1

0 views

Genomics & Bioinformatics

Gene Expression Analysis of lncRNAs in Primary Sjögren's Syndrome

A transcriptomics dataset from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 controls. The data was generated by Zhongshan Li using deep stranded total transcriptome sequencing and last updated in April 2026. It includes a genome-wide competing endogenous RNA network constructed from 3,035 lncRNAs and 10,838 coding genes.

TabularGene ExpressionAutoimmune DiseaseBioinformaticsLncrnaHealthcareGenomics+1

0 views

Genomics & Bioinformatics

Genome-Wide Gene Expression Data for Primary Sjögren's Syndrome

A 2026 study by Zhongshan Li presents gene expression data from 126 minor salivary gland samples, including 92 from patients with primary Sjögren's syndrome and 34 from non-SS controls. The dataset includes results from deep stranded total transcriptome sequencing, differential expression analysis, and a constructed genome-wide competing endogenous RNA network. The research identifies and validates regulatory roles for specific long non-coding RNAs in the disease.

TabularGene ExpressionAutoimmune DiseaseBioinformaticsLncrnaHealthcareGenomics+1

0 views

Genomics & Bioinformatics

Primary Sjögren's Syndrome Gene Expression Data from 126 Minor Salivary Gland Samples

126 minor salivary gland samples from patients with primary Sjögren's syndrome and controls were analyzed via deep stranded total transcriptome sequencing. Zhongshan Li published this dataset under a CC-BY-4.0 license in April 2026. The data supports the construction of a genome-wide competing endogenous RNA network comprising 3,035 lncRNAs and 10,838 coding genes.

TabularGene ExpressionAutoimmune DiseaseLncrnaHealthcareGenomicsSalivary Gland+1

0 views

Genomics & Bioinformatics

EEG and Behavioral Data from Rhythmic Visual Stimulation Study with 43 Participants

43 healthy male participants completed a low-contrast visual search task under rhythmic and arrhythmic flicker conditions. The dataset includes behavioral metrics like reaction time and accuracy, plus high-density electroencephalographic data analyzed for power spectral density across delta, theta, alpha, and beta bands. The data was authored by Hongwei Wang and last updated on 2026-04-15.

TabularTime SeriesNeuromodulationElectroencephalographyNeuroscienceVisual Perception+1

0 views

Genomics & Bioinformatics

Brazilian Sanitation Sector Dividend Policies and Concession Contracts, 2020-2025

50 concession contracts and 11 dividend distribution policies from Brazilian state sanitation companies, collected between July 2020 and August 2025. The dataset was created by Danilo Tavares da Silva to analyze the incorporation of legal restrictions on profit distribution following the 2020 sanitation law update. It includes variables on contract specifications, dividend limitation clauses, monitoring mechanisms, sanctions, and corporate guidelines.

TabularTime Series🇧🇷 BrazilZIPExcelConcession ContractsSanitationCorporate PolicyDividends+1

0 views

Genomics & Bioinformatics

Multi-Omics Data for Pig Carcass Yield and Meat Quality Traits, 162 Samples

A 2026 dataset from 162 pigs provides genomic, transcriptomic, and metabolomic data for integrative analysis of carcass yield and meat quality traits. Genomic data from whole-genome sequencing, RNA-seq from three tissues, and LC–MS/MS metabolomics from loin muscle are included. The data, shared by Patrick Tecku under CC-BY-4.0, supports GWAS, eQTL mapping, and network analyses to investigate genetic and metabolic mechanisms.

TabularMultimodalZIPMulti OmicsHealthcareLivestock BreedingPig GenomicsMeat QualityGwasSynthetic+1

0 views

Genomics & Bioinformatics

AusSeabed: Australian Seabed Mapping Coordination and Data

Australia's marine jurisdiction covers over 10 million square kilometres, with less than 25% of its seafloor mapped at high-resolution. The AusSeabed program is a national coordination consortium facilitated by Geoscience Australia, aiming to reduce duplication and improve data consistency. It focuses on developing a cloud-based data sharing infrastructure, common mapping tools, and standardized geomorphic mapping approaches.

Geospatial🇦🇺 AustraliaOceanographyComputer VisionSeabed MappingLarge ScaleMarine Geology+1

0 views

Genomics & Bioinformatics

Preksha Dhyana Meditation Serum Metabolomics and Lipidomics Data

Bassam Abomoelak's study provides metabolomic and lipidomic analysis of serum samples from 43 participants. The data captures metabolite and lipid concentration levels before and after an 8-week Preksha Dhyana meditation intervention. The dataset was last updated on 2026-04-15 and is licensed under CC-BY-4.0.

TabularLipidomicsBenchmarkHealthcareClinical TrialSerum AnalysisMeditation+1

0 views

Genomics & Bioinformatics

Amazonas Governorate Information Assets Inventory with Governance Metadata

An inventory of information assets managed by the Governorate of Amazonas, Colombia. The dataset includes columns for asset description, responsible offices, availability, confidentiality, and custodians. It was published on the Colombian open data portal and last updated on May 18, 2026.

TabularCSVXMLJSONInformation AssetsData GovernancePublic AdministrationGovernment Inventory+1

0 views

Genomics & Bioinformatics

DNA Damage Response Prediction Model Performance Metrics

Alejandro Leyva's 5.5 KB Excel file contains performance metrics for a cell-level DNA damage response prediction model. The data, last updated in May 2026, reports results from a five-fold cross-validation procedure. Metrics likely include Pearson correlation, Spearman correlation, mean absolute error, mean squared error, and coefficient of determination.

TabularExcelCross ValidationModel PerformanceCell BiologyDna Damage Response+1

0 views

PreviousPage 293 of 1190Next