DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Genomics & Bioinformatics Datasets | DataSalon

All Categories

🧬

Genomics & Bioinformatics

DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing

23,848 datasets

Quaternary Aminostratigraphy of Eolianite on Lord Howe Island, Southwest Pacific Ocean

Quaternary Aminostratigraphy of Eolianite on Lord Howe Island, Southwest Pacific Ocean is a dataset from the Australian Ocean Data Network. It contains amino acid racemization (AAR) dating data used to correlate disparate eolianite successions and establish a geochronological framework from the Holocene to the Middle Pleistocene. The data includes D/L ratios for amino acids from land snails and whole-rock samples, defining three aminozones and linking dune deposition to periods of high sea level.

TabularGeochronologyPaleoclimateQuaternaryEolianiteAmino acid racemization+1

0 views

Genomics & Bioinformatics

China-US Mirror Trade Data for Customs-BoP Gap Analysis, 2018-2024

Replication data for a 2026 study testing hypotheses on China's customs-BoP gap using mirror trade data. The dataset likely contains Chinese-reported exports versus partner-reported imports for HS-2 chapters 84, 85, and 87 across 12 major partners from 2018 to 2024. It was created by Brian Peters of Demographics and Global Capital Allocation and updated on May 1, 2026.

TabularChina CustomsEconomic ResearchInternational TradeBalance Of PaymentsMirror Trade+1

0 views

Genomics & Bioinformatics

Geodynamic Synthesis of the Gawler Craton and Curnamona Province in South Australia

Geoscience Australia Data produced a report synthesizing the geodynamic setting, architecture, and age of the Archean to Mesoproterozoic Gawler Craton and Curnamona Province. The synthesis integrates results from geological synthesis, seismic interpretation, sequence stratigraphy, geochronology, and geochemistry. It was last updated on 2026-04-21.

Text🇦🇺 AustraliaGeochronologyGeologyGeodynamicsSeismic interpretation+1

0 views

Genomics & Bioinformatics

XAERDT_L2_VIIRS_SNPP: Aerosol Measurements from Suomi NPP Satellite

45 Science Data Set layers provide aerosol optical depth and related parameters at a 6-km spatial resolution, derived from the VIIRS sensor on the Suomi NPP satellite. This dataset is part of NASA's MEaSUREs project, which applies a consistent Dark Target algorithm across seven geostationary and low-Earth orbit sensors to ensure scientific maturity. Its 6-minute cadence generates approximately 130 data granules during daylight hours from January 2019 through December 2022.

Time SeriesGeospatialAerosol Optical DepthAerosol Optical Depth ThicknessAtmospheric ScienceEarth Science Aerosols Atmosphere Aerosol ParticleViirsBenchmarkSatellite Remote SensingEarth Science Aerosols Atmosphere Aerosol OpticalViirs SnppAerosol Particle PropertiesEarth Science Aerosols AtmosphereNasa Measures+1

0 views

Genomics & Bioinformatics

VIIRS/NOAA20: Aerosol Optical Depth from Dark Target Algorithm

NASA's VIIRS/NOAA20 Dark Target Aerosol L2 product provides satellite-derived measurements of Aerosol Optical Thickness (AOT) over land and ocean. The dataset, part of the NASA MEaSUREs GEO-LEO project, uses a 6-km at-nadir resolution algorithm and is produced at a 6-minute cadence. It contains 45 Science Data Set layers, including geolocation and geophysical parameters, and spans from January 2019 through December 2022.

Time SeriesGeospatialViirs Noaa20Aerosol Optical DepthEarth Science Aerosols Atmosphere Aerosol ParticleBenchmarkSatellite Remote SensingEarth Science Aerosols Atmosphere Aerosol OpticalGeospatial SwathEarth Science AerosolsEarth Science Aerosols Atmosphere+1

0 views

Genomics & Bioinformatics

Fal and Helford European Marine Site: Sublittoral Monitoring Survey 2002

A collation of marine surveys from the Fal and Helford European Marine Site conducted in 2002. The data were gathered by English Nature (Natural England) following established methodologies for environmental evidence. Surveys include condition assessments, verification for recommended Marine Conservation Zones, and investigations of Natura 2000 site features.

TabularGeospatialEnvironmental SurveyBenthic EcologyCoastal MonitoringMarine Biology+1

0 views

Genomics & Bioinformatics

Metabolomics Raw Data for Study on Enhancing the Feed Value of Cassava Residue through Sol

Untargeted metabolomics data from a study on enhancing the feed value of cassava residue through solid-state fermentation. The dataset contains results from OPLS-DA modeling and univariate analysis, identifying 2,642 differential metabolites. It was authored by caizhi wei and shared on figshare under a CC-BY-4.0 license in April 2026.

TabularCassava ResidueSolid State FermentationDifferential MetabolitesFeed Value+1

0 views

Genomics & Bioinformatics

Demographics and Gross External Asset Composition for 193 Countries, 1980-2024

Replication package for 'Demographics, Development, and the Composition of Gross External Assets' by Brian Peters (2026). The dataset is a 193-country panel from 1980 to 2024, containing demographic principal components and gross external asset composition data. It tests whether demographic aging systematically predicts shifts from official reserves toward private portfolio and direct investment.

TabularPanel DataAging PopulationInternational FinanceDemographicsExternal Assets+1

0 views

Genomics & Bioinformatics

Replication Data for: Donor-Side Fiscal Aging and Foreign Aid — Evidence from a 22-Country

Replication data for a 2026 study examining the link between donor-country demographic aging and official development assistance (ODA) cuts. The package includes a 22-country donor panel from 1990 to 2024, a bilateral gravity panel covering 22 donors and 190 recipients from 1995 to 2024, and analysis of three natural experiments. Key findings suggest aged donors compress total ODA uniformly rather than reshaping recipient allocation.

TabularGeospatialPanel DataGravity ModelFiscal PolicyLarge ScaleDemographicsForeign AidSynthetic+1

0 views

Genomics & Bioinformatics

South Walney Lagoons: 2001 Marine Conservation Management Study

A 2001 collation of marine surveys by English Nature to gather evidence for conservation management. The survey purposes include verification for recommended Marine Conservation Zones, condition assessments, and surveys of Natura 2000 site features. Data collection followed established methodologies and standards.

GeospatialMarine conservationEnvironmental SurveyCoastal management+1

0 views

Genomics & Bioinformatics

Lepidoptera Phylogenomic Data: 100 Nuclear Loci from 33 Specimens

A phylogenomic dataset for Lepidoptera, containing refined alignments for 100 nuclear protein-coding loci and a concatenated DNA matrix from 33 specimens. The dataset, created by Jiaxuan Li, also includes a phylogenetic tree output generated by IQ-Tree2. It was last updated on April 18, 2026.

TextZIPNuclear LociHealthcareLepidopteraPhylogenomicsMolecular AlignmentAmplicon Capture+1

0 views

Genomics & Bioinformatics

Italian Marine Monitoring Capacity for Essential Variables

The ITINERIS project analysis from 2023 systematically examines 155 facilities from eight marine Research Infrastructures in the central Mediterranean. It evaluates the capacity to produce 107 recognized Essential Variables, with 50% actively produced and over 90% meeting established requirements. The study provides key recommendations for strengthening Italy's contribution to global environmental monitoring.

TabularTime SeriesEnvironmental scienceOceanographyItalyMarine monitoringMediterranean Sea+1

0 views

Genomics & Bioinformatics

Italian Marine Monitoring Capacity for Essential Variables

155 facilities from eight marine Research Infrastructures actively produce 50% of the 107 recognized Essential Variables (EVs), with over 90% meeting established requirements. The analysis, conducted in 2023 under the ITINERIS project, evaluates the spatial, thematic, and technological coverage of EOVs, ECVs, and EBVs across the central Mediterranean Sea and Italian coasts. The dataset documents a mature network for oceanic and climate variables, while biodiversity observations are less represented.

TabularTime SeriesEnvironmental scienceOceanographyItalyMarine monitoringMediterranean Sea+1

0 views

Genomics & Bioinformatics

Italian Marine Monitoring Capacity for Essential Variables

Simone Toller's 2023 analysis within the ITINERIS project evaluates the capacity of 155 facilities from eight Italian and pan-European marine Research Infrastructures to monitor Essential Ocean, Climate, and Biodiversity Variables. The study finds that 50% of 107 recognized Essential Variables are actively produced, with over 90% meeting established requirements. The dataset is a 65.5 KB PDF report detailing spatial, thematic, and technological coverage in the central Mediterranean Sea and along Italian coasts.

TabularTime SeriesEnvironmental scienceOceanographyItalyMarine monitoringMediterranean Sea+1

0 views

Genomics & Bioinformatics

Italian Marine Monitoring of Essential Variables in the Mediterranean

A 2023 analysis of 155 marine research facilities across the Mediterranean Sea and Italian coasts, evaluating their capacity to produce Essential Ocean, Climate, and Biodiversity Variables. The study, conducted under the ITINERIS project, found that 50% of 107 recognized Essential Variables were actively produced, with over 90% meeting established requirements. It provides recommendations for enhancing coordination among national and European research infrastructures.

TabularTime SeriesOceanographyEnvironmental monitoringItalyMediterranean SeaMarine Biology+1

0 views

Genomics & Bioinformatics

NYC Building Plume Detection from Skyline Imagery, 2013-2015

Ben Steers at New York University's Center for Urban Science and Progress created a dataset of over 1,100 annotated plumes from Manhattan building emissions. The dataset was generated by applying a trained deep convolutional neural network to archival skyline imagery captured at 0.1 Hz by the Urban Observatory. It contains detections of plumes, classified by color, from two periods: October 26 to December 31, 2013, and January 1 to March 13, 2015.

ImageEnvironmental monitoringNew York CityComputer VisionUrban Air PollutionBuilding Plumes+1

0 views

Genomics & Bioinformatics

Detected and Annotated Tandem Repeats in the Human Genome

Results from a bioinformatics process for detecting and annotating tandem repeats in biological sequences. The dataset was authored by Roberto A. Pava-Díaz and is hosted on Harvard Dataverse. It was last updated on June 23, 2026.

TabularHuman GenomeBioinformaticsGenomicsTandem Repeats+1

0 views

Genomics & Bioinformatics

bnAb Escape Sequencing and Scoring Data for HIV-1 Strains

Theodora Hatziioannou's dataset contains variant frequency and mutation scores for HIV-1 strains subjected to selection by broadly neutralizing antibodies 3BNC117 or 10-1074. The data is shared under a CC-BY-4.0 license as a 1.4 MB XLSX file on figshare, last updated in May 2026.

TabularExcelMutation ScoringAntibody EscapeHIV-1Genomic Variants+1

0 views

Genomics & Bioinformatics

Human Cell Transcriptional Response to Space Radiation and Gamma Rays

NASA's dataset compares gene expression in human lymphoblastoid TK6 cells after exposure to simulated space radiation (1.67 Gy HZE iron ions) or 2.5 Gy gamma rays versus mock irradiation. Transcriptional profiling was performed with RNA harvested 24 hours post-exposure, using three independent biological replicates per condition. The data was generated at the NASA Space Research Laboratory (NSRL) of Brookhaven National Laboratory.

TabularGene ExpressionLymphoblastoid CellsRadiation BiologySpace Radiation+1

0 views

Genomics & Bioinformatics

Mouse Skeletal Muscle Gene Expression After 11-Day Spaceflight and Ground-Based Analogs

A microarray expression analysis of gastrocnemius muscle from mice flown on the STS-108 shuttle mission for 11 days and 19 hours, compared to Earth-based controls. The dataset also includes results from ground-based hindlimb suspension (12 days) and reloading (3.5 hours) experiments to simulate unloading and post-landing effects. The data was produced by the National Aeronautics and Space Administration.

TabularMouse ModelGene ExpressionSpaceflightMicroarraySkeletal Muscle+1

0 views

PreviousPage 354 of 1192Next