DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,848 datasets
Quaternary Aminostratigraphy of Eolianite on Lord Howe Island, Southwest Pacific Ocean is a dataset from the Australian Ocean Data Network. It contains amino acid racemization (AAR) dating data used to correlate disparate eolianite successions and establish a geochronological framework from the Holocene to the Middle Pleistocene. The data includes D/L ratios for amino acids from land snails and whole-rock samples, defining three aminozones and linking dune deposition to periods of high sea level.
Replication data for a 2026 study testing hypotheses on China's customs-BoP gap using mirror trade data. The dataset likely contains Chinese-reported exports versus partner-reported imports for HS-2 chapters 84, 85, and 87 across 12 major partners from 2018 to 2024. It was created by Brian Peters of Demographics and Global Capital Allocation and updated on May 1, 2026.
Geoscience Australia Data produced a report synthesizing the geodynamic setting, architecture, and age of the Archean to Mesoproterozoic Gawler Craton and Curnamona Province. The report integrates results from geological synthesis, seismic interpretation, sequence stratigraphy, geochronology, and geochemistry. It was last updated on 2026-04-21.
45 Science Data Set layers provide aerosol optical depth and related parameters at a 6-km spatial resolution, derived from the VIIRS sensor on the Suomi NPP satellite. This dataset is part of NASA's MEaSUREs project, which applies a consistent Dark Target algorithm across seven geostationary and low-Earth orbit sensors to ensure scientific maturity. Its 6-minute cadence generates approximately 130 data granules per day during daylight hours, and the record runs from January 2019 through December 2022.
NASA's VIIRS/NOAA20 Dark Target Aerosol L2 product provides satellite-derived measurements of Aerosol Optical Thickness (AOT) over land and ocean. The dataset, part of the NASA MEaSUREs GEO-LEO project, uses a 6-km at-nadir resolution algorithm and is produced at a 6-minute cadence. It contains 45 Science Data Set layers, including geolocation and geophysical parameters, and spans from January 2019 through December 2022.
A collation of marine surveys from the Fal and Helford European Marine Site conducted in 2002. The data were gathered by English Nature (Natural England) following established methodologies for environmental evidence. Surveys include condition assessments, verification for recommended Marine Conservation Zones, and investigations of Natura 2000 site features.
Untargeted metabolomics data from a study on enhancing the feed value of cassava residue through solid-state fermentation. The dataset contains results from OPLS-DA modeling and univariate analysis, identifying 2,642 differential metabolites. It was authored by Caizhi Wei and shared on figshare under a CC-BY-4.0 license in April 2026.
Replication package for 'Demographics, Development, and the Composition of Gross External Assets' by Brian Peters (2026). The dataset is a 193-country panel from 1980 to 2024, containing demographic principal components and gross external asset composition data. It tests whether demographic aging systematically predicts shifts from official reserves toward private portfolio and direct investment.
Replication data for a 2026 study examining the link between donor-country demographic aging and official development assistance (ODA) cuts. The package includes a 22-country donor panel from 1990 to 2024, a bilateral gravity panel covering 22 donors and 190 recipients from 1995 to 2024, and analysis of three natural experiments. Key findings suggest aged donors compress total ODA uniformly rather than reshaping recipient allocation.
A 2001 collation of marine surveys by English Nature to gather evidence for conservation management. The survey purposes include verification for recommended Marine Conservation Zones, condition assessments, and surveys of Natura 2000 site features. Data collection followed established methodologies and standards.
A phylogenomic dataset for Lepidoptera, containing refined alignments for 100 nuclear protein-coding loci and a concatenated DNA matrix from 33 specimens. The dataset, created by Jiaxuan Li, also includes a phylogenetic tree output generated by IQ-Tree2. It was last updated on April 18, 2026.
The ITINERIS project analysis from 2023 systematically examines 155 facilities from eight marine Research Infrastructures in the central Mediterranean. It evaluates the capacity to produce 107 recognized Essential Variables, with 50% actively produced and over 90% meeting established requirements. The study provides key recommendations for strengthening Italy's contribution to global environmental monitoring.
155 facilities from eight marine Research Infrastructures actively produce 50% of the 107 recognized Essential Variables (EVs), with over 90% meeting established requirements. The analysis, conducted in 2023 under the ITINERIS project, evaluates the spatial, thematic, and technological coverage of EOVs, ECVs, and EBVs across the central Mediterranean Sea and Italian coasts. The dataset documents a mature network for oceanic and climate variables, while biodiversity observations are less represented.
Simone Toller's 2023 analysis within the ITINERIS project evaluates the capacity of 155 facilities from eight Italian and pan-European marine Research Infrastructures to monitor Essential Ocean, Climate, and Biodiversity Variables. The study finds that 50% of 107 recognized Essential Variables are actively produced, with over 90% meeting established requirements. The dataset is a 65.5 KB PDF report detailing spatial, thematic, and technological coverage in the central Mediterranean Sea and along Italian coasts.
A 2023 analysis of 155 marine research facilities across the Mediterranean Sea and Italian coasts, evaluating their capacity to produce Essential Ocean, Climate, and Biodiversity Variables. The study, conducted under the ITINERIS project, found that 50% of 107 recognized Essential Variables were actively produced, with over 90% meeting established requirements. It provides recommendations for enhancing coordination among national and European research infrastructures.
Ben Steers at New York University's Center for Urban Science and Progress created a dataset of over 1,100 annotated plumes from Manhattan building emissions. The dataset was generated by applying a trained deep convolutional neural network to archival skyline imagery captured at 0.1 Hz by the Urban Observatory. It contains detections of plumes, classified by color, from two periods: October 26 to December 31, 2013, and January 1 to March 13, 2015.
Results from a bioinformatics process for detecting and annotating tandem repeats in biological sequences. The dataset was authored by Roberto A. Pava-Díaz and is hosted on Harvard Dataverse. It was last updated on June 23, 2026.
Theodora Hatziioannou's dataset contains variant frequency and mutation scores for HIV-1 strains subjected to selection by broadly neutralizing antibodies 3BNC117 or 10-1074. The data is shared under a CC-BY-4.0 license as a 1.4 MB XLSX file on figshare, last updated in May 2026.
NASA's dataset compares gene expression in human lymphoblastoid TK6 cells after exposure to simulated space radiation (1.67 Gy HZE iron ions) or 2.5 Gy gamma rays versus mock irradiation. Transcriptional profiling was performed with RNA harvested 24 hours post-exposure, using three independent biological replicates per condition. The data were generated at the NASA Space Radiation Laboratory (NSRL) at Brookhaven National Laboratory.
A microarray expression analysis of gastrocnemius muscle from mice flown on the STS-108 shuttle mission for 11 days and 19 hours, compared to Earth-based controls. The dataset also includes results from ground-based hindlimb suspension (12 days) and reloading (3.5 hours) experiments to simulate unloading and post-landing effects. The data was produced by the National Aeronautics and Space Administration.