Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,772 datasets
NASA's TROPESS project provides vertical profiles of peroxyacetyl nitrate (PAN) at 16 levels from the surface to 0.1 hPa, captured during the 2019-2020 Australian wildfire outbreak. This standard product is derived from the CrIS instrument on the Suomi-NPP satellite using the MUSES optimal estimation algorithm. Daily netCDF files offer a 14 km spatial resolution focused on the Australia region (60S-0S, 100E-177.5E).
Global satellite data provides the vertical distribution of peroxyacetyl nitrate (PAN) from February 1 to May 21, 2021. The dataset contains retrieved atmospheric states, formal uncertainties, and diagnostic information from the CrIS instrument on the Suomi-NPP satellite, processed using the TROPESS project's MUSES optimal estimation algorithm. Data is reported at 16 vertical levels from the surface to 0.1 hPa with a spatial resolution of 14 kilometers.
August to October 2020 data captures the vertical distribution of peroxyacetyl nitrate (PAN) over the contiguous United States during major wildfire outbreaks. This satellite-derived dataset provides daily atmospheric profiles at 16 vertical levels, retrieved from the Cross-track Infrared Sounder (CrIS) on the Suomi-NPP platform using the MUSES optimal estimation algorithm. It is designed for analyzing the transport and chemistry of a key ozone precursor generated by biomass burning.
Spatial extents and collection day details for garbage and recycling zones within the Northern Grampians Shire Council. The dataset includes route names, collection days, frequencies, and start dates for both rubbish and recycling services. It was created and is maintained by the Northern Grampians Shire Council, with a last update recorded in April 2026.
Spatial extents and collection day details for garbage and recycling zones within the Northern Grampians Shire Council. The dataset includes route names, collection days, frequencies, and start dates for both rubbish and recycling services. It was created and is maintained by the Northern Grampians Shire Council, with a last update recorded in April 2026.
A training dataset for mathematical embedding models built on the principle that concepts can be expressed in multiple surface forms. It includes informal natural language, rephrasings, and Lean 4 type signatures and declarations, designed for contrastive embedding training. The dataset was created by uw-math-ai and was last updated on 2026-05-29.
~22,000 non-overlapping 131 KB bins of the human genome (hg38/GRCh38) are embedded into a 3,072-dimensional latent space using the ALPHAGenome foundation model. The embeddings were created by the lagosproject and were last updated on the platform in May 2026. These pre-computed embeddings power an interactive browser visualization for exploring latent relationships between genomic regions.
From February 1 to May 21, 2021, this dataset provides daily global vertical profiles of atmospheric ammonia (NH3) concentrations. It is derived from the Cross-track Infrared Sounder (CrIS) instrument aboard the Suomi-NPP satellite, processed using the MUSES optimal estimation algorithm. Data are reported at 15 vertical levels from the surface to 0.1 hPa with a spatial resolution of 14 km.
TROPESS CrIS-SNPP L2 Ammonia for West Coast Fires provides vertical distribution data for atmospheric ammonia (NH3) measured by the CrIS instrument on the Suomi-NPP satellite. This standard product focuses on the CONUS region from August to October 2020, capturing emissions during the outbreak of major U.S. West Coast wildfires. Data are reported at 15 vertical levels from the surface to 0.1 hPa with a spatial resolution of 14 km, generated using the MUSES optimal estimation algorithm.
From February 1, 2021 to May 21, 2021, this dataset provides daily global vertical profiles of atmospheric methane (CH4) concentrations and their uncertainties. The data are derived from the Cross-track Infrared Sounder (CrIS) instrument on the Suomi-NPP satellite, processed using the NASA TROPESS project's MUSES optimal estimation algorithm. Each netCDF file contains measurements at 26 vertical levels from the surface to 0.1 hPa, with a spatial resolution of 14 km at nadir.
Australia's Regional Development Authority (RDA) administrative boundaries for the 2015-16 period, created by the Department of Infrastructure and Regional Development. The dataset maps the national network of RDA committees, built from the ABS/PSMA Local Government Area (LGA) 2015 boundary dataset. It was published in April 2016.
A 2015-16 spatial dataset maps 55 Regional Development Australia (RDA) committee regions across Australian states and territories. The Department of Infrastructure and Regional Development created it by aggregating Local Government Area (LGA) boundaries from the ABS/PSMA 2015 dataset. It includes updates for council amalgamations and special administrative areas like external territories.
Abeer Elshater's dataset supports the study "Epistemic Stewardship and Gatekeeping in Scholarly Journals" and documents a structured data-mining and content-analysis process. It is based on a corpus of 132 manuscripts and is organized into two tables, including query strategies and coded thematic outputs. The dataset was last updated on 2026-05-10 and is licensed under CC-BY-4.0.
359 genomes of Shiga toxin-producing Escherichia coli O118 were profiled to establish a phylogenomic framework and define virulence gene boundaries. The dataset includes supplementary material such as PNG images and XLSX files totaling 8.3 MB. Irvin Rivera authored the dataset, which was last updated on May 5, 2026.
A gene expression dataset for diabetic foot ulcers (DFU) analysis, identified from bulk and single-cell RNA sequencing. The data includes ten differential expressed lactylation-related genes and a diagnostic model built on five genes (ADH1B, ARTN, KY, PFKFB2, PFKFB4). The 1.2 MB dataset was authored by Xiaotian Zhang and last updated on April 30, 2026.
Serum biomarker data was collected using the Open Data Kit (ODK) platform and exported in CSV format. The 14.4 KB file corresponds to the data published as S2 Data in a PLOS ONE article. Author Philemon Mohammed Seid uploaded it to figshare with a CC-BY-4.0 license, last updated on 2026-04-30.
Urinary biomarker data collected using ODK and exported in comma-separated values format. The dataset is 131.7 KB in size and was authored by Philemon Mohammed Seid. It was last updated on April 30, 2026.
Cole E. McGuire's 235.9 KB CSV file provides predicted confidence rankings for yeast genes involved in mitochondrion organization. The data includes scores from three models: MEFIT, SPELL, and a neural network, with genes labeled as positive, negative, or neutral. It was used to construct figures in a PLoS ONE article published in 2026.
Daily Public Transport Patronage data records passenger boardings by service type, such as light rail and bus. The dataset is provided by the ACT Government Open Data and was last updated in March 2026. Paper tickets from Ticket Vending Machines are excluded, except for those on light rail platforms.
A dataset from 2026 defining periods of human development and adulthood using single-nucleus RNA sequencing data. The data is provided in an XLSX file sized 9.4 KB and is licensed under CC-BY-4.0. It was authored by Jiafang Li and last updated on figshare in May 2026.