DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,725 datasets
5.7 GB of metabolomics data from cecal and serum samples supports a study on primate intestinal microbes. The dataset, authored by Anthony Pulvino, is associated with research under review at Microbiology Spectrum. It was last updated on May 14, 2026.
Lord Howe Island eolianite deposits are dated using amino acid racemization (AAR) to establish a geochronological framework from the Holocene to the Middle Pleistocene. The dataset, published by the Australian Ocean Data Network, defines three aminozones based on racemization extents in land snails and whole-rock samples, correlating with Oxygen Isotope Stages 5 and 7. These AAR data support independent lithostratigraphic interpretations and indicate eolianite deposition occurred over a longer interval linked to high sea-level periods.
573,661 reviewed protein sequence embeddings from UniProtKB and two labeled datasets for protein-protein interaction prediction. The balanced dataset contains 249,814 protein pairs, and an oversampled version contains 1,082,662 pairs. Authored by Md Shahidul Islam and last updated on 2026-05-02.
OSR2 expression correlates with poor prognosis in several cancers and is strongly associated with cancer-associated fibroblasts in the tumor microenvironment. This 26.8 MB CSV file contains integrated pan-cancer analysis data from TCGA and GEO, authored by Shijie Liu and last updated in April 2026. Gene set enrichment analysis suggests OSR2 may promote tumor progression through epithelial-mesenchymal transition.
Mendelian randomization evidence from genome-wide association study data clarifies causal links between basal metabolic rate, adiposity, and obstructive sleep apnea. The dataset includes univariable, multivariable, and bidirectional analysis results, such as odds ratios and confidence intervals. It was authored by Mengjie Zhang and shared on figshare in April 2026.
Over 3,000 well-preserved specimens of the brachiopod superfamily Productacea from Permian marine sediments in Western Australia. The collection includes at least 34 species across seven genera, sourced from three major basins covering approximately 150,000 square miles. This dataset was published by Geoscience Australia Data and was last updated on 2026-04-30.
BAG provides a national registry of buildings, addresses, residences, and public spaces in the Netherlands, each with associated geometry. The Amsterdam municipality supplements this with legally mandatory BAG-plus features, such as building names and specialized function classifications. Data is provided by the Dutch Ministry of the Interior and Kingdom Relations under a CC0-1.0 license.
Afghanistan's border monitoring data from 2022, collected by UNHCR. The dataset captures estimates of weekly flows and movement composition at over 50 unofficial crossing points to Iran and Pakistan, derived from interviews with key informants. It is an anonymized version of the original data, published in PDF and web app formats.
NASA's JPSS-2/NOAA-21 satellite provides terrain-corrected geolocation data for the Visible Infrared Imaging Radiometer Suite (VIIRS) Day/Night Band. The product includes latitude, longitude, surface height, and solar, lunar, and sensor viewing angles for each 750-meter pixel. On-orbit validation corrected geolocation errors, bringing uncertainties to within 75 meters (1-sigma) in both scan directions.
Afghanistan's border monitoring data from 2023, collected by UNHCR. The dataset contains anonymized results from interviews at over 50 locations to understand refugee flows and barriers to movement. It reports that 81% of movements to Iran and less than 18% to Pakistan occur via unofficial crossings.
Geological Survey of Victoria provides interpreted geological data derived from airborne magnetic, radiometric, and gravity surveys mapped at a 1:250,000 scale. The dataset includes areas where newer 1:250,000 mapping supersedes older 1:100,000 mapping, such as St Arnaud, Bendigo, and Grampians regions. It is accompanied by related datasets for sub-surface polygons, structural lines, and metamorphism.
June 2025 administrative data from Employment and Social Development Canada provides aggregated statistics on Registered Disability Savings Plan beneficiaries, take-up rates, and financial details. The release includes figures for the Canada Disability Savings Grant, Canada Disability Savings Bond, contributions, and total RDSP assets up to December 2024. Statistics are published annually on the Open Government Portal and complement the program's official annual reports.
Interpreted geological data for deeply buried units beneath the surface geology of Victoria, Australia. The dataset combines interpretations from airborne magnetic, radiometric, and gravity surveys mapped at scales of 1:100,000 and 1:250,000. It was collected by the Geological Survey of Victoria and is accompanied by related datasets for geological polygons, boundaries, and structural lines.
Part 2 of a three-part geological report details the Permian stratigraphy of the Carnarvon Basin, an epicontinental basin with a maximum known Permian thickness of 15,200 feet. The report, published by Geoscience Australia, describes marine sediments, including glacial deposits, that rest unconformably on older rocks. It provides detailed thickness breakdowns for Sakmarian, Artinskian, and Kungurian stages.
A map selection of iconic heritage sites in the province of South Holland, based on seven thematic storylines. The storylines cover trade and economy, agriculture, water, (military) defences, governance, urban development, and architectural styles. The selection was compiled from existing sources like the Canon of South Holland and supplemented by expert sessions.
A dataset from the Dutch Ministry of the Interior and Kingdom Relations classifying settlements based on their historical and architectural value. It uses criteria of intactness, rarity, and coherence to assign categories of 'very high', 'high', and 'reasonably high' value. The data is available in WMS, WFS, PNG, and HTML formats under a CC-PDM-1.0 license.
Nine heavy metals were analyzed in adult muscle, in-utero eggs, oviposited eggs, hatchlings, and sand from two olive ridley turtle rookeries in India. Arsenic was the most prominent metal in adult turtles, suggesting bioaccumulation, while Selenium was higher in egg components. The dataset, authored by Sharon Pradhan and last updated in April 2026, provides evidence of maternal and environmental heavy metal transfer to this endangered species.
Interpreted geological data for deeply buried units in Victoria, derived from combined airborne magnetic, radiometric, and gravity surveys at a 1:250,000 scale. The dataset was created by the Geological Survey of Victoria and includes polygon, boundary, structural line, and metamorphism data. Recent high-quality 1:250,000 mapping supersedes older 1:100,000 mapping in specific areas like St Arnaud, Bendigo, and Grampians.
Replication data for a 2026 study accepted by the Journal of Labor Economics. The dataset likely contains information on parental labor supply and earnings in Australia during the COVID-19 pandemic, specifically focusing on periods of school disruption. It was authored by Nicolas Salamanca and last updated on June 17, 2026.
5.5 KB of summary statistics for logarithmic daily returns of copper, nickel, aluminum, and zinc futures from the London Metal Exchange and Shanghai Futures Exchange. The dataset, created by Cunhai Pan and last updated in April 2026, supports analysis of dynamic price spillovers between these two major markets over an eight-year period.