DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,869 datasets
Part 2 of a geological report details the Permian stratigraphy of the Carnarvon Basin in Western Australia. The report, published by the Australian Ocean Data Network, provides thickness data for geological periods, including Permian sediments up to 15,200 feet thick. It describes the basin's structure and sedimentary history from the Proterozoic to the Tertiary.
30 species are described from Devonian coral faunas in Western Australia's West Kimberley region, with 22 from the Pilbara Limestone. The dataset documents fossil coral species from the Silurian of New South Wales and the Devonian of Western Australia, published by the Australian Ocean Data Network. The record was last updated in April 2026.
Pre-extracted embeddings from 26 Earth-observation foundation models evaluated on 24 downstream tasks. The dataset was created by AI2 in 2025 to support the findings in the OlmoEarth paper. It contains model outputs for train, validation, and test splits using paper-best hyperparameters.
10,314 robust polymorphic markers and 13 curated variants from a 22K SNP array designed for grapevine genetics. The data was generated by genotyping 144 genotypes from two diversity panels to validate the array. Laura Costantini published this dataset on figshare under a CC-BY-4.0 license, with a last update in March 2026.
Australia's offshore seabed, including international waters, is mapped in this multibeam bathymetry dataset compiled by Geoscience Australia. The data is gridded at a 50-meter resolution and represents holdings as of June 2018. It is accessible via Geoscience Australia's Marine data portal and is not intended for navigational use.
Data Sheet 1 presents in silico performance results for a targeted enrichment sequencing approach to classify Mycoplasma bovis strains in milk. The dataset is based on 620 M. bovis whole-genome sequences from NCBI, with 162 originating from milk, and simulates mock milk samples with varying strain mixtures. It was authored by Marit M. Biesheuvel and last updated on March 18, 2026.
A synthetic benchmark dataset of 620 Mycoplasma bovis whole-genome sequences, of which 162 (26.1%) originated from milk. The dataset, authored by Marit M. Biesheuvel and last updated in March 2026, contains results from an in silico evaluation of two classification tools (Kraken2 and Themisto/mSWEEP) on simulated targeted enrichment sequencing data for strain-level identification. Performance metrics include read classification accuracy, sensitivity, and positive predictive value across varying numbers of genomically clustered sequence variants (GSVs) and enrichment proportions.
Australian Ocean Data Network hosts a fossil catalog detailing the brachiopod super-family Productacea in Permian faunas of Western Australia. The collection comprises over 3,000 specimens representing at least 34 species across genera like Aulosteges, Dictyoclostus, and Taeniothaerus. This record was last updated in April 2026.
Australian Ocean Data Network provides geochemical data from Lake Frome, a playa lake in South Australia. The dataset includes analysis of major and minor elements in clastic sediments and hypersaline brines, with stratigraphic units defined from auger hole samples covering approximately 17,000 years. The data was last updated in April 2026.
Uniform Commercial Code (UCC) Lien Filings is a dataset of active liens filed with the Business Services Division of the Office of the Secretary of the State in Connecticut. The data includes UCC, vessel, aircraft, IRS, state labor, and municipal liens that are active or less than one year past lapse. The source data is from business.ct.gov as of July 2021, and the dataset was last updated on the platform in April 2026.
Chinese A-share listed companies' data from 2011 to 2021, sourced from Wind, CSMAR, and Bloomberg databases. The dataset was created by Chen, Minzhi and hosted on Harvard Dataverse. It was last updated on May 26, 2026.
Pre-trained model weights for the research paper 'Precision culturomics enabled by unlabeled single-cell morphology and Raman spectra'. The 4.3 GB resource was published by an author named Liang under an MIT license and was last updated on April 26, 2026. The weights are stored in ZST and PTH file formats.
Victoria, Australia's Vicmap Hydro Water Point layer contains point features delineating hydrological features such as rapids, springs, waterfalls, and dams. The data is provided by the Department of Transport and Planning and was last updated on 2026-04-09. Features carry a name attribute, but the description notes that it is not necessarily populated for every feature.
Vicmap Elevation - Morphology Line is a geospatial vector dataset from the Department of Transport and Planning. It contains line features delineating landforms such as embankments, cuttings, sand dunes, levees, and cliffs. The data is part of the Vicmap Elevation 10-20 Contours & Relief subset and was last updated on 2026-04-09.
Supporting information for the manuscript 'Enabling the prediction of phage receptor specificity from genome data'. The dataset includes supplementary tables, datasets, and high-resolution figures. Author Lucas Morinière published the 80.3 MB collection on figshare under a CC-BY-4.0 license, last updated on 2026-04-18.
CyMeta-ImaGating is a dataset of single-cell metabolic atlases generated using metabolomic cytometry with an image-enabled gating strategy. The data includes verified single-cell profiles from scraped HeLa cell suspensions, spleen, liver, and cancer cells, demonstrating transitions in metabolic states after pathogen activation and drug treatment. The dataset, authored by Yuanyi He and last updated in March 2026, is shared under a CC-BY-NC-4.0 license.
Over 40,000 retired address points document locations in Washington, D.C. that once existed but no longer do. The data was created as part of the Master Address Repository (MAR) for the Office of the Chief Technology Officer and Department of Buildings. It includes features such as placement location, created date, and last edited date, with a data dictionary available online.
RAF-Ready Folktables is a processed dataset used in the paper 'Retrieval-Augmented Dataset Assembly for Fair Clustering'. The data were prepared by author pengyueli for evaluating RAF, a framework for retrieving external samples to reduce minority under-representation in clustering tasks. The dataset was last updated on 2026-05-15.
The Office of Temporary and Disability Assistance (OTDA) aggregates public assistance recoveries from lottery winnings intercepts. Data is organized by month, year, district, and assistance type, showing the total amount recovered. The dataset was last updated on 2026-04-16.
The Advisory Council on Women's Issues in Yukon, Canada, documented its meetings, events, and actions for the 2021-22 fiscal year. The Government of Yukon published this report, which is available in HTML and PDF formats. The dataset was last updated on the platform in April 2026.