DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,758 datasets
Experimental data from a rat model of myocardial infarction investigating neural mechanisms of heart-lung crosstalk. The dataset includes molecular profiling, immunofluorescence, tissue clearing, and functional assay results, totaling 464.8 MB in TIF, PNG, and other formats. It was authored by Hanjun Wang and last updated on April 23, 2026.
Results from a 2026-01-30 analysis by the Type (Strain) Genome Server (TYGS) provide digital DNA-DNA hybridization (dDDH) values and intergenomic distances for Bacillus pseudomycoides strain CHAES I 2_2. The data includes comparisons against ten closest type strains determined via MASH and GBDP algorithms. Author Alina Kharchuk published the findings on figshare in April 2026.
Digital DNA-DNA hybridization (dDDH) values for the bacterial strain Bacillus pseudomycoides CHAES I 2_2, calculated via the Type (Strain) Genome Server (TYGS). The analysis was performed by Alina Kharchuk and results were generated on 2026-01-30. The dataset is small, comprising a single Excel file of 9.8 KB.
Geoscience Australia Data published 'Continuity of Earth Observation Data for Australia - Operational Requirements to 2015 for Lands, Coasts and Oceans'. The document outlines the use of space-based Earth observation data for monitoring Australia's environment and natural disasters. It was last updated on 2026-05-14.
A computational model of calcium ion diffusion within the sarcoplasmic reticulum of amphibian skeletal muscle during excitation-contraction coupling. The model, authored by Katherine J.-X. Lin and uploaded to figshare in 2026, utilizes established muscle anatomy and calcium buffering parameters to simulate concentration gradients over millisecond to second timescales.
AusSeabed consortium guidelines provide recommended procedures for survey planning, data acquisition, and submission of multibeam sonar data. Version 2 incorporates content from the Seafloor Mapping Field Manual for Multibeam Sonar and aims to improve interoperability and standards for Australian waters. The guidelines are designed to complement specific survey requirements and align with international initiatives like Seabed2030.
Part of uv-scripts, a collection of self-contained scripts for local or Hugging Face Jobs execution. This script removes duplicate or near-duplicate text samples from a Hugging Face dataset using SemHash with Model2Vec embeddings, which is CPU-optimized and requires no GPU. The dataset page was last updated on 2026-06-05.
Maternal mortality cases in the Antioquia department of Colombia from 2005 to 2024. The dataset is updated annually with data from the latest year and includes municipality-level details. It originates from the Colombian open data portal www.datos.gov.co.
Yunsheng Liu published a 2.0 GB dataset on figshare in May 2026 for TMC-Llama models. It includes Transition Metal Complex (TMC) data with TMC-SMILES strings for publication figures, a dataset for fine-tuning models, and the resulting PyTorch model weights.
Quaternary geology investigations in the Coal River map sheet (NTS 95D) during the 2009 field season focused on characterizing surficial materials and their distributions. The Government of Yukon produced this dataset, which includes observations of moraine deposits, streamlined glacial landforms, and glaciolacustrine deposits.
A jurisdictional scan compares how British Columbia, Alberta, New Brunswick, Nova Scotia, and Northwest Territories manage public emergency communications. The report provides insights and recommendations for Yukon's approach, highlighting integrated platforms, multi-channel strategies, and governance policies. It was published by the Government of Yukon in April 2025.
International Student Enrolments - State Schools is a dataset published on data_gov_au. The data is provided by the Queensland Department of Education and was last updated in May 2026. The dataset's specific content, such as enrollment numbers by year or school, is not detailed in the provided metadata.
Global satellite-derived aerosol data is provided on a 1x1 degree latitude-longitude grid, aggregated monthly from daily observations. The dataset contains 45 science data layers, including aerosol optical depth and Angstrom exponent, derived from the VIIRS instrument on the Suomi NPP satellite. Monthly grid elements are only considered valid if they contain data from at least three valid days within the month.
NASA's VIIRS/SNPP Deep Blue Level 3 daily aerosol dataset provides a global, gridded view of atmospheric aerosols. The product contains 45 Science Data Set layers, including arithmetic mean and standard deviation for aerosol optical depth, derived from quality-filtered satellite retrievals. Data collection began on March 1, 2012, with daily files requiring at least three valid measurements per 1x1 degree grid cell.
Ethiopian Plasmodium vivax samples (Pv01–Pv03) were analyzed to derive consensus sequences for Matryoshka RNA virus (MaRNAV) segments. The dataset includes nucleotide and protein sequences, BLAST validation reports, phylogenetic alignments, and maximum-likelihood tree outputs. It was created by Frank Nyondo and last updated on 2026-04-19.
Paraguayan prisons provided data for 836 incarcerated individuals (621 men, 215 women) in a cross-sectional study. The dataset contains screening results for ADHD symptoms and their associations with psychological variables like suicide risk, hostility, and anxiety. Julio Torales authored the study, with data last updated in April 2026.
A 3.7 MB dataset from figshare, last updated on 2026-05-07. It contains a list of Microbacterium species genomes used for comparative and phylogenomic analyses, along with tables detailing pairwise genomic comparisons, activity experiment results, and predicted gene clusters. The dataset was authored by Jina Kim and is shared under a CC-BY-4.0 license.
A 218.7 KB spreadsheet containing metadata for samples used in a genomics study. The data includes sample IDs, species, locality, and gene-capture statistics, split across two sheets for new and previously sequenced samples. Sean P. Heighton published the dataset under a CC-BY-4.0 license in May 2026.
Acoustic Doppler Current Profiler (ADCP) data collected from May 8 to June 3, 2021, during RV Investigator voyage IN2021_V03, which monitored the East Australian Current at 27 degrees South. The data were collected using OS75 and OS150 ADCPs in narrowband mode, processed with CODAS, and archived by CSIRO's National Collections and Marine Infrastructure. Transducers were located approximately 8.0 meters below the water line for the voyage's duration.
RNA sequencing data generated from primary astrocytes isolated from control and valproic acid-exposed rats. The dataset is 1.6 MB in size, stored in an XLSX file, and was last updated on May 31, 2026. It was authored by 'zou' and is shared under a CC-BY-4.0 license.