Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
22,990 datasets
E12.5–E14.5 single-cell RNA-seq data from a reanalysis of public datasets, characterizing epithelial state changes during early mouse molar development. The dataset was created by Yuanjing Jiang and is licensed under CC-BY-4.0. It was last updated on 2026-05-19.
A multicenter study of 1,301 patients developed a stacking ensemble model for preoperative prediction of visceral pleural invasion in lung adenocarcinoma. The model integrates 3D intratumoral heterogeneity scores from chest CT scans with clinicoradiologic features, achieving an AUC of 0.878. The dataset, authored by Qunzhi Ouyang and last updated in May 2026, contains the quantitative scores and features used in this research.
A multicenter retrospective study from three medical centers includes data from 1,301 patients with lung adenocarcinoma (LUAD). The dataset was used to develop a stacking ensemble model integrating three-dimensional intratumoral heterogeneity scores with clinicoradiologic features for preoperative prediction of visceral pleural invasion. The model achieved an area under the curve of 0.878, with the 3D ITH score identified as the most influential predictor.
University of Newcastle researchers captured media attention in 2017 with a study modelling tsunami risk for Sydney, considering scenarios from minor disruptions to rare one-in-5000-year disasters. The Australian Institute for Disaster Resilience and the Australian Tsunami Advisory Group published the Tsunami Emergency Planning in Australia Handbook on 5 November 2018. This handbook outlines causes, characteristics, and planning considerations for coastal and maritime communities under the Australian Tsunami Warning System.
251 sonobuoys deployed during the 2021 TEMPO survey captured approximately 460 hours of underwater acoustic recordings and Antarctic krill 3D coordinates. The Marine National Facility RV Investigator collected this data on a voyage from Hobart between 29 January and 24 March 2021. Recordings provide complementary data on marine mammal presence, with blue whales and sperm whales particularly well-detected acoustically.
23 files totalling 11.1 GB of raw multibeam echosounder data were collected aboard RV Investigator voyage IN2018_V04 from 11 September to 8 October 2018. The Kongsberg EM710 MKII system acquired seafloor bathymetry, backscatter, and watercolumn backscatter data at a nominal frequency of 40-100 kHz. Processed data includes line data in *.gsf and ASCII formats, and bathymetry/backscatter grids in GeoTIFF format.
West Coast Tasmania bathymetry data was acquired by CSIRO aboard the TV Bluefin between 22 February and 21 March 2022. The multibeam survey was conducted for the Institute for Marine and Antarctic Studies (IMAS) and processed into a 2-meter resolution GeoTIFF grid. A detailed report on the survey is publicly available.
Illumina NovaSeq X Plus sequencing data from RNA samples extracted from sorghum and foxtail millet leaf tissue collected after cold exposure at dawn, dusk, or under control conditions. Three replicates were collected, each representing an independent growth trial under a 14-hour light/10-hour dark photoperiod. FASTQ files are available through the European Nucleotide Archive, with two foxtail millet samples and one sorghum sample excluded due to poor quality control.
Global satellite data from the VIIRS/SNPP instrument provides atmospheric aerosol loading measurements for daytime, cloud-free, and snow-free scenes. The Level 2 product, version 2.0, has a nominal at-nadir resolution of 6 km and uses the Deep Blue algorithm over land and the SOAR algorithm over ocean to retrieve Aerosol Optical Thickness at a 550 nm reference wavelength. This dataset is part of the Collection 2.0 product, which includes algorithmic improvements and is also available for the NOAA-20 satellite.
NASA's VJ103IMG product provides terrain-corrected geolocation vectors for the VIIRS sensor aboard the JPSS1 satellite. It contains geodetic latitude, longitude, surface height, solar and sensor angles, a land/water mask, and a quality flag for every pixel at a 375-meter resolution. The data is derived from satellite ephemeris, attitude data, and digital terrain models.
A study of 257 commercial Duroc×(Landrace×Yorkshire) pigs integrates genome-wide and microbiome-wide association studies. The data includes genetic and microbial influences on intramuscular fat, meat color, marbling, and moisture. The dataset was authored by Zhuoda Lu and last updated on 2026-05-19.
75.8% of Queensland's existing dwelling stock consists of detached dwellings, according to this dataset. It was published by the Queensland Department of Environment, Tourism, Science and Innovation and last updated in May 2026. The description notes that building approvals for high-rise dwellings were relatively high in the 2016–2017 period.
624 patients participated in a randomized controlled trial to evaluate an AI-based warfarin management system. The AI-WAR software, developed by Yuan Li, integrates remote follow-up and a bidirectional LSTM dosing model. It significantly improved median Time in Therapeutic Range from 48.7% to 81.3% and reduced adverse event rates.
Francesco Forte's 1.8 GB dataset integrates genome-wide ddRAD-seq data, geometric morphometrics, and paleodistribution models for the grasshopper genus Italohippus. The data supports a model of Pleistocene climatic oscillations driving microgeographic diversification in Mediterranean sky islands. It was last updated on May 8, 2026, and is shared under a CC-BY-4.0 license.
Community Counts data provides taxfiler income statistics segmented by age group and sex for Canadian provinces and counties. The dataset is archived and no longer maintained, with users directed to Statistics Canada for current information. Its columns suggest it contains annual income totals and distributions across eight distinct age brackets.
Sheel Chandra's dataset provides a summary of CpG sites and single nucleotide polymorphisms (SNPs) for multiple species, last updated on June 1, 2026. The analysis excludes genic regions using NCBI RefSeq annotations and, for human data, also excludes phylogenetically conserved regions defined by phastCons. The dataset is a 9.4 KB XLSX file licensed under CC-BY-4.0.
A 9.5 KB Excel file published on figshare by Lisa H. Verzier on 2026-05-18. The dataset likely contains selection criteria for CRISPR-based functional genomics screens in engineered HC-04 hepatocytes to study host factors involved in Plasmodium falciparum sporozoite traversal and invasion.
A collection of RNAScope in situ hybridization and immunofluorescence microscopy images from a 2026 study on Shiga toxin-producing Escherichia coli O157:H7. The dataset visualizes the initial adherence of bacterial strains to bovine rectoanal junction tissues in vitro. It was created by author Indira T. Kudva and shared under a CC-BY-4.0 license.
Indira T. Kudva's dataset contains RNAScope in situ hybridization and related data on Shiga toxin-producing Escherichia coli O157:H7 adherence to bovine intestinal tissue. The 23.9 MB XLSX file includes results from RAJ squamous epithelial cell and organ culture assays, comparing wild-type, slp deletion mutant, and complemented bacterial strains. The dataset was last updated on 2026-05-18 and is shared under a CC-BY-4.0 license.
Jiazhen Dong published raw data on Kaposi’s Sarcoma-associated herpesvirus (KSHV) lytic replication on figshare in May 2026. The data likely contains experimental results on the interaction between viral protein K8 and host transcription factor ATF3. The dataset is stored in an XLSX file sized 60.6 KB.