Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
22,730 datasets
Level 1B multispectral imagery was collected by the MODIS/ASTER Airborne Simulator (MASTER) instrument during four flights over California and New Mexico in October 2008. The dataset provides georeferenced calibrated radiance across 50 spectral bands from 0.460 to 12.879 micrometers at approximately 30-meter resolution. This data was produced by NASA and the U.S. Department of Energy as part of the HyspIRI mission's preparatory airborne campaign, focusing on the USDA Jornada Experimental Range.
A bioinformatics analysis identified copyback and deletion-type defective viral genomes (DVGs) in dengue virus using NGS data from patients, mosquitoes, and cell lines. The dataset, authored by Jianhai Yu and last updated in May 2026, includes results from clustering algorithms that predicted DVGs stable across different hosts, with one candidate experimentally validated. The file is a 915.5 KB DOCX document containing the analysis table.
SoySNP50K array data for mapping population parents and bulks, plus phenotype and KASP genotype data, are included in this 15.5 MB dataset. It was authored by Habib Widyawan and last updated on May 19, 2026. Supplementary files contain QTL-seq results, marker assay details, pathogen isolate information, and annotated candidate genes.
A 723.6 MB dataset from figshare supports research into the molecular mechanisms of Candida albicans's commensal-pathogenic transition. The data, published under CC-BY-4.0 by MINGYANG Ma, includes experimental results on the Mcu1 protein's role in carbon source utilization and white-to-opaque switching. It contains files in PDF, XLSX, TIF, DOCX, and ZIP formats, last updated in June 2026.
Vista-LA is a GIS database mapping over 33,000 potential methane-emitting facilities and infrastructure across the South Coast Air Basin of California. NASA compiled this dataset from public agency records between 2012 and 2017 to address gaps in urban methane inventories. The database organizes entries into thirteen distinct infrastructure maps.
22 pairs of lung cancer specimens were analyzed via proteomics to compare chemoradiotherapy-sensitive and resistant subtypes. The data, published by Weiwei Ouyang in 2026, includes a predictive model identifying IKBIP and CIT proteins as key biomarkers. Functional analysis revealed distinct pathway enrichments, linking fatty acid oxidation to sensitivity and immune activation to resistance.
22 pairs of lung cancer specimens were analyzed via proteomics profiling to compare CRT-sensitive versus CRT-resistant subtypes. The dataset includes a LASSO regression-based predictive model identifying a protein signature involving IKBIP and CIT. It was authored by Weiwei Ouyang and last updated on June 2, 2026.
Six atmospheric gases—CH4, CO, HDO, NH3, O3, and PAN—are profiled at 17 vertical levels from the surface to 0.1 hPa. This summary product contains vertical distributions and formal uncertainties from the CrIS instrument on the Suomi-NPP satellite, centered on a 3x3 degree region over Los Angeles. Data are provided in daily netCDF4 files with a spatial resolution of 14 km, processed using the MUSES optimal estimation algorithm.
Colombian Caribbean mosquito metagenomic assemblies yielded a 14,185 bp mitochondrial genome draft. The package includes the draft FASTA sequence, annotation tables, validation outputs, and phylogenetic analysis files. Richard Hoyos Lopez published this data package on figshare in May 2026.
A 2008 survey by the CERF Marine Biodiversity Hub collected co-located physical and biological data across three study areas on the southern Carnarvon Shelf. The collaboration between the Australian Institute of Marine Science and Geoscience Australia aboard RV Solander aimed to test physical parameters as surrogates for benthic biodiversity. The report describes methods and initial interpretations of data including multibeam sonar, sediment samples, and underwater video.
A 680.5 KB collection of simulation and application results comparing Bayesian regularization priors for factor analysis. The dataset, authored by Yifan Zhang and last updated in May 2026, likely contains performance metrics from studies evaluating graphical lasso, horseshoe, and spike-and-slab priors for estimating sparse latent factor correlations. It includes findings from a personality-inventory application demonstrating how partial regularization improves model interpretability and fit.
California monthly time series of respiratory-related deaths are modeled using a novel spatiotemporal procedure. The dataset contains WAIC (Watanabe-Akaike Information Criterion) values for the proposed model and alternative reference models. The data and code were authored by Jeffrey Wu and published on figshare in 2026.
Modeling results and supporting data for a spatiotemporal procedure applied to monthly time series of respiratory-related deaths across California. The dataset includes surrogate variables for air quality exposure and social deprivation indices used to learn graph and kernel structures. Data and code are available via a linked Dryad repository.
5.5 KB of out-of-sample Root Mean Square Percentage Error (RMSPE) values for a novel spatiotemporal model and reference models. The data accompanies a paper modeling monthly respiratory-related deaths across California, using social deprivation indices and air quality surrogate variables. The dataset was authored by Jeffrey Wu and last updated on 2026-05-21.
A case-control study of 261 individuals, including 166 hospitalized COVID-19 patients, conducted by Edyta Paradowska. It analyzes ten pattern-recognition receptor (PRR) gene polymorphisms and their associations with disease severity and cytokine profiles. The data was last updated on 2026-05-28.
A cytogenomic dataset for the Southern Caracara (Caracara plancus) detailing its highly derived avian karyotype. The data includes results from classical cytogenetics, fluorescence in situ hybridization (FISH), and satellitome analysis, characterizing chromosome organization and repetitive DNA distribution. It was authored by Felipe Lagreca Bitencourt and last updated on May 28, 2026.
A 942.2 KB supplementary document by Patrik Majcen, last updated on 2026-05-28, investigates the relationship between chromatin organization and satellite DNA detection in the beetle Tenebrio molitor. The study uses DNA staining, immunodetection of the H3K9me3 epigenetic mark, and fluorescence in situ hybridization (FISH) to examine male germline development. Findings show that chromatin condensation and constitutive heterochromatin organization strongly influence the detectability and spatial distribution of high- and low-copy satellite DNAs.
A bioinformatics study identifies potential hub genes for recurrent pregnancy loss associated with antiphospholipid syndrome. The analysis integrates differential expression, machine learning, and experimental validation from datasets retrieved from the Gene Expression Omnibus (GEO) database. The work was authored by Huan Zeng and last updated on June 4, 2026.
10 common differentially expressed genes were identified from RPL and APS datasets retrieved from the GEO database. The study by Huan Zeng, last updated in June 2026, used machine learning and experimental validation to identify hub genes. It suggests an imbalance of immune system-associated cells and molecules may be a common characteristic in the pathophysiological processes of both conditions.
Gene expression data analysis identifying potential hub genes for Recurrent Pregnancy Loss (RPL) associated with Antiphospholipid Syndrome (APS). The study, authored by Huan Zeng and last updated in June 2026, integrated datasets from the GEO database, differential expression analysis, and machine learning algorithms. It identified 10 common differentially expressed genes and three hub genes (NAA30, ARHGAP44, SUGT1) validated through computational and experimental methods.