Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,861 datasets
OSGeo Nepal provides building outlines across Nepal, last updated on 2026-05-08. This dataset combines crowd-sourced OpenStreetMap data with machine-learning building detections from Microsoft, Google, and Esri via the Overture Maps project. The data is available in common geospatial file formats.
ISGM-SGC is a standardized classification tool for seabed geomorphology developed by the International Seabed Geomorphology Mapping Working Group. It assigns five hierarchical classification levels and 16 additional attributes to describe seabed features. The tool was released as an Esri ArcGIS Pro Python tool and is maintained by geoscience agencies from the UK, Norway, Ireland, and Australia.
A sediment drift deposit over 30 metres thick was discovered on the East Antarctic continental shelf in an 850 metre deep glacial trough off George Vth Land. Radiocarbon dating indicates a period of rapid deposition occurred in the mid-Holocene, between about 3,000 and 5,000 years before present. The dataset is hosted by the Australian Ocean Data Network.
Flavonoid compounds brazilin and its acetylated derivative significantly reduced cell viability and proliferation in triple-negative MDA-MB-231 breast cancer cells. The dataset likely contains results from cell viability assays, oxidative stress measurements, mitochondrial integrity tests, and transcriptomic RNA sequencing analysis. Miriam Zuñiga-Eulogio published this 1.1 MB Excel file on figshare in April 2026.
10,611 high-quality DArTseq SNP markers assess genome-wide diversity and trait-driven population structure among indigenous rice genotypes from Northeast India. The dataset was authored by Thanga Suja Srinivasan and last updated on April 21, 2026. It is available in CSV and VCF formats and has a size of 378.4 KB.
A prospective longitudinal study of 227 women assessed the effect of disgust sensitivity on breastfeeding. Participants completed the Disgust Scale-Revised (DS-R) and Three Domains of Disgust Scale (TDDS) during the third trimester and again postpartum, and provided breastfeeding information at hospital discharge and later. The dataset, authored by Jana Benešová and last updated in April 2026, includes results from ordinal regression analysis adjusted for maternal and infant factors.
LasB elastase cleaves junctional protein E-cadherin and alters Claudin-4 localization in lung epithelial models. Roya Shafiei published this dataset on figshare in April 2026, detailing transcriptomic changes and cytokine reductions induced by LasB. The 3.3 MB ZIP file likely contains experimental data supporting antivirulence therapy research.
SoilMicrobeDB is a Kraken2 genome database hosted on AWS. It likely contains a collection of high-quality genomes for soil organisms, including uncultured and fungal species. The data is provided by Zoey Werbin and has no restrictions on use.
NOAA's Unified Forecast System (UFS) Global Ensemble Forecast System version 13 (GEFSv13) Replay dataset provides retrospective global forecasts for atmospheric and oceanic variables like temperature, humidity, winds, salinity, and currents. The dataset was generated by replaying the coupled UFS model against ERA5 atmospheric and ORAS5 ocean reanalyses, with land and sea-ice data assimilated from observational sources. It covers January 1994 to October 2023 at a nominal ¼ degree horizontal resolution and is hosted on AWS and GCP cloud services.
Chromosome-level genome assemblies and annotation files for Drosophila melanogaster from a study on impaired piRNA-mediated transposon silencing. The data includes files for both an experimental Hen1cas9 group and a control group. The dataset was authored by JI Yonghao and last updated on June 4, 2026.
Data.mo.gov provides a listing of approved training events. The dataset includes columns for TITLE, SPONSOR, LOCATION, START DATE, END DATE, CLOCKHRS, and CONTACT INFO. It was last updated on 2026-05-19.
Additional file 1 accompanies a study on mitochondrial introgression in cryptic yellow fever vectors. Published on figshare by Filipe Vieira Santos de Abreu in May 2026, this 37.6 KB XLSX file likely contains supporting genetic data for the research. The columns suggest it includes sequence or analysis results for Haemagogus capricornii and Hg. janthinomys mosquitoes.
Transport Canada provides geospatial data identifying designated alternate ballast water exchange areas for vessels entering Canadian jurisdiction. The dataset covers four key maritime regions: the Gulf of St. Lawrence, Atlantic Canada-Western Canada, the Canadian Eastern Arctic, and the Canadian Western Arctic. It is intended for regulatory illustration, not for navigation or legal purposes.
Lord Howe Island in the southwestern Pacific Ocean contains stratigraphic data on skeletal carbonate eolianite and beach calcarenite. The data likely includes lithostratigraphic divisions, cement types, karst features, and a chronology constructed from U/Th, TL, AMS 14C, and amino acid racemization dating. The dataset is hosted by the Australian Ocean Data Network.
2764 adult twins from the Dutch Twin Registry were analyzed for associations between neighborhood obesogenic characteristics and BMI. The study used the OBCT-index, integrating food-environment healthiness, walkability, drivability, and sports facilities within a 1,000-meter buffer. Heritability estimates were 9% for the OBCT-index and 75% for BMI.
632 Japanese individuals' whole genome sequencing data analyzed for pharmacogenomics insights. The dataset compares allele frequencies for three drug classes—SSRIs, opioid analgesics, and statins—against broader East Asian population data. Charles W. Crawford published this secondary analysis pipeline on figshare in March 2026.
A 5.5 KB Excel file containing model performance metrics, including area under the precision-recall curve (AUPRC). The dataset was authored by Yik-Shun Lin and last updated on May 5, 2026. It reports results from 5-fold cross-validation, prospective testing, and external validation.
59.7 MB of statistical analysis results from single-molecule sequencing of Lilium leichtlinii subsp. maximowiczii. Jingxing Xu published this table on figshare in April 2026 under a CC-BY-4.0 license. It quantifies the regulatory effects of a 6-day white light treatment on gene transcription levels compared to a control group.
Supplementary gene annotation tables for the boll weevil (Anthonomus grandis) reference genome, derived from the study 'Pangenomics Links Boll Weevil Divergence with Ancient Mesoamerican Cotton Cultivation'. The data includes gene ontology terms and coordinates from selectively swept regions and structural variations between two subspecies, identified using RAiSD and PCAdapt programs. The dataset is provided by the Department of Agriculture and was last updated in March 2026.
2001-2017 phytoplankton cell abundance data collected from automated SmartBuoy moorings at five locations in UK waters. Samples were preserved with Lugols Iodine and analyzed using the Utermohl method on inverted microscopes. The dataset was provided by the Government Digital Service.