Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,249 datasets
A collection of 20 Excel tables from figshare, authored by Alexandros G. Sotiropoulos and last updated in April 2026. The data details genomic analyses of Blumeria graminis fungal isolates, including pangenome statistics, assembly quality, transposable elements, and mating type genes.
219 complete genomes of the sexually transmitted pathogen Neisseria gonorrhoeae were assembled using long-read sequencing to resolve the multiple copies of opacity-associated (Opa) genes. The dataset, created by QinQin Yu and last updated in May 2026, reveals that Opa genes are on average 74 times more diverse than the rest of the genome. Each genome contains an average of 7 distinct opa alleles across 9–12 loci, with fewer genes in frame than expected by chance.
A 2026 study by David A. Strand investigates environmental DNA (eDNA) monitoring for the bacterial pathogen Phocoenobacter atlanticus in Atlantic salmon aquaculture in Norway. The dataset includes results from storage condition trials and three field trials assessing optimal sampling locations and infection status screening. The data is stored in an XLSX file and shared under a CC-BY-4.0 license.
A 15.3 KB Excel file contains the modeling inputs for a Markov model comparing HPV self-sampling and cytology screening strategies in Mexico. Authored by Olga Georgina Martínez Montañez and uploaded to figshare in May 2026, the model simulated a cohort of 100,000 women over 10 years to evaluate cost and effectiveness. The analysis found HPV self-sampling prevented more cancer cases at a lower cost per case.
181 pregnant women were provided HBV DNA testing in a two-phase study across 10 hospitals in nine regions of Uganda. The dataset, created by Linda Kisaakye Nabitaka and last updated in May 2026, contains quantitative data extracted from facility registers and qualitative feedback from study teams. It assesses the operational feasibility, accessibility, and cost of implementing point-of-care Xpert HBV DNA testing in antenatal care settings.
A 3.5 MB research artifact by Xinyu Li, last updated in May 2026, proposing a novel matrix factor model where factor loadings evolve as functions of observable covariates. The associated files include code and documentation for a nonparametric estimation procedure combining kernel smoothing with PCA. Empirical applications demonstrate the method's performance on international trade flows and portfolio returns.
A high-quality, chromosome-scale, haplotype-resolved genome assembly for the grape variety Vitis sp. 'Zhuosexiang' comprises two fully phased haplotypes totaling 520.9 Mb and 518.5 Mb. The dataset includes gene annotations for over 34,000 protein-coding genes per haplype, functional assignments, and identified structural variations. It was authored by Ji and last updated on figshare in April 2026.
Boundary data for 55 Regional Development Australia committees, established by the Australian Government to coordinate regional growth. The dataset was created and is maintained by the Department of Infrastructure, Transport, Regional Development, Communications, Sport and the Arts, originally released in September 2011. It is built from the Australian Bureau of Statistics' Local Government Area boundaries for 2011.
Boundary data for 55 Regional Development Australia committees, established by the Australian Government to coordinate regional growth. The dataset was created and is maintained by the Department of Infrastructure, Transport, Regional Development, Communications, Sport and the Arts, originally released in September 2011. It is built from the Australian Bureau of Statistics' Local Government Area boundaries for 2011.
World Bank data provides a detailed picture of debt stocks and flows for developing countries. The Quarterly External Debt Statistics enable a more complete understanding of global financial flows for high-income countries and emerging markets. Data are gathered from national statistical organizations, central banks, and major multilateral institutions.
A replication package for a study examining the conditions under which AI reduces corporate greenwashing. The dataset, Stata code, and logs are provided by author Zhaolu Tang to reproduce statistical analyses. The files total 64.4 MB and were last updated on May 25, 2026.
Great Barrier Reef data demonstrates a method for mapping surficial coral reef facies using entropy-ratio maps. The ternary classification uses detritus, framework encrustation, and pavement as end-members, subdivided by their degree of mixing. The classification is sensitive to all reef environments and can be superimposed on other classifications.
25 metagenome-assembled genomes (MAGs) of anaerobic methanotrophic archaea (ANME) recovered from sediment cores at an erupting cold seep in the South China Sea. The dataset includes genomic data revealing reverse methanogenesis pathways and accessory metabolic traits. Author Huan Zhang published this dataset under a CC-BY-4.0 license on figshare in May 2026.
World Bank data on China's external debt stocks and flows. The dataset likely contains quarterly external debt statistics for high-income and emerging markets, as well as public sector debt data for central, state, and local government. It was last updated on 2026-04-27.
Quarterly external debt statistics for Canada, sourced from the World Bank's data portal. The data likely includes debt stocks and flows for high-income countries and emerging markets, with further detail on public sector valuation methods and debt instruments. The dataset was last updated on 2026-04-27.
Deakin University Marine Mapping lab collected this bathymetric survey over two days in January 2018. The data was acquired using a Kongsberg EM2040c sonar system aboard the Motor Vessel Yolla. It was created as part of a Parks Victoria project to map all marine parks within Victorian state waters.
Brazil's external debt statistics from the World Bank's data portal, focusing on debt stocks and flows. The data likely contains quarterly external debt statistics for high-income and emerging markets, as well as public sector debt details. It was last updated on 2026-04-27 and is provided under a CC-BY-4.0 license.
Scottish Water provides data on non-reported overflow events from 2022 to 2025. The dataset includes discrete event details like start/stop times and durations, plus annual summaries at the measurement point level. It was published on 11/05/26 and last updated on 2026-05 14.
A 2026 study by Ye, Zhengyang from Borealis Harvested Dataverse uses landscape genomics to analyze climate adaptation in the interior spruce hybrid complex. The dataset includes 41,253 SNPs genotyped from 1,692 individuals across 252 natural populations in western Canada. It delineates seed zones and predicts genomic offset to future climate scenarios, validated by common-garden experiments.
Geoscience Australia provides metadata from two internal catalogues containing national satellite image collections, geological and topographical maps, and boundary datasets. The metadata was mapped to the ISO19115 standard and exposed via a GeoNetwork instance. This work enables external access to geoscientific information for decision-making.