Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,808 datasets
Colombia's Amazonas Governorate maintains a catalog of its published information. The dataset likely contains metadata describing content categories, responsible parties, and publication formats. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026.
A five-gene prognostic model for epithelial ovarian cancer constructed from transcriptome and clinical data from TCGA, GTEx, and GEO databases. The model's validity was confirmed using patient samples from TCGA and GEO, and it demonstrated predictive accuracy superior to traditional clinical factors alone. Author Tingting Yu published this research document under a CC-BY-4.0 license on figshare in April 2026.
Automatically updated records of resolutions made by the City of Ballarat council, as provided in past meeting minutes. The dataset is intended to inform the public of past council decisions and is available in multiple formats including CSV, JSON, and geospatial files. The information was collected as the resolutions were made.
The International Astronomical Union's official record of named topographic and albedo features on planets, satellites, and some ring systems. This gazetteer contains detailed information for all IAU-approved names from its founding in 1919 through the present time. The dataset is maintained by the National Aeronautics and Space Administration.
Historical cadaster municipality of Ede_1832-1967 is a register of all registered parcels in the municipality of Ede, including Gelders Veenendaal, from 1832 to 1967. The dataset, provided by the Ministerie van Binnenlandse Zaken en Koninkrijksrelaties, contains information such as parcel identification, type, surface area, and changes over time. It is available in CSV format under a CC0-1.0 license.
FALCON-VLA provides camera parameters for the CALVIN-3D dataset, supporting research on grounding Vision-Language-Action models in spatial foundation priors. The dataset was last updated on 2026-05-25. It accompanies a paper accepted at ICLR 2026.
Additional file 1 from a study investigating the role of B chromosomes in plant invasiveness. The dataset, published on figshare by Cui Wang under a CC-BY-4.0 license, comprises 21 supplementary figures in an XLSX file. It is a 509.1 KB supplementary resource for the associated research paper.
A PDF protocol describes an in planta genome editing method for tomatoes, developed by Misaki Kobayashi. The study combines transient expression of Cas9, guide RNAs, and developmental regulators via agroinfiltration to avoid stable transfection and tissue culture. Chimeric mutants were obtained with an efficiency of 11.7%, and most observed mutations were single base substitutions.
Facility location and identification data from the EPA's Facility Registry System for sites linked to Facility Response Plans. These substantial harm facilities are subject to federal oil spill prevention and response requirements. The dataset is managed by the U.S. Environmental Protection Agency and was last updated in early April 2026.
A panel of isogenic Borrelia burgdorferi reporter strains expressing seven major OspA serotypes was developed to evaluate monoclonal antibodies. Structural analysis of Fabs in complex with OspA ST1 identified overlapping epitopes and a key susceptibility determinant, Lys-107. This 5.5 KB Excel dataset by Graham G. Willsey, last updated in April 2026, supports structure-based vaccine design.
A 5.5 KB dataset published on figshare by Graham G. Willsey, last updated on 2026-04-21. It contains characteristics of monoclonal antibodies (mAbs) targeting Outer surface protein A (OspA) of Borrelia burgdorferi, used in a study evaluating cross-serotype functionality. The data likely supports structural analysis for the design of a broadly protective Lyme disease vaccine.
Proteomics results from a study on essential protein kinases in the pathogenic yeast Candida albicans. The dataset, authored by Bernardo Ramírez-Zavala and last updated in April 2026, likely contains protein identification and quantification data from co-immunoprecipitation and liquid chromatography-mass spectrometry experiments. It is a 2.4 MB XLSX file shared under a CC-BY-4.0 license.
Xiaoyan Li's dataset from the 2021 China Social Survey (CSS) analyzes youth perceptions of air pollution and evaluations of governmental environmental performance. The 5.5 KB XLS file contains results from an ordered logistic model and mediation analysis using the Karlson-Holm-Breen (KHB) method. The dataset was last updated on 2026-04 21.
2021 China Social Survey data analyzes youth perceptions of air pollution and evaluations of government environmental performance. The 5.5 KB Excel file contains results used in an ordered logistic model and mediation analysis. Author Xiaoyan Li published the dataset on figshare in April 2026 under a CC-BY-4.0 license.
5.5 KB of analysis results from the 2021 China Social Survey (CSS) investigating the relationship between youth perception of air pollution and evaluations of governmental environmental performance. The dataset, authored by Xiaoyan Li and last updated in April 2026, contains ordered logistic model results, mediation analysis, and relative contribution decomposition using the Karlson-Holm-Breen (KHB) method.
5.5 KB of regression analysis results from the 2021 China Social Survey (CSS). The data, authored by Xiaoyan Li, was last updated on April 21, 2026, and examines the relationship between youth perception of air pollution and evaluations of governmental environmental performance.
Eighteen supplementary tables from a study investigating the induction of antimicrobial resistance in Escherichia coli. The data includes minimum inhibitory concentration (MIC) values for colistin, ceftazidime, and gentamicin, resistance maintenance assays, and phenotypic evaluation of mcr-1 gene expression after exposure to subinhibitory colistin doses. The dataset was authored by Thalita Hellen Nunes Lima and last updated on April 21, 2026.
A study of DNA barcoding identification challenges using a group of Quercus herbivores (moths) in Europe as a model system. The research, authored by Álvaro Gaytán from Stockholm University, found a low proportion of barcodes from southern Europe in the Barcoding of Life Data Systems (BOLD). This geographical bias complicates species identification in genetically diverse southern regions, where GMYC models suggest the presence of cryptic species.
Late Hauterivian-Barremian stratigraphic and paleontological data from the oldest Cretaceous sequence in the Giralia Anticline, Carnarvon Basin. The dataset, sourced from the Australian Ocean Data Network, describes outcrop and well data, including the Birdrong Sandstone and Muderong Shale formations. It was last updated on 2026-04-28.
A guide authored by Yannick Wurm of Queen Mary University of London, focusing on the risks and responsibilities in modern genomics research. It highlights the transition to data-intensive biology, where single students can generate data once costing millions, and discusses common pitfalls like technical artifacts, hidden biases, and pseudoreplication. The description references the cautionary case of researcher Geoffrey Chang to illustrate the high cost of invisible analytical errors.