Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,795 datasets
A 696.2 KB dataset by Junhyeok Jo demonstrates a YaxAB nanopore sensing approach for discriminating protein-drug complexes. The method, described in a study uploaded to figshare in April 2026, achieves near-atomic resolution by detecting subtle mass differences as small as 2.5 Daltons. Electrical recordings and molecular dynamics simulations confirm the single-molecule detection of BRD4 protein interactions with histone peptides and small-molecule drugs.
Lingyu Zeng published a genome annotation dataset for the Khapra beetle (Trogoderma granarium) on figshare in May 2026. The dataset contains protein sequences annotated using the EggNOG database and manually labeled into a GTF file. The associated genome sequences are provided in FASTA format, totaling 371.6 MB.
A 6.7 MB supplementary document from a study on postmenopausal osteoporosis in rats, authored by Ya-Qing Li and last updated in April 2026. The data includes results from RNA sequencing, molecular docking, and experimental validation of Astragaloside IV treatment in ovariectomized rats. It is shared under a CC-BY-4.0 license on figshare.
UPTC (Universidad Pedagógica y Tecnológica de Colombia) records of students who transferred internally between academic programs in the first semester of 2019. The dataset includes columns for student demographics, previous and new faculties and programs, and campus information. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Huimin Ning's dataset, last updated April 2026, describes results from a multiplex fluorescence PCR-capillary electrophoresis (MPCE) system for detecting genetic markers in hypervirulent Klebsiella pneumoniae. The assay simultaneously profiles 16 targets, including five virulence genes, two capsular serotype genes, seven resistance genes, and two internal controls. The data likely contains results from clinical validation showing strong concordance with next-generation sequencing.
A multiplex fluorescence PCR-capillary electrophoresis (MPCE) system for detecting 16 genetic markers in hypervirulent Klebsiella pneumoniae. The assay, validated against next-generation sequencing, achieved a limit of detection of 10² copies/μL and processed results in 152 minutes. The dataset, authored by Huimin Ning and last updated in April 2026, is shared under a CC-BY-4.0 license.
A 28.8 MB dataset containing a sample-by-gene count matrix and multiQC report from a gene expression analysis. The data was generated by author Frieder Hadlich and last updated on 2026-05-19. It focuses on the liver tissue of a mouse model that underwent long-term selection for high running performance.
A proteomic dataset comparing cancer stem cell-enriched spheroids to parental adherent cells from SW620 and HCT-116 colorectal cancer lines. The data was generated by Ola J. Hussein using mass spectrometry-based label-free shotgun proteomics and was last updated on 2026-04-20. It includes results of differential protein abundance analysis and pathway predictions via Ingenuity Pathway Analysis.
A 2.1 MB supplementary file contains proteomic data comparing cancer stem cell-enriched spheroids from SW620 and HCT-116 colorectal cancer cell lines to their parental adherent cells. The data was generated using mass spectrometry-based label-free shotgun proteomics and analyzed with Ingenuity Pathway Analysis. The dataset, authored by Ola J. Hussein and last updated on 2026-04-20, is shared under a CC-BY-4.0 license.
A proteomic dataset from 2026 by Ola J. Hussein, comparing colorectal cancer stem cells (CSCs) from two cell lines (SW620 and HCT-116) with their parental adherent cells. The 101.2 KB Excel file contains results from mass spectrometry-based label-free shotgun proteomics and subsequent pathway analysis. It highlights differentially abundant proteins and dysregulated pathways related to stemness, metabolism, and immune modulation.
BUSCO assessment tables for chromosome-level Coleoptera (beetle) genome assemblies used in comparative genomics analyses. The 54.3 MB repository contains TSV files generated by Dwayne Tally, last updated in May 2026.
Mouse renal transcriptome sequencing data from a study characterizing IgA nephropathy (IgAN). The dataset includes 8 IgAN samples and 4 control samples from mice sacrificed at 20 weeks of age. RNA libraries were sequenced by OE Biotech, Inc., Shanghai, China, and the data was uploaded by an author named Qin.
53.7 KB of integrated experimental data on rice immunity against Rhizoctonia solani. The dataset, authored by Ge-Ning Song and last updated in April 2026, includes results from antioxidant enzyme assays, phytohormone profiling, transcriptomics, and metabolomics to investigate the mechanisms of Validamycin A-induced resistance.
Lauren Dineen's dataset provides tRNA gene sequences for 1154 Saccharomycotina yeast species, integrated with annotations for tRNA modification enzymes. It includes comparative genomic analysis and Nano-tRNAseq modification profiles for three focal species: Saccharomyces cerevisiae, Hanseniaspora uvarum, and Yarrowia lipolytica. This work offers a multi-species view of tRNA sequence conservation and enzyme repertoire variation across a eukaryotic subphylum.
Six DNA extraction methods were compared for recovering diatom sedimentary ancient DNA from Antarctic marine sediments. The study used samples from two sites, U1536C in the Scotia Sea and KC02 near the Totten Glacier, to assess methods based on DNA recovery, fragment length, and taxonomic diversity. This dataset, hosted by the Australian Ocean Data Network, was last updated in April 2026.
A measure of Economic Fairness from the Greater London Authority, this dataset tracks the percentage of families with less than £1,500 in savings. It was last updated on 2026-06-24.
A collection of phage, plasmid, and phage-plasmid genomes used to train a random forest classifier. The dataset was authored by James Mullet and is hosted on figshare under a CC-BY-4.0 license. It was last updated on May 26, 2026.
Metabolomics data from wheat seeds at the wax-ripening stage, analyzed using GC-MS and electronic nose technology. The dataset likely contains flavor compound levels linked to the feeding preferences of house sparrows and pigeons. It was authored by Siyi Wang and last updated on 2026-04-24.
A research document analyzing the transcription factor OSR2 across multiple cancer types using data from TCGA and GEO. The analysis evaluates OSR2 expression, prognosis, and correlations with tumor mutational burden, microsatellite instability, and immune infiltration. The document, authored by Shijie Liu and last updated in April 2026, includes functional validation in lung adenocarcinoma cells.
A 2026 research article by Shijie Liu analyzes the transcription factor OSR2 across multiple cancers. The study integrates data from TCGA and GEO to evaluate OSR2 expression patterns, prognostic significance, and correlations with tumor mutational burden, microsatellite instability, and immune infiltration. Findings highlight OSR2's role in the tumor microenvironment and its potential as a prognostic biomarker and therapeutic target.