DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,822 datasets
A 102.1 KB dataset uploaded on 2026-04-13 by Cunlin Zhang under a CC-BY-4.0 license. It contains results from a study where a PCOS model was induced in rats using letrozole, followed by treatment with sheep placental extract. The data likely includes measurements of inflammatory cytokines, sex steroid hormones, metabolic markers, transcriptomics, and 16S rRNA sequencing results.
A research article analyzing gene expression data from the GEO database to explore immune modulation in osteoarthritis. The study identified 1,171 upregulated differentially expressed genes and three hub genes—PTPRC, CX3CR1, and ITGB2—using bioinformatics methods. The work was authored by Zhengyao Zhang and last updated in April 2026.
Three key genes—EPYC, MAGED1, and LAP3—were identified as potential diagnostic markers for rheumatoid arthritis using integrated bioinformatics and machine learning. The dataset, authored by Zhibin Zhang and last updated in April 2026, contains results from the analysis of public GEO transcriptomic datasets. It includes findings from feature selection, immune cell infiltration, and in vitro validation using TNF-α–stimulated HFLS-RA cells.
A ceRNA regulatory network of 10 circRNAs, 41 miRNAs, and 145 mRNAs was constructed from GEO datasets of Mycobacterium tuberculosis-infected dendritic cells. Four core genes—STAT1, BCL2, TRAF6, and IL1A—were identified and enriched in tuberculosis-related pathways. The dataset, created by Xiaohong Sun and last updated in April 2026, includes validated diagnostic axes with ROC AUC >0.7.
A 30.4 KB Excel file published on figshare in April 2026 by Xiaohong Sun. It contains a constructed competing endogenous RNA (ceRNA) regulatory network from dendritic cells infected with Mycobacterium tuberculosis strains H37Ra and BCG. The network includes 10 circRNAs, 41 miRNAs, and 145 mRNAs, with four core genes and two validated regulatory axes identified.
A 174.5 KB Excel dataset from a 2026 figshare upload by Xiaohong Sun. It contains a constructed competing endogenous RNA (ceRNA) regulatory network from dendritic cells infected with Mycobacterium tuberculosis strains H37Ra and BCG. The network includes 10 circRNAs, 41 miRNAs, and 145 mRNAs, with validation results for specific regulatory axes.
An analysis dataset from a study integrating transcriptomic stratification and two-sample Mendelian randomization to identify prognostic biomarkers in esophageal cancer. The data likely contains results from RNA-seq and clinical data for 184 tumors from TCGA-ESCA and 119 tumors from an external cohort, identifying ADORA2B and SAPCD2 as candidate genes. The dataset was authored by Bihan Xia and last updated on April 13, 2026.
Data from a 2026 study integrating transcriptomic and clinical data from 184 tumors and 13 normal samples in TCGA-ESCA, plus an external cohort of 119 tumors and 119 normal samples. The dataset, authored by Bihan Xia and shared on figshare, supports the identification of prognostic biomarkers ADORA2B and SAPCD2 in esophageal cancer using histone chaperone-based stratification and two-sample Mendelian randomization.
814.8 MB of processed bulk and single-cell RNA sequencing data from tumor-associated macrophages and cancer cells, generated by Zemin Zhang and last updated in April 2026. The data supports research into the SPP1-SOCS1 pathway's role in shaping an immunosuppressive tumor microenvironment and resistance to immune checkpoint blockade therapy.
178 metagenome-assembled genomes (MAGs) were recovered from sediment in a subterranean estuary in Houmen, Shanwei, China. Annotations for these genomes were generated using the Prokka tool, providing gene predictions, protein sequences, and nucleotide sequences. The dataset was authored by Zhang Baoshan and last updated in May 2026.
Libby et al.'s supplemental data provides supporting figures, tables, and files for a study on the functional diversification of the QueC protein superfamily (PF06508). The 304.3 MB collection includes CSV files of proteins analyzed in Sequence Similarity Networks and genomic neighborhoods, a ChimeraX file of protein structures, and PDFs. Author Geoffrey Hutinet deposited the data on figshare in April 2026.
A single case report documents a 6-month-old boy diagnosed with four distinct tumors, including Wilms tumor and pleuropulmonary blastoma, linked to a mosaic DICER1 RNase IIIb hotspot mutation. The 756.9 KB document details the patient's surgical and chemotherapy treatments and a 27-month follow-up. This clinical text dataset was authored by Peiyi Yang and shared under a CC-BY-4.0 license on figshare.
The Carnarvon Shelf Survey (SOL4769) collected underwater video footage and still images from 122 stations across water depths of 13-125 meters. The survey was conducted by Geoscience Australia and the Australian Institute of Marine Science aboard the R.V. Solander between 12 August and 15 September 2008. Its objective was to gather co-located data to test physical parameters as surrogates for benthic biodiversity patterns.
Mapping of buildings and other constructions of heritage interest in the urban planning code (CDU) on the territory of Laval. The dataset is provided by the Government and Municipalities of Québec and was last updated on 2026-04-22. It is available under a CC-BY-4.0 license in multiple geospatial formats.
Single-cell RNA sequencing data from a patient with chronic chromoblastomycosis reveals an expanded population of exhausted CD4+ T cells. The dataset, uploaded by Kexin Lei on April 17, 2026, includes bioinformatics analyses and multiplex immunofluorescence validation in a mouse model. Findings suggest a differentiation trajectory from naive to exhausted T cells driven by monocytes/macrophages.
Honesty-Scratchpad-600x is a dataset containing 600 high-quality synthetic examples. It is designed to train language models, particularly smaller ones with 1B to 8B parameters, to become more truthful, recognize knowledge boundaries, and use an internal verification scratchpad before answering. The dataset was created by Aadeshisdoingsomething and was last updated on June 3, 2026.
Puntos Vive Digital Plus en el Valle del Cauca lists digital access points administered by the government of Valle del Cauca, Colombia. The dataset is provided by www.datos.gov.co and was last updated on 2026-05-18. It includes columns for Subregión, Municipio, Dirección, and Año.
Capricorn & Bunker Reefs, southern Great Barrier Reef, are the subject of this field excursion guide for the 12th International Sedimentological Congress. The content likely contains detailed geological and sedimentological observations from a scientific conference. Metadata is minimal, but its cross-platform presence on a national data portal suggests it is a recognized scientific reference.
4.0 GB of raw data supporting a paper on intramembrane chaperone mechanisms, uploaded by Shi Ho Kim. The data includes molecular dynamics trajectories, structural files, and analysis scripts. The dataset was last updated on 2026-05-12.
Polygon data represents Commission decisions modifying agricultural zone boundaries in Quebec. The dataset tracks inclusions that add land to the agricultural zone and exclusions that remove it, sourced from topographic data, cadastral compilations, orthophotos, and plans. Decisions are registered over time, with later file numbers superseding earlier overlapping areas.