Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
22,526 datasets
The first complete mitochondrial genome of the Lophophora williamsii (peyote) cactus, native to the Chihuahuan Desert. Xingliang Liu assembled and annotated the 2,422,778 bp genome using PacBio HiFi long-read sequencing, with analyses including repeat identification and phylogenetic reconstruction. The dataset was last updated on June 2, 2026.
A 3.7 MB DOCX file uploaded on 2026-05-20 contains the complete genome sequence of the probiotic strain Lactobacillus pentosus HP-B1718. The genome is 3,257,491 bp with a GC content of 44.58% and includes a plasmid, 3,081 CDS, and various RNA genes. The study identifies the Aryl-phospho-beta-D-glucosidase (ApgA) enzyme responsible for converting liquiritin to liquiritigenin, with a reported conversion rate of 99% under optimized conditions.
A genomic dataset presents results from a de-correlated composite of multiple signals (DCMS) analysis on the X chromosome in three indigenous Indian sheep breeds. The analysis identified a significant breed-specific genomic region in Changthangi sheep, annotated candidate genes and quantitative trait loci, and prioritized genes via protein-protein interaction networks. Saptha Nath authored the dataset, which was last updated on 2026-05-18.
Sapna Nath's research dataset from 2026 contains genomic analysis results for three indigenous Indian sheep breeds. It includes data from 79 sheep genotyped with the Illumina OvineSNP50 BeadChip, applying a de-correlated composite of multiple signals approach. The analysis identified a candidate selection region on the X chromosome specific to the Changthangi breed.
ArchaeaHQ is a systematically curated reference database of 21,644 archaeal genomes compiled from NCBI assemblies. All genomes pass a standardized quality control requiring ≥70% completeness and ≤10% contamination, with 44.2% achieving ≥90% completeness. The database was created by Pedro Leão and last updated in May 2026.
A 2026 study by Jieqiong Zeng integrates single-cell and bulk RNA sequencing data to analyze immune cell composition in Kawasaki disease. The dataset likely contains gene expression profiles from peripheral blood mononuclear cells of pediatric patients and a corresponding mouse model. Findings suggest HMOX1 upregulation in mononuclear phagocytes correlates with inflammatory pathway activation.
A deployment ledger codifying Module 89 of the Davis Logic V2 architecture, authored by Jamie Davis. The module is a hardware-clamped, lightweight frequency estimation engine for high-rate telemetry streams. It was last updated on May 31, 2026, and is licensed under Creative Commons Attribution 4.0 International.
Module 88 of the Davis Logic V2 architecture introduces a hardware-clamped diagnostic monitor for telemetry streams. The module continuously calculates the active dynamic amplitude range over fixed intervals using a zero-heap, zero-copy architecture. Jamie Davis authored this open-access asset, last updated on May 31, 2026.
Linear regression results for the mean scaled entropy model from a simulation study. The model explains 68.2% of the variance in mean scaled entropy across 90,000 simulated forests. The dataset was authored by Cyril Geismar and last updated on June 1, 2026.
Haplotype-resolved genome assemblies for two table grape cultivars, 'Shine Muscat' and 'Muscat Hamburg', elucidating berry skin color mechanisms. The dataset includes over 50,000 cultivar-whole genes and millions of sequence variations detected between haplotypes. It was authored by Wen Liu and last updated on 2026-06-02.
Ebony Argaez published raw data for a study on RNA interference effects in the milkweed bug Oncopeltus fasciatus. The 136.5 KB dataset includes results from injection and feeding experiments measuring developmental defects, survival, and fertility. It was last updated on 2026-05-29.
A retrospective cohort study of 800 low-risk parturients from June 2024 to February 2026, assessing the association of a spinal analgesia-based management strategy with labor progression and maternal/neonatal outcomes. The dataset includes 610 patients in a non-SA group and 190 in an SA group, with primary outcomes of prolonged second-stage labor, emergency cesarean delivery, and umbilical arterial pH. The study was conducted by Shunsuke Maruyama and published on figshare.
19 participants from the 'HIIT or MISS UK' trial provided qualitative interview data on their experiences with High Intensity Interval Training (HIIT) and Moderate Intensity Steady State (MISS) exercise. The dataset was created by Charlotte Williams and last updated on 2026-05-29. It contains participant characteristics and likely includes qualitative themes from semi-structured interviews conducted via VoIP.
Data from a secondary analysis of the prospective Program to Improve Mobility in Aging (PRIMA) cohort trial, involving 213 older adult participants. It was authored by Daniel S. Rubin and last updated on 2026-05-29. The dataset contains measurements of walking cadence during usual-pace gait speed and 6-minute walk tests, used to evaluate the identification of responders to mobility interventions.
Data from 213 participants in the Program to Improve Mobility in Aging (PRIMA) cohort trial, used to evaluate whether changes in walking cadence can identify responders to mobility interventions. Logistic regression models assessed the ability of cadence changes to predict clinically important improvements in gait speed and 6-minute walk test distance. The dataset was authored by Daniel S. Rubin and last updated on 2026-05-29.
213 participants from the prospective Program to Improve Mobility in Aging (PRIMA) cohort trial had their walking cadence measured during usual-pace gait speed testing and the 6-minute walk test (6MWT). The dataset, authored by Daniel S. Rubin and shared under a CC-BY-4.0 license, was used to evaluate whether changes in cadence could identify responders to a walking intervention. The data was last updated on 2026-05-29.
PPARG was identified as a diagnostic biomarker for sepsis with an AUC of 0.994. This dataset contains whole-blood transcriptomic data from two cohorts, GSE236713 and GSE65682, used to discover and validate biomarkers linked to CD14/NF-κB signaling. Yingyi Ji published the data on figshare in 2026.
Southern California retail chicken products were purchased from 2017 to 2021 and cultured for Escherichia coli. Vanessa Quinlivan measured susceptibility to 19 antimicrobials using the disk diffusion method to assess the impact of California Senate Bill 27, which restricted antimicrobial use in food-animal production.
dados_elite.csv contains 991,186 bibliographic records for articles published between 2010 and 2024 in A1 and A2 ranked journals in Communication, Information Science, and Museology. The dataset was extracted from the OpenAlex API on January 23, 2026, by Skrol Salustiano and is preserved in a raw state without author or affiliation normalization. It includes variables such as journal name, publisher, DOI, authors, citations, and open access status.
23% of 4,112 initially arthritis-free participants developed the condition over a median 14.2-year follow-up. This dataset likely contains prospective findings from the English Longitudinal Study of Ageing (ELSA), analyzing associations between six anthropometric indices and incident arthritis. The data was published by Yanqi Du under a CC-BY-4.0 license in 2026.