Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,857 datasets
SecKnowledge 2.0 is an evaluation benchmark suite from the research paper 'Toward Cybersecurity-Expert Small Language Models' (ICML 2026). The dataset, released by author cyber-pal-security, assesses large language models on core cybersecurity capabilities not covered by existing public benchmarks. It was last updated on May 20, 2026.
Gold-standard qPCR-based measurements of blood mitochondrial DNA copy number from participants of the Bogalusa Heart Study. The dataset provides raw, QC-passed values intended for research into early biomarkers for Alzheimer's disease and cognitive decline. Author Yang Pan published this data under a CC-BY-4.0 license, with a last update recorded for April 20, 2026.
Global Affairs Canada provides information on the Respectful Maternity Care project, which operates in challenging maternal health settings in South Sudan and the Democratic Republic of the Congo. The dataset, last updated in May 2026, describes a project focused on saving lives and ensuring safe, dignified care by placing respect at the heart of every birth. The data is published under the OGL-CA-2.0 license.
A list of opa genes from Neisseria gonorrhoeae that did not align within locally collinear blocks with the FA1090 reference genome. The dataset was authored by QinQin Yu and last updated on May 11, 2026. It is a small dataset, 9.9 KB in size, shared under a CC-BY-4.0 license.
26.2 KB of data detailing NG-STAR types, associated antibiotic resistance markers, and available minimum inhibitory concentrations for Neisseria gonorrhoeae genomes used in a specific study. The dataset was authored by QinQin Yu and last updated on May 11, 2026. It is shared under a CC-BY-4.0 license on figshare.
36.1 KB of anonymized data in XLSX format, uploaded by Derek Daigle to figshare. The dataset is described as the minimal set necessary to replicate specific research findings. It was last updated on May 11, 2026.
11.4 KB of data on nucleotide differences in plasmid and virus-derived RNA sequences, shared by Ichorio Misumi on figshare. The dataset was last updated on May 11, 2026.
9.5 KB of data on internal causal attributions cited for favorable and unfavorable feedback for selfies and elsies. The dataset was authored by Malinda Desjarlais and last updated on May 11, 2026. It is available in XLS format under a CC-BY-4.0 license.
A dataset of 180 records examining factors associated with child underweight status. The data is provided by Sneha Deepak Mallya on figshare and was last updated on May 11, 2026. It is stored in an XLS file format and is licensed under CC-BY-4.0.
180 records explore the relationship between child wasting and sociodemographic, maternal, and child characteristics. The dataset is hosted on figshare under a CC-BY-4.0 license and was last updated in May 2026. Author Sneha Deepak Mallya contributed this 33.5 KB Excel file.
A dataset of 180 records explores the relationship between child stunting and sociodemographic, maternal, and child characteristics. It was authored by Sneha Deepak Mallya and is available under a CC-BY-4.0 license. The dataset was last updated on May 11, 2026.
RNA-seq data from retinal tissues of 3 streptozotocin-induced type-1 diabetic retinopathy rats and 3 normal controls. Ming Yang published this 1.1 GB dataset on figshare in 2026, containing small RNA and whole-transcriptome sequencing files. These data support the construction of a circRNA-miRNA-mRNA ceRNA regulatory network.
385 drugs are analyzed for correlations between baseline gene expression and z-score-corrected models. The dataset contains Delta Pearson and weighted Pearson correlation metrics, authored by Ginte Kutkaite and last updated in May 2026. It is a small dataset of 34.1 KB, shared under a CC-BY-4.0 license on figshare.
A model built on gene expression data for 266 samples, incorporating z-score- and residual-corrected performance metrics. The dataset is provided by Ginte Kutkaite and was last updated on May 11, 2026. It is shared under a CC-BY-4.0 license as a 49.4 KB XLSX file.
A 70.5 KB Excel file containing performance metrics for ridge, lasso, and elasticNet models applied to gene expression data. The dataset was authored by Ginte Kutkaite and last updated on May 11, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
Ivan P. Gorlov's dataset compares mean gene expressions between different stages of Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) in Black and White individuals. The data is stored in a 1.1 MB XLS file and was last updated on May 4, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
Xiaosa Wang published a dataset on figshare in 2026 containing logistic regression analysis results for cell-free fetal DNA (CffDNA) and low birth weight (LBW). The dataset is stored in a 13.5 KB XLS file. Column-level details and row counts are not specified in the metadata.
13.5 KB of statistical analysis results examining the relationship between cell-free fetal DNA (cffDNA) and preterm birth (PTB). The dataset, authored by Xiaosa Wang, was last updated on May 11, 2026, and is shared under a CC-BY-4.0 license on figshare.
Baseline characteristics before and after propensity score matching for a study on the association between cell-free fetal DNA and low birth weight. The dataset was authored by Xiaosa Wang and last updated on May 11, 2026. It is a 13.5 KB Excel file available under a CC-BY-4.0 license.
A small dataset of 13.5 KB in XLS format contains baseline characteristics for a study on the association between cell-free fetal DNA (CffDNA) and preterm birth (PTB). It includes data before and after propensity score matching (PSM). The dataset was authored by Xiaosa Wang and last updated on May 11, 2026.