Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,321 datasets
ACAPS aggregates data from multiple sources to track the humanitarian situation across Afghan districts. The dataset monitors key drivers like conflict, commodity prices, exclusion, and access to services, alongside their impacts on food insecurity and cholera. It was last updated on 2026-05-14 and is provided under a CC-BY-4.0 license.
A manually categorized set of priority-related labels from the 5000 most-starred GitHub repositories as of 2022-06-01. The labels have been ranked and normalized into three values: High, Medium, and Low. The dataset was produced by J. Caddy and C. Treude for a research paper on prioritizing GitHub labels.
83 domestic cats across five age groups were analyzed using 16S rRNA gene sequencing to characterize gut microbiome differences. Alpha-diversity was lowest in Pre-weaning kittens, peaked in Young adults, and declined in Mature adults, with beta-diversity showing distinct clustering among groups (PERMANOVA, R²=0.33). The study by Yan Wang, last updated in 2026, identified taxonomic shifts, such as enrichment of Proteobacteria in Pre-weaning kittens and higher Faecalibacterium abundance in Young adults.
83 domestic cats were analyzed using 16S rRNA gene sequencing across five age-defined groups: Pre-weaning (1.5 months, n=16), Early kitten (3 months, n=16), Late kitten (6–10 months, n=15), Young adult (2 years, n=20), and Mature adult (7–10 years, n=16). The dataset, authored by Yan Wang and last updated in 2026, shows significant differences in microbial diversity and composition across these groups.
A new real-time PCR assay using SYBR Green provides higher sensitivity for detecting low levels of the fungal pathogen Pneumocystis jirovecii. The protocol, validated by Susana Ruiz-Ruiz, detected P. jirovecii in three nasal aspirate samples that were negative by a nested-PCR method. The dataset, last updated in 2026, compares results from this single-round protocol against a nested-PCR method targeting the mitochondrial large subunit rRNA.
Multiple datasets provide the locations of municipal buildings for the cities of Rimouski and Repentigny in Quebec, Canada. The data is available in several geospatial and tabular formats, including SHP, GEOJSON, KML, and CSV. It is published under an open CC-BY-4.0 license by the Government and Municipalities of Québec.
A 2026 academic paper by Vivien Jiaqian Zhu compares textual changes between the 1498 Hongzhi edition and Jin Shengtan's edition of Wang Shifu's Xixiang Ji. The paper examines the limitations of Jin's commentary and connects it to commentary in The Story of the Stone. The dataset is a 785.9 KB PDF file shared under a CC-BY-4.0 license.
Hanningfield reservoir in Essex, UK, is the location for this acoustic telemetry data on European eels (Anguilla anguilla). The dataset likely contains fine-scale movement records for 104 eels tagged across two periods in 2015 and 2016, alongside biometric measurements like length, weight, and lipid percentage. It was contributed by the Marine Environmental Data & Information Network.
OSFI provides publicly available financial data filed by federally regulated life insurance companies in Canada. The data includes monthly and quarterly returns, with scheduled publication dates listed for 2026, and reflects a transition to IFRS 17 accounting standards starting in 2023. Industry totals must be calculated by combining 'Total Canadian Life Companies' and 'Total Foreign Life Companies'.
The Office of the Superintendent of Financial Institutions Canada publishes scheduled publication dates for retail associations' monthly and quarterly returns for the year 2026. The data includes specific due and publication dates for each month and quarter, sourced from the open_canada platform. The dataset was last updated on 2026-04-16.
Monthly and quarterly financial data filed by federally regulated fraternal benefit societies in Canada, published on a scheduled basis for 2026. The dataset is provided by the Office of the Superintendent of Financial Institutions Canada and includes separate totals for Canadian and foreign societies. Data from fiscal 2023 onward reflects IFRS 17 accounting standards.
A prognostic signature for glioma developed from RNA-seq profiles from the CGGA, GTEx, and TCGA cohorts. The dataset likely contains results from consensus clustering and a SuperPC-based model identifying 28 differentially expressed genes. It was authored by Kun Wang and last updated on May 8, 2026.
117 machine-learning algorithm combinations were screened to develop a prognostic signature for glioma. The signature stratifies patients into high- and low-risk cohorts based on chromatin remodeling-related genes. Kun Wang authored this dataset, which was last updated on May 8, 2026.
A transcriptomic analysis of myoepithelial cells isolated from seven archived human breast cancer specimens and adjacent normal tissue. The dataset includes gene expression changes related to extracellular matrix interactions, epithelial-mesenchymal transition, and cellular signaling. It was authored by Mohamed M. Haq and last updated on 2026-05-12.
Multibeam echosounder data collected during the RV Investigator voyage IN2018_V04, which sailed from Hobart on 11 September 2018 and returned on 8 October 2018. The Kongsberg EM710 MKII system acquired seafloor bathymetry, backscatter, and watercolumn backscatter data, stored in 23 raw files totaling 11.1 GB. Processed data grids are available in GeoTIFF format, with additional products potentially available on request from the Australian Ocean Data Network.
Data Sheet 1_Filaggrin as a potential biomarker in gastric cancer: insights from multi-omics analysis and experimental validation.zip is a 4.2 MB dataset shared by author Nan Xia on figshare under a CC-BY-4.0 license. The data was last updated on 2026-05 08 and supports research into the role of the Filaggrin (FLG) gene in gastric cancer progression, prognosis, and the immune microenvironment.
A supplementary document from a study investigating the therapeutic mechanisms of a Sanghuangporus vaninii-based functional food formulation (FSV) for type 2 diabetes mellitus (T2DM). The research used UPLC-QTOF/MS chemical profiling, network pharmacology, a T2DM mouse model with a 4-week intervention, hepatic metabolomics, and molecular docking. The 23.9 MB DOCX file was authored by Zifeng Huang and last updated on 2026-05-08.
Acremonium chrysogenum is a fungus used for industrial cephalosporin antibiotic production. Zhen Chen developed a tRNA-gRNA array-based CRISPR/Cas9 multiplex genome-editing system for this organism, enabling gene knockout, large-fragment deletion, and overexpression. The dataset, last updated in April 2026, likely contains experimental results from this system.
91 individuals from the Colorado Adoption/Twin Study of Lifespan behavioral development and cognitive aging (CATSLife1) provided samples for comparing 15 DNA methylation clocks across saliva, buffy coat, and peripheral blood mononuclear cells. Mean Spearman correlations between chronological age and DNA methylation ages were moderate, ranging from r=0.34 to r=0.40 across tissues. The dataset, authored by Ryan Bruellman and last updated in April 2026, shows saliva-based clocks yield older methylation ages than blood-based ones, while buffy coat and PBMC results are comparable.
IJCAI conference papers are stored as raw PDF files in this repository, which is a shard of a larger AI Conference & Journal Papers dataset project. The repository, created by GenAI4ELab, was last updated on 2026-06-20 16:45:02. It contains only the binary PDF files and does not include searchable metadata such as titles, abstracts, or authors.