Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,765 datasets
A tumor reactivity scoring algorithm (TRACE) with open model weights, released in April 2026. The associated data likely contains TRACE scores applied to hundreds of patient samples from tumor atlases spanning lung, colorectal, and pancreatic cancer. The algorithm was developed by David Monteiro using aggregated scRNA-seq data from multiple publications to predict tumor-reactive CD8+ T cells.
Khalid Akkour published a proteomics dataset on figshare in April 2026. It contains label-free quantitative mass spectrometry data from urine samples of 40 women, including 20 endometrial cancer patients and 20 controls. The study identified 193 differentially expressed proteins and three potential biomarkers.
A 2026 case-control study of 40 women (20 endometrial cancer patients, 20 controls) identified 193 differentially expressed urinary proteins. The research, conducted at King Khalid University Hospital, used label-free LC-MS/MS and bioinformatics to pinpoint three key biomarker candidates with high AUC scores. The dataset is published by Khalid Akkour under a CC-BY-4.0 license.
A case-control study of 40 women (20 endometrial cancer patients and 20 controls) recruited from King Khalid University Hospital. Untargeted label-free LC-MS/MS mass spectrometry identified 193 differentially expressed proteins (117 upregulated, 76 downregulated) in urine samples. The dataset was created by Khalid Akkour and last updated on April 9, 2026.
Dean Callaghan authored this research project investigating velocity-based methods for monitoring neuromuscular fatigue in rugby league players. The project comprises two systematic reviews and three original studies, including a season-long monitoring period of a semi-professional team over 24 weeks. The data was last updated on 2026-05-27.
A synthetic internal company workspace and evidence-linked benchmark table for evaluating enterprise retrieval-augmented generation systems. The dataset is authored by Karmane and was last updated on June 8, 2026. It is designed around a realistic fictional company, AsteraOps Cloud, with multiple departments and projects.
5.5 KB of data in an XLS file examines the effect of population origin and maternal line source on germination percentage. Diego Fernando Escobar Escobar authored this dataset, which contains seeds harvested in 2021 from the Carajás FLONA. The dataset was last updated on June 3, 2026.
Carajás FLONA in Brazil is the geographic scope for this dataset. It likely contains germination proportions for jaborandi seeds harvested in 2020, examining the effect of population source and maternal line source. The dataset was authored by Diego Fernando Escobar Escobar and last updated in June 2026.
Scanned images of Ordnance Survey maps and aerial photographs compiled for the Falkland Islands Government and the UK Foreign and Commonwealth Office. The dataset includes original printer films and final paper printed maps, as well as geological field slips compiled by the British Geological Survey. Access to the data is restricted to BGS staff for BGS purposes.
British Geological Survey stores historical Ordnance Survey maps and aerial photographs of the Falkland Islands. The maps are printer films and final paper originals compiled for the Falkland Islands Government and the UK Foreign and Commonwealth Office. The aerial photographs and overlays are copies of geological field slips compiled by BGS under contract to the Falkland Islands Government.
Transcriptome data from continuous cultivations of Acididesulfobacillus acetoxydans, an acid-tolerant sulfate-reducing bacterium. The dataset includes raw counts, reads per kilobase, transcripts per million, and scaling factors, and was uploaded by Reinier A. Egas in May 2026. It is provided in a 2.6 MB XLSX file.
Xenia Schmalz from the University of Padua authored this Open Access paper analyzing the replication crisis in psychological science. The text discusses three key contributing factors: underpowered studies, publication bias, and questionable research practices. It focuses on potential solutions to the problem of underpowered studies.
Part 2 of a three-part geological report detailing the Permian stratigraphy of the Carnarvon Basin, an epicontinental basin containing sediments from the Proterozoic to the Tertiary. The report, published by the Australian Ocean Data Network, includes a summary and references, with text-figure numbers continued across all parts. Permian marine sediments, including glacial deposits, reach a maximum known thickness of 15,200 feet and rest unconformably on older rocks.
The Plan Anual de Auditorias Internas (P.A.A.I) details the 2021 internal audit plan for a municipal administration. It includes columns such as OBSERVACIONES, LINEA ESTRATEGICA, PROGRAMA, INDICADOR, NOMBRE, and SECTOR. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
An enterprise resource planning dataset modeled on SAP-style systems. It was created by VynFi and last updated on June 13, 2026. The description indicates it contains a complete artifact tree from a single end-to-end run.
The Australian Ocean Data Network hosts a study on the Permian brachiopod super-family Productacea in Western Australia. The collection consists of over three thousand specimens representing at least 34 species across three major sedimentary basins covering approximately 150,000 square miles. The study examines shell structure, musculature, and other morphological features of these marine fossils.
A 2026 inventory of buildings located within the City of Sherbrooke, Québec. The dataset categorizes structures as business, hospital, school, or municipal building, with each category associated with a specific subtype code. It is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license.
A retrospective real-world study analyzing 4,157 pregnant women with chronic hepatitis B virus (HBV) infection and 4,192 infants to explore factors influencing mother-to-child transmission (MTCT) in cases with extremely high HBV DNA loads (≥ 1×10⁷ IU/ml). The dataset, authored by Wei Yi and last updated in April 2026, contains clinical records where pregnant women received antiviral therapy in the second or third trimester, and infants were followed up after 7 months.
A 2026 meta-analysis by Sandra Gusi-Martínez integrates publicly available 16S rRNA amplicon sequencing data from terrestrial ecosystems that resemble Icy Moons like Europa and Enceladus. The study identifies key environmental drivers and shared molecular adaptations, such as osmolytes and modified lipids, across these extreme environments. The findings are intended to guide future extraterrestrial life detection efforts.
Sandra Gusi-Martínez's dataset presents a meta-analysis of publicly available 16S rRNA amplicon sequencing data from terrestrial ecosystems analogous to Icy Moons like Europa and Enceladus. The 41.9 KB XLSX file, last updated in April 2026, integrates diverse datasets to identify statistically significant microbial patterns. The analysis suggests depth, pH, and hypersalinity are key environmental drivers, with osmolytes and modified lipids as shared adaptive strategies.