Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,821 datasets
Bit strings encode the presence or absence of selected genes across large evolutionary clades. The dataset, created by Xiao Liang and last updated in May 2026, includes tissue-specific expression details for genes expressed in five or fewer tissues. It is a small dataset of 9.5 KB stored in an XLS file.
A 5.5 KB Excel file containing expression and existence bit strings for a subset of genes. The dataset, created by Xiao Liang and last updated in May 2026, focuses on genes absent from at least two large clades and lists tissues for genes highly expressed in five or fewer tissues.
A 51.7 KB dataset published on figshare by Tessa Barrett on April 13, 2026. It contains transcriptomic and immune profiling data from a study investigating the role of the tetraspanin CD37 in critical limb-threatening ischemia (CLTI). The data likely includes results from murine hindlimb ischemia models and human muscle RNA-seq analysis.
A 7-year period (2016-2023) of data on Quebec Acceptance Certificates (CAQ) issued and finalized applications under the International Student Program, as recorded on December 31 of each year. The dataset is provided by the Government and Municipalities of Québec and was last updated in April 2026.
Spatial Services polygon data delineates tidal and non-tidal watermarks forming cadastral boundaries in New South Wales. Updates occur within 10 working days from plan lodgement, sourced from subdivision activity and multiple government agencies. This dataset underpins land tenure, use, and environmental management.
Annual academic arts program tuition fees for full-time international students at public post-secondary institutions in British Columbia, Canada. The dataset covers academic years from 2016/17 to 2025/26 and is provided by the Government of British Columbia. Data is organized by economic development region and institution.
ResearchBench is the official dataset for the ACL Findings 2026 paper. It provides benchmark data for three inspiration-based subtasks of scientific discovery, including inspiration retrieval. The dataset was released by author ankilok and last updated on May 31, 2026.
Ro Real Estate Listings is a dataset of publicly available property advertisements scraped from Romanian websites. It contains structured information such as price, location, number of rooms, and surface area. The dataset is curated by Flavius Paler, is in Romanian, and is continuously updated via an automated pipeline.
Post-launch checkout phase data from 2006, including instrument commissioning activities before and after the aperture door opened on August 29. This version 3.0 dataset contains calibrated images from the Long Range Reconnaissance Imager (LORRI) on the New Horizons spacecraft, taken by NASA. It includes bias images, internal lamp images, and calibration observations of stars in cluster M7 and planetary targets like Jupiter, Uranus, Neptune, and Pluto.
Data derived from Illumina MiSeq sequencing of the 16S rRNA gene in mouse fecal samples, as well as physiological and biochemical measurements. The dataset includes physiological characterization of probiotics, body weight changes, organ indices, and Salmonella loads in mouse liver, ileum, and cecum. Author Yue Wang published this 33.0 KB XLSX file on figshare under a CC-BY-4.0 license, last updated on 2026-05-11.
Post-launch checkout data from the New Horizons spacecraft's Long Range Reconnaissance Imager (LORRI) instrument, collected primarily in 2006. The National Aeronautics and Space Administration produced this dataset, which includes bias images, internal lamp images, and calibration observations of stars and planets. Version 3.0 provides recalibrated lossy images and updated documentation.
Polygon features represent dedicated public roads in New South Wales, Australia, including attributes like section number, plan label, and road width. The dataset forms the foundation fabric of land ownership, managed by Spatial Services (DCS). It is scheduled for retirement and transition to a GDA2020 coordinate reference system.
A spatial polygon dataset collated for the implementation of the EU Water Framework Directive (WFD). It identifies water bodies in two categories: those currently 'failing due to acidification' and those 'at risk of failing due to acidification' by 2027. The data was produced by the Government Digital Service under the OGL-UK-3.0 license.
Polygon data provides a spatial representation of rights of way, including carriageway and easement in gross. Spatial Services continuously updates the dataset with information sourced from stakeholders and custodians. The majority of updates originate from subdivision, registration, and gazettal activity.
Data from a study on visual processing efficiency in autistic adults, submitted for publication in Nature Communications. The archive contains the complete dataset for the study authored by Martin Arguin, Jade Desrosiers, Lili El Khalil, and Laurent Mottron. The associated manuscript was under review as of June 12th, 2026.
Road centrelines represent the spatial center line of cadastral road corridors in New South Wales. Spatial Services continuously updates this line feature dataset with information sourced from stakeholders like Crown Lands and local councils. The data is updated within 10 working days from when a subdivision plan is lodged.
Szeman Cheung authored supplementary material for a prospective cohort study investigating insulin resistance in psoriasis. The 464.9 KB DOCX file was uploaded to figshare under a CC-BY-4.0 license. The study examines an intrinsic insulin-resistant phenotype and its predictive value for biologic treatment efficacy.
Replication data and code for the 2026 Journal of Monetary Economics article by Cavallo et al. The dataset supports the analysis of the price impact of Canadian retaliatory tariffs. It was authored by Alberto Cavallo and hosted on Harvard Dataverse, with a last update in June 2026.
42 real-world laboratories contributed isoform expression and alternative splicing quantification data for the Quartet reference materials. The 3.1 GB dataset, authored by Duo Wang and last updated in April 2026, includes files for isoform FPKM and event PSI values. It is supplemented by junction reference datasets generated using long-read sequencing technology.
Alexander E. Kister published a dataset on figshare in April 2026 detailing peptide residue volumes in HLA II DR protein structures. The dataset likely contains tabular data with residue volumes in cubic angstroms categorized by size groups and summary statistics for each peptide position. The file is 25.7 KB in size.