Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,861 datasets
Ordinances from 2000 to 2014 define the boundaries and regulations for New Orleans' Neighborhood Conservation Districts. The dataset, maintained by the City of New Orleans, outlines the geographic areas designated for historic and architectural preservation. It includes legal text specifying the district's purpose to preserve neighborhood character and architectural history.
Leeds Post Codes lists all postal codes within the Leeds metropolitan district. The dataset includes the administrative ward for each postal code and provides eastings and northings coordinates for its central point. It is provided by the Government Digital Service under an Open Government Licence.
Government Digital Service data on the number of staff employed, updated annually under the Data Transparency Code. The 2016/17 figures include new areas such as Garden Communities and Museum Traineeships, as well as externally funded posts for community initiatives. The dataset is available in multiple formats including XML, CSV, HTML, and JSON.
Building footprints and structural metadata for Morocco comprise this export from the Humanitarian OpenStreetMap Team (HOT). Updated in March 2026, the data includes all features with a non-null building tag and is available in multiple formats including SHP, GeoJSON, and KML.
Forward 42 days of advertisement and bidding information for state and local transportation lettings from Texas sources. The dataset includes details on bid items, quantities, schedules, and contacts, sourced from the Electronic State Business Daily (ESBD) and the Electronic Bidding System. It is published by data.texas.gov and was last updated on 2026-04-03.
An annotation file for the black bear genome, published by Mengnan He under a CC-BY-4.0 license. The dataset is a 64.4 MB GZ file, last updated on May 12, 2026. The specific row count and column structure are not detailed in the provided metadata.
A 45.2 KB survey dataset by Zara Meyer, last updated on 2026-04-27, gauging perspectives on book bans and censorship in America. It includes responses from both US and international participants, with US respondents indicating their state and international respondents indicating their country. The dataset contains agree/disagree, 1-10 scale, and open-ended questions to understand student perspectives.
Saudi Arabia genomic study of Salmonella in 260 chicken eggs, detecting a 9% contamination rate. The research by Amani T. Alsufyani, published in 2026, used whole-genome sequencing to identify 12 distinct serovars and analyze virulence and antimicrobial resistance genes from isolates on eggshells and in contents.
92.2 KB of CSV data supporting a 2026 PNAS study by Grundy & Pujols-Beltran. The dataset likely contains measurements of executive function and bilingualism status in an aging population affected by COVID-19. It was published on figshare under a CC-BY-4.0 license.
Saima Munawar collected this data to test hypotheses for the study "Putting First Things First: Air Travelers' Extra Role Behavior in Commercial Airlines - The Role of Necessity and Additive Logics". The dataset is stored in an XLSX file sized 105.9 KB and was last updated on 2026-04-21. It is shared under a CC-BY-4.0 license on figshare.
Hermes Coworker Flash is a curated instruction-style dataset built from traces of everyday assistant interactions. It combines GLM-5.1 and kimi-2.5 splits, focusing on non-programming, quick-turnaround tasks. The dataset was created by LiteMind and last updated on 2026-05-17.
Six replicates of mass spectrometry data capture metabolites from sorted somatic cyst and germ cells of Drosophila testes. Analysis used hydrophilic interaction liquid chromatography and an Orbitrap QExactive in polarity switching mode, with metabolite identity confirmed by matching to authentic standards. The dataset comprises .mzXML files separated for positively-charged and negatively-charged metabolites.
CERF Marine Biodiversity Hub's survey of the Carnarvon Shelf, Western Australia, was conducted in August and September 2008. The collaborative effort between the Australian Institute of Marine Science and Geoscience Australia collected co-located physical and biological data across three primary study areas. The report details survey methods, initial data interpretations, and examples of encountered biota.
CERF Marine Biodiversity Hub's survey of the Carnarvon Shelf, Western Australia, was conducted in August and September 2008. The collaborative effort between the Australian Institute of Marine Science and Geoscience Australia collected co-located physical and biological data across three primary study areas. The report details survey methods, initial data interpretations, and examples of encountered biota.
Reports from 2026-04-09 include assessments, reviews, analyses, and audits produced by external organizations for the Government of Canada's human resources and pay functions. The collection is published by Public Services and Procurement Canada under the OGL-CA-2.0 license. Documents are provided in DOCX format and may contain both qualitative and quantitative data.
Borhan Shokrollahi's dataset contains PLINK genotype files and phenotype data for genome-wide association analyses of five behavioral traits in horses: Friendliness, Gentleness, Patience, Sensitivity, and Aggressiveness. The phenotype files include deregressed estimated breeding values and reliability-derived weights used in weighted mixed-model GWAS analyses. The dataset was last updated on 2026-04-28 and is available under a CC-BY-4.0 license.
Data from 2000 to 2025 were collected from authorship lists of articles published in the journal Veterinary Anaesthesia and Analgesia. The data were gathered by Daniel Pang and hosted on Harvard Dataverse to explore trends in the number of listed authors per article. The dataset was last updated on June 9, 2026.
NASA's Advanced Plant Experiment-04 - Epigenetic Expression (APEX-04-EpEx) experiment on the International Space Station examined cytosine methylation in Arabidopsis thaliana root tissues. The dataset contains single-molecule methylation profiling from wild-type and elp2-5 mutant plants, generated using Flap-Enabled Next-Generation Capture (FENGC) to target specific genomic regions. Data was last updated on March 13, 2026.
A 122.6 KB Excel dataset from figshare, authored by Dr Rahul Goyal and last updated in April 2026. It contains results from a complete energy, exergy, and exergoeconomic evaluation of a compression ignition engine using silicon dioxide nanoparticle-enhanced water-diesel emulsified fuel.
Port Phillip Bay bathymetry data collected by Deakin University over three days in January 2018. The survey used a Kongsberg EM2040c sonar system onboard the Motor Vessel Yolla to map locations of drift algae. This dataset is part of a collaborative program with the University of Melbourne.