Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,826 datasets
Precision data for Hepatitis B Virus DNA assays comparing three different extraction methods. The dataset was authored by Yanfang Mo and last updated on 2026-05-19. It is a small dataset of 5.5 KB, stored in an XLS file format.
A 2020 review paper from the University of Tehran discusses potential molecular targets for drug development against the novel coronavirus. The paper overviews current drugs in the pipeline and highlights protease and RNA polymerase as promising targets. Based on antiviral repurposing and clinical studies, it suggests remdesivir as an encouraging frontline treatment.
Nihoa, a remote Hawaiian island, is the source for the complete mitochondrial genome of the endemic isopod Ligia barack. The 1.3 MB dataset, authored by Carlos A. Santamaria and published in 2026, contains the sequence for 13 protein-coding genes, 22 tRNAs, two rRNAs, and a control region. It was generated using low-coverage sequencing for phylogenetic analysis.
The Gene Expression Omnibus (GEO) is a public repository for high-throughput gene expression and genomic hybridization data. It was initiated by Ron Edgar at NCBI in response to demands for a central data distribution hub. The repository organizes data into platforms, samples, and series to facilitate submission, storage, and retrieval of heterogeneous experimental datasets.
A video presentation details findings from two Geoscience Australia marine surveys conducted in 2003 and 2005. The story relates to the discovery of previously unknown submerged coral reefs across the southern Gulf of Carpentaria, identified using new multibeam sonar technology. The age of the reefs was determined using drill-core samples analyzed via the Uranium/Thorium method at the Australian National University.
OpenArt's items and artifacts collection contains 25,750 public-domain works of human-made objects and decorative arts. The dataset includes 11,317 paintings and illustrations, 14,216 photographed objects, and 217 unclassified works, each paired with a structured VLM caption. It was created by author jaddai and last updated on Hugging Face in May 2026.
A matrix of information assets from the Governorate of Tolima, Colombia, published via the Socrata platform. The dataset includes columns for asset classification date, personal data content, legal basis, and custodians. It was last updated on 2026-05-18.
Beginning in 1991, this dataset tracks the number of business entity filings made with the New York Department of State's Division of Corporations. It categorizes filings by type, authority, and certificate, with monthly counts. The data is provided by data.ny.gov and was last updated in April 2026.
Raw data files supporting a research paper on in-situ bromide entrapment in tungsten oxide for durable acidic oxygen evolution reaction (OER). The dataset is 20.1 MB in size, authored by Sanjiang Pan, and was last updated on May 18, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
Fisheries and Oceans Canada collected environmental DNA (eDNA) survey data from 2017 to 2024 to assess the distribution of freshwater mussels in New Brunswick. The dataset includes results for species such as the Brook Floater, Eastern Pearlshell, and Yellow Lampmussel, using species-specific qPCR assays. Data is available in CSV and ESRI REST formats under an OGL-CA-2.0 license.
A 2026 study by YING ZHANG addresses uncertainty in aerosol climate effects by modeling two partial internal mixing states of black carbon. The dataset includes model outputs demonstrating the influence of non-BC species mass fraction on aerosol absorption. Results are intended to improve estimates of aerosol climate effects and understanding of climate change.
City of Sydney's environmental performance grants support projects by businesses and residents to improve building sustainability. The dataset was last updated on 2026-06-12. It likely contains records of funded projects, applicants, and outcomes.
29 years of blue whale sightings data were collected by the Mingan Island Cetacean Study from 1980 to 2008 during annual surveys. The project aimed to inform critical habitat designation under the Canadian Species at Risk Act, with detailed analysis of sightings per kilometer of effort in 3x3 km grid cells for 2000-2008.
Statistiques DVF is a dataset of French real estate transaction statistics sourced from the official French open data portal, data.gouv.fr. The dataset is published by Data-Gouv-FR and was last updated on May 29, 2026. It likely contains tabular data on property sales and valuations.
Transposon-directed insertion-site sequencing data identifies genes essential for adherence of the pathogenic Escherichia coli O157:H7 serotype. The dataset, authored by Miaomiao Liu and published in April 2026, includes results from a genome-wide screen for regulators of the LEE pathogenicity island. It comprises files in XLS, DOCX, and XLSX formats totaling 3.2 MB.
Raw data from a study investigating complement hyperactivation in Severe Fever with Thrombocytopenia Syndrome virus infection. The dataset, 1003.6 KB in size, was published by author Yan Liu on figshare under a CC-BY-4.0 license and last updated on April 15, 2026. It contains experimental results from a lethal mouse model using IFNAR−/− mice.
Ratings on accessibility of buildings for wheelchair users, provided by the City of Ballarat. The data is in high demand by community groups for selecting suitable venues and was last updated on 2026-04-26. The dataset is available in multiple formats including CSV, JSON, and SHP.
RNA-seq datasets are available for download in CSV format. The data, published on figshare by Nien Pei Tsai, is licensed under CC-BY-4.0. Its small size of 158.2 KB suggests a limited scope, such as a pilot study or a focused subset of sequencing results.
Geoscience Australia provides seamless greyscale topographic mapping for the whole of Australia and its external territories. The service combines Geoscience Australia data at smaller scales with OpenStreetMap data at larger scales. It was last updated on 2026-05-14.
14.4 KB of data from a suppressor screen in C. elegans, identifying genetic variants linked to microtubule regulation in neurons. The dataset was created by Sunanda Sharma and last updated on April 15, 2026. It includes variants in tubulin genes and the cytokinesis-associated protein CITK-1/2.