Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,811 datasets
A 1.4 GB dataset from a genome-wide CRISPR/Cas9 loss-of-function screen in Madin-Darby bovine kidney cells, identifying host factors for Bovine Enterovirus F. The data, authored by yuanchen Geng and last updated on 2026-04-20, supports findings linking carboxypeptidase A6 to viral production via the PI3K/AKT/FOXO1 signaling axis.
London universities provide data on the number of international students enrolled. The dataset is published by the Greater London Authority on the uk_data platform and was last updated on 2026-06-24. Columns likely include counts per institution and may suggest temporal or demographic breakdowns.
Environmental DNA (eDNA) data was collected during the RV Investigator voyage IN2022_V09 from November 19 to December 19, 2022. The study compared three eDNA processing methods and validated fish detections against trawl survey data and the regional species pool. The dataset is provided by the Australian Ocean Data Network.
11 percentage points is the estimated increase in the probability of collusion between two firms after the onset of common leadership, according to the associated research paper. The dataset likely contains firm-level observations used to study the link between shared executives or board directors and collusive behavior. It was authored by Alejandro Herrera-Caicedo and associated with the Journal of Political Economy, with a last recorded update in June 2026.
Cadastral alphanumeric information for properties under the jurisdiction of the Valle del Cauca Governorate's Cadastral Manager. The dataset includes columns for municipality, constructed area, land area, and economic use classification. It is published via the Colombian open data portal and was last updated on 2026-05-18.
259 mother-infant dyads participated in a study linking adult attachment styles to physiological arousal and maternal sensitivity. The data includes measurements from the Adult Attachment Interview during pregnancy and physiological data (RSA and SCL) and sensitivity ratings collected during the Still Face Procedure when infants were 6 months old. The dataset was authored by Esther Leerkes and is hosted on Harvard Dataverse.
Environmental DNA sequence data and video observations were collected during the 2021 TEMPO voyage in the East Antarctic sector of the Southern Ocean. The dataset comprises 110 eDNA samples from surface and near-seafloor depths, analyzed with a euphausiid-specific metabarcoding marker, alongside CTD-mounted camera footage. Data were acquired by the Marine National Facility aboard the RV Investigator between 29 January and 24 March 2021.
A reference genome and annotation for the fish species Symphodus ocellatus. The dataset includes a primary genome assembly, a mitogenome, and corresponding gene annotations. It was authored by Ainhoa López and last updated on 2026-05-14.
Ainhoa López provides a reference genome for the marine fish Symphodus tinca. The dataset includes genome assembly, annotation, and a separate mitogenome, totaling 312.7 MB in FASTA and GFF formats. It was last updated on May 14, 2026.
A preliminary list of known mine sites within the Cornwall and West Devon World Heritage Site area. The dataset aggregates information from sources including the Cornwall and Isles of Scilly Historic Environment Record, the Devon Sites and Monument Record, and Ordnance Survey historic maps. It is published by the Government Digital Service under the UK Open Government Licence.
KSI-028 represents a newly discovered tetrahydroquinoline-based chemotype for inhibiting the STING protein, which is implicated in inflammatory diseases. The dataset, published by So Hyeon Jeong in 2026, includes results from mechanistic studies and in vivo efficacy testing in a cisplatin-induced acute kidney injury mouse model. It demonstrates the compound's ability to suppress STING-dependent signaling and reduce cytokine production in murine and human cells.
10.4 MB of data and code for developing a predictive model for acute lumbar disc herniation. The dataset includes Python scripts, serialized model files, and Excel spreadsheets, published under a CC-BY-4.0 license by Dongteng Liao. It was last updated on May 24, 2026.
ELONA assay results evaluate a single-stranded DNA aptamer for detecting active tuberculosis in sputum samples. The dataset includes results from 68 patient samples, comprising 20 TB patients and 48 control patients. Charlotte Maserumule authored this dataset, which was last updated in April 2026.
The T'licho region of Canada's Northwest Territories is the focus of a research report containing 105 place names. This document, published by the Government of Northwest Territories, explores the link between these names and their meanings. The dataset was last updated on 2026-05 06.
Robert Moss published ensemble forecast log-transformed CRPS values on figshare in April 2026. The dataset contains mean CRPS values aggregated for lead times of 1-7 days, 8-14 days, 15-21 days, and 22-28 days across multiple jurisdictions. The data indicates forecast performance decreased as lead time increased.
A summary of models included in ensemble forecasts for influenza. The data reports the percentage of ensemble forecasts each model was included in and the number of models per forecast (minimum, maximum, and mean), broken down by periods when specific strain(s) dominated. The dataset was authored by Robert Moss and last updated on April 22, 2026.
A 2018 revision of China's Code of Corporate Governance established an ESG information disclosure framework for listed companies. This dataset likely contains financial and ESG metrics for China A-share listed companies from 2010 to 2022, used to study the impact of ESG disclosure on overseas revenue. It was authored by Wang Qiankun and hosted on Harvard Dataverse.
Samuel Butcher's dataset provides the uncropped gel images from a study investigating how TDP-43 controls RNA structure through high-affinity lattice interactions. The collection is 228.6 MB in size and includes files in TIF, JPG, and XLSX formats. It was last updated on May 14, 2026, and is shared under a CC-BY-4.0 license.
9.5 KB of data on the audible acceptability of tidal versus deep breathing audio recordings. Sue In Choi authored this dataset, which was last updated on May 13, 2026. The description indicates rater observations on artifacts from skin friction, low lung sound levels, and ambient noise.
Mohsen Khosravi's dataset contains a thematic analysis of 81 studies on the limitations of Large Language Models (LLMs) in healthcare. The analysis was conducted via a systematic review of English articles published between 2018 and 2025. Data was extracted and categorized using Boyatzis's thematic approach and the IPO model.