Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,527 datasets
Three independent bacterial lineages rapidly developed oxytetracycline resistance after 3 days of exposure. Whole-genome sequencing identified recurrent point mutations in genes AHA_2785, AHA_2910, and AHA_0308, while transcriptomic analysis revealed over 1,000 differentially expressed genes. The dataset, authored by Ju Zhang and last updated in May 2026, likely contains genetic and transcriptional data from this experiment on antimicrobial resistance in aquaculture.
Three independent bacterial lineages rapidly developed oxytetracycline resistance after 3 days of exposure, showing consistent adaptation patterns. Whole-genome sequencing identified recurrent point mutations in genes AHA_2785, AHA_2910, and AHA_0308, while transcriptomic analysis revealed over 1,000 differentially expressed genes. This dataset by Ju Zhang, last updated in May 2026, documents genetic and transcriptional changes driving antimicrobial resistance in an aquatic environment.
384 dentists from Maharashtra, India, participated in a cross-sectional survey from May to October 2024. The study identified a significant difference in knowledge and attitude scores between rural and urban dentists (p < 0.01). Urban practitioners displayed higher knowledge levels and more favorable attitudes towards digital dentistry.
1,155,183 SNPs and 286,674 InDels were detected in wild-type tobacco K326, while its cold-sensitive mutant M18 had 1,724,339 SNPs and 360,131 InDels. This dataset, authored by Hui Yin and shared under CC-BY-4.0, contains genomic variant data from a comparative analysis aimed at identifying genes linked to cold-induced early flowering. The data was last updated on May 1, 2026.
A high-quality genome assembly for the elite wheat cultivar Jimai 22 (JM22) was generated using PacBio HiFi and chromosome conformation capture sequencing. The assembly is 197.5 MB in size and was published by Guangwei Li on figshare in May 2026. It provides a resource for delineating the genetic basis of freezing tolerance, specifically highlighting CBF2 gene haplotypes.
Two independent 100 ns molecular dynamics simulations of the PFDN4 protein in complex with the ligand baicalein, performed by Weichao L and last updated on 2026-04-21. The 11.7 GB dataset includes trajectory files and analysis scripts tracking six specific pharmacophoric distances to assess binding stability. The data was generated in response to peer review to demonstrate ligand binding mode stability beyond global metrics.
Supplementary tables S1 to S13 accompany a research paper on chewBBACA 3, a tool for whole- and core-genome multilocus sequence typing. The tables contain performance metrics like runtime and memory usage for schema creation and allele calling, as well as schema composition statistics for multiple bacterial species. Rafael Mamede authored the file, which was last updated on May 1, 2026.
Leon French provides 3.8 GB of additional large data files supporting the study 'Cluster replicability in single-cell and single-nucleus atlases of the mouse brain.' The files include pretrained Metaneighbor models, processed data from multiple brain atlas projects, and results from spatial and marker analyses. The dataset was last updated on April 27, 2026.
R code and processed spreadsheet data for creating heatmaps from metabolomics peak area data. The data were downloaded from the GNPS2 Analysis Status Page for LC-MS/MS acquisition modes and processed into a simplified format. Authored by Vlastimil Novak and last updated on June 1, 2026.
Two text files containing a UK wheat pedigree subset formatted for the Helium crop pedigree visualization software. The dataset is 229.5 KB in size and was authored by James Cockram, with the last update on 2026-04-24. This is an updated version of a UK subset originally published alongside a global wheat pedigree in PLOS Biology in 2019.
The CORAL Earth Venture Suborbital-2 mission provides a uniform picture of coral reef composition across the Mariana Islands, Palau, portions of the Great Barrier Reef, and the Main Hawaiian Islands. Data is collected using the Portable Remote Imaging Spectrometer (PRISM) instrument aboard a Gulfstream-IV aircraft, combined with in situ measurements. The dataset is produced by the National Aeronautics and Space Administration.
60 adolescents with severe obesity were studied longitudinally before and after bariatric surgery. Metabolomic profiles from plasma and adipose tissue were analyzed using gas/liquid chromatography and high-resolution mass spectrometry to identify metabolic signatures linking DDE exposure to weight loss outcomes. The dataset, authored by Zhenjiang Li and last updated in 2026, contains results from metabolome-wide association analyses.
1.1 GB of supplemental data from an ICLR 2026 paper comparing popular hyperbolic embedding methods. The package contains performance and quality results for algorithms like Bläsius et al. (ESA 2016) and Nickel and Kiela's Poincaré (NIPS 2017) and Lorentz (ICML 2018) embeddings on real-life hierarchies, networks, and simulated networks. Author Eryk Kopczynski released it under a CC-BY-4.0 license on figshare in May 2026.
A 5.0 MB dataset from figshare, last updated in May 2026, contains raw data from a study on polypills for hypertension and heart failure. The research, authored by jia hu, employed bibliometric and network pharmacology analyses to identify key biological targets and pathways. Findings suggest polypills exert effects through coordinated action of multiple signaling pathways like MAPK and PI3K/Akt.
Zhen Hu's research dataset from 2026 investigates plasma protein associations with C9ORF72 repeat expansions. It contains proteomic and genetic data from 106 individuals with C9ORF72 expansions and 212 matched controls from the UK Biobank, screening approximately 3,000 proteins. The analysis identifies neurofilament light chain (NEFL) as a key biomarker linked to repeat count and motor neuron disease risk.
106 individuals with C9ORF72 expansions and 212 matched controls from the UK Biobank were analyzed for plasma proteomics. The dataset contains results from a screen of approximately 3,000 proteins, identifying NEFL as a key biomarker. It was authored by Zhen Hu and last updated on 2026-04-21.
Western Australian Permian marine faunas include species from the superfamily Orthotetacea. At least 23 species across four genera are described, many with restricted ranges useful for stratigraphical correlation. The data is provided by the Australian Ocean Data Network.
Alice Springs, Australia hosts this data containing stem diameter, height measurement, and above-ground living biomass calculations for 100 stems located within a mulga flux tower footprint from 2014. The data was aggregated by the Terrestrial Ecosystem Research Network's Data Discovery platform.
Christine Hammond's 2026 dataset provides genome assemblies and comparative genomics data for the phytopathogenic bacterium 'Candidatus Phytoplasma pruni'. It includes two new, high-quality, non-fragmented genome assemblies and a survey of other genomes in the 16SrIII ribosomal group. The data confirms the lack of a functional GroE chaperonin system in this species and identifies non-functional groEL pseudogenes.
A monitoring report covers 464 houses repaired by UNHCR in eastern Ukraine during 2018. The monitoring visits, representing 34% of all repairs that year, were conducted between September 2018 and March 2019 by teams from UNHCR's shelter and protection units. The report confirms high recipient satisfaction and quality of repairs across both government-controlled and non-government-controlled areas.