Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,457 datasets
A seamless topographic greyscale mapping service for the whole of Australia, including the external territories of Cocos (Keeling) Islands, Christmas Island, Norfolk Island, and Lord Howe Island. The service is produced for the National Map project and combines Geoscience Australia data at smaller scales with OpenStreetMap data at larger scales. It was last updated on 2026-06-05.
Domestic Waterfront Precinct is a geospatial dataset recording the spatial extent of precincts created under Domestic Waterfront Reforms in New South Wales, Australia. The dataset is maintained by the Department of Planning, Housing and Infrastructure and is part of the state's Foundational Spatial Data Framework, updated to the GDA2020 coordinate system. It was initially published on 05/05/2020 and is sourced from data provider files.
Heritage buildings protected under Quebec's Cultural Heritage Act are listed in this official register. The Ministry of Culture and Communications provides this data in KML, CSV, and WMS formats under a CC-BY-4.0 license. Note that geographic coordinates are indicative points for location, not legal property boundaries, and some entries may lack coordinates entirely.
Shihao Liu published supporting data for the selection of DNA-encoded libraries against epigenetic protein targets on figshare in May 2026. The dataset contains results from high-throughput screening of a library with over 500 million unique members against the YEATS2 YEATS domain. It includes validated hits, with binding affinities confirmed via SPR and MST assays, and the most potent hit exhibiting a dissociation constant (Kd) in the range of 0.47–7.3 μM.
All original experimental data and analytical scripts from a study on metabolic reprogramming and immune microenvironment regulation in intervertebral disc degeneration. The dataset includes raw Ct values for GCLM and GAPDH genes, calculated relative gene expression results, and complete R code for multi-omics data mining. It was authored by Zhonghua Zhang and corresponds to a manuscript on biomarker identification using machine learning and single-cell transcriptomics.
A research document details a study on the role of complement component C5a in pulmonary hypoperfusion-induced alveolar hypoplasia. The study uses bulk and single-cell RNA sequencing of lung tissues from neonatal rats and includes histological assessments and serum measurements from children. The document was authored by Chenxi Liu and last updated on 2026-05-11.
Jenni Firrman's dataset contains ex vivo study data on the effects of tomato seed extracts on the human gut microbiota. The data includes shotgun metagenomic DNA sequencing, biomass cell counts, short-chain fatty acid concentrations, gas, and pH measurements from 48-hour incubations. It was last updated on May 22, 2026.
Municipal-level data classifies agroclimatic hazards and global threat summaries for the Sahel region. The dataset was produced by the PREDISAN AI-SAHEL Project, funded by the Andalusian Agency for International Development Cooperation and the University of Granada. It was last updated in April 2026.
Over 200 arsenic-binding proteins were identified in living lymphoma cells using a novel fluorescent probe. The data supports research into the therapeutic mechanism of arsenic trioxide in diffuse large B-cell lymphoma. The dataset was authored by Hongyu Zhao and last updated on June 2, 2026.
Over 500 individual white sharks were DNA sequenced for capture-mark-recapture analyses. This data supports a national assessment of the southern-western adult white shark population and an update for the eastern Australasian population. The work was conducted under the NESP Project A3 to inform recovery actions and policies balancing conservation and public safety.
121 pediatric epilepsy patients aged 2-18 years were analyzed in a retrospective study to develop a predictive nomogram. The model, created by Tian Hu and last updated in April 2026, identifies four predictors for suboptimal valproate levels: daily dose, acute liver injury, acute kidney injury, and concurrent meropenem use.
A dataset representing gazetted suburb and locality boundaries for New South Wales, Australia, updated to the GDA2020 national standard. It is maintained by the NSW Geographic Names Board and Spatial Services, with postcodes sourced from Australia Post. The data is positioned in alignment with the Land Parcel and Property theme.
1.1 MB of research data from figshare explores the causal link between primary sclerosing cholangitis (PSC) and colorectal cancer (CRC) risk via gut microbiota. Mendelian randomization analysis suggests a causal association (OR = 1.172) and identifies Lachnospiraceae family and PCBP1 gene as potential mediators. The dataset was authored by Zhaobin He and last updated on April 24, 2026.
A 14.9 MB dataset supporting the study 'Ordinal Alignment of Polymer Solvation States Enables Predictive Materials Design'. It contains processed data bundles for model training and a schema example, created by Zheng Jie Liew and last updated in April 2026. The dataset is intended for benchmarking and developing data-driven models for polymer–solvent solvation behavior.
Almost 60,000 records of historic sites, monuments, buildings, artefacts, and landscapes compiled by Cornwall Council. The Cornwall and Scilly Historic Environment Record (HER) database is maintained for functions including planning, conservation, and research, with information derived from publications, fieldwork, and public contributions. The dataset is constantly updated as new information is acquired.
Data from a 2026 study investigating key parameters for measuring effective density of low-volatility spherical particles. The dataset, authored by M. Kiasadegh, includes measurements from three tandem instrument setups using a DMA, CPMA, and AAC, with mass set points from 0.05 fg to 5 fg and aerodynamic diameters from 50 nm to 500 nm. Results detail the impact of instrument resolution, flow mode, and characteristic scan time on measurement accuracy and operational range.
Geoscience Australia Data presents a novel ensemble tide modelling approach for optimizing coastal tide predictions. The method combines sea levels from satellite altimetry with optical remote sensing data, specifically the Normalized Difference Water Index (NDWI), to evaluate and rank 10 leading global tide models. The approach was validated against independent tide gauge observations across Australia's diverse tidal environments and is based on freely available satellite data and open-source tools.
A repository of raw PDF files for papers from the NeurIPS conference, part of a larger AI conference and journal papers project. The dataset is maintained by GenAI4ELab and was last updated on June 17, 2026. This shard contains only the binary PDF files, with searchable metadata available in a separate main repository.
Additional file 2 from a study on recessive lethal variants in Friesian horses. The dataset includes 70K SNP Chip data from 8,263 horses, with tables covering filtered SNPs, haplotype sequences, and mating outcome statistics. It was authored by Marije J. Steensma and last updated on 2026-04-17.
A series of stratigraphic framework maps for the Saskatchewan Phanerozoic Fluids and Petroleum Systems (SPFPS) project. The maps were produced using 2 km equi-spaced modified grids generated from a kriging algorithm and incorporate validated data from multiple Ministry projects and wells in adjacent jurisdictions. The dataset is provided by the Government of Saskatchewan.