Loading...
Loading...
DNA/RNA sequences, gene expression, protein structures, metagenomics, single-cell sequencing
23,820 datasets
A palynological examination of cores and cuttings from the subsidised A.O.G. Wentworth No.1 well in the Murray Basin, New South Wales. The report indicates transitions from Tertiary into Lower Cretaceous marine sediments and into Lower Permian beds, with comparisons to the Oaklands-Coorabin coalfield. The dataset is provided by the Australian Ocean Data Network.
AGSO Journal volume 14, number 1 contains seven peer-reviewed geoscience research articles from the Australian Ocean Data Network. Articles cover topics including Miocene to Pleistocene foraminiferal biostratigraphy, the structure of the Queensland Trough, and a stream-sediment geochemical survey in the Northern Territory. The journal issue was last updated on the platform in May 2026.
Indurated sand layers in Quaternary coastal deposits can record past soil development phases and influence local hydrology. Samples from sand mine pits and an estuarine channel in southeast Queensland were analyzed for cement composition and trace metals. Optically stimulated luminescence and thermoluminescence dating indicate induration processes occurring over periods up to approximately 100,000 years.
V-JEPA 2 ViT-G Embeddings provide precomputed video features for a 62-hour subsample of the BEHAVIOR-1K 2025 challenge demonstrations. The dataset, created by quastAI, aims to accelerate downstream experimentation by eliminating repeated video processing. It was last updated on May 22, 2026.
A 446.7 MB dataset provides a high-quality, haplotype-resolved genome assembly for the wild diploid potato Solanum neocardenasii CIP764035 (C868). It includes chromosome-level sequences in Fasta format and gene annotations in GFF3, CDS, and PEP formats. The dataset was authored by Shuo Zhao and last updated on May 12, 2026.
AlphaFoldDB provides over 246 million predicted protein 3D structures, massively expanding structural coverage for known protein sequences. The dataset, created by LiteFold, is split into deterministic train and test sets based on UniProt accession hashes. It was last updated on May 27, 2026.
Synchronized audio and inertial measurement unit data from simulated augmentative and alternative communication sessions. The dataset contains AAC audio events, human speech, tap gestures, non-tap gestures, and attribution labels, created by author Chen, Szu-Han Kay. It was last updated on June 3, 2026.
A structured catalog from the Meta Governorate details the format, responsible parties, and update schedules for its published information. The schema includes columns for FORMATO, FRECUENCIA DE ACTUALIZACIÓN, and FECHA DE GENERACIÓN DE LA INFORMACIÓN. Data is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Bioassay data from the subantarctic Southern Ocean dissects the 'ferrous wheel' of iron availability. The dataset quantifies uptake rates of heterotrophic bacteria and phytoplankton during summer, comparing light and dark incubations and the effects of dissolved organic carbon supply. Data was contributed by the Australian Ocean Data Network and last updated in April 2026.
Municipal government financial obligations, including bonds and loans, are detailed with transaction dates, interest rates, and outstanding balances. Columns suggest tracking of debt operations from issuance to maturity, with fields for credit codes, interest rates, and payment currencies. The data originates from the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
1.0 MB of sequence data, structural models, and analysis outputs for Early Light-Inducible Proteins (ELIPs) and the Light-Harvesting Complex (LHC) superfamily in Betula platyphylla. The dataset was authored by preetom regon and last updated on 2026-05-03. Files include PDF, CIF, TXT, FA, HMM, FASTA, and TREEFILE formats.
A 87.0 MB directory of graphs visualizing SHOI rankings for species and species groups. The data was authored by Anna F. V. Pintor and last updated on 2026-05-21. It covers biogeographic regions and individual countries.
Annual geospatial data on housing development projects within Metropolitan Melbourne's 32 local government areas from 2005 to 2016. The layer depicts changes in lots and residential dwellings, including demolitions and construction, for projects comprising one or more land parcels. It was created by the Department of Transport and Planning and last updated in April 2026.
Hyperparameter settings for machine learning models, compiled by Sawera Qureshi. The dataset is a 5.5 KB Excel file last updated on May 21, 2026. Its specific row count and column details are not provided in the metadata.
Paul Knabl published a dataset comparing RNA-Seq and ChIP-Seq data related to BMP signaling pathways on figshare in May 2026. The 7.0 MB dataset is available as an XLSX file under a CC-BY-4.0 license. It likely contains gene expression and chromatin binding data from experiments involving BMP2/4 morpholino knockdowns and anti-pSMAD1/5 antibodies.
David Bahamón-Pinzón created a dataset describing the application of Community-Based Participatory Research principles to a specific research project. The dataset is stored as an XLS file with a size of 13.5 KB. It was last updated on May 21, 2026.
Hyperparameter Settings for the Proposed Siamese BiLSTM Model is a 5.5 KB dataset authored by Weihong Zhao and last updated on 2026-05-21. It is shared under a CC-BY-4.0 license on the figshare platform. The specific hyperparameter values and model configuration details are contained within an XLS file.
A 5.5 KB Excel file summarizes image property preferences extracted from a scientific figure. The data, authored by Emil Dmitruk, was last updated on May 14, 2026. Its small size suggests it likely contains a focused set of aggregated metrics or ratings.
Six of eight comparisons show a significantly higher slope for the permissive survey than the restrictive survey. The dataset is a 9.5 KB Excel file authored by Robert G. Badgett and last updated on May 21, 2026.
Reference sequences used as queries for genome-wide protein identification via BLASTp. The dataset is a small 11.6 KB Excel file authored by Junyi Ren and last updated on May 21, 2026.