Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,965 datasets
The central Great Barrier Reef Province is characterised by a mainland prograding terrigenous clastic shoreline and an inner shelf dominated by fluvially derived mud. This dataset, sourced from Geoscience Australia Data, models sedimentation patterns from the Burdekin River region, describing coastal progradation rates decreasing from 2.5 m yr-1 to 0.1 m yr-1. It was last updated on 2026-05-14.
Monitoring data from Boulder Reef in the Northern Great Barrier Reef captures sediment and water flux before, during, and after Tropical Cyclone Dominic. The dataset likely contains measurements of water velocity, sediment load composition (including particulate organic and inorganic carbon), and terrestrial clay inputs. It is provided by Geoscience Australia Data and was last updated on 2026-05-14.
415,090 line-kilometres of airborne radiometric data were collected over Western Australia in 2024 by the WA Government. The data has been processed to provide point-located concentrations of potassium, uranium, and thorium at the Earth's surface. This reduced dataset is intended for geological mapping, mineral exploration, and environmental studies.
Quarterly quality indicators for the Hospital San Rafael de Facatativรก and its affiliated sites, generated by each of the hospital's processes. The dataset includes columns for the 1st, 2nd, 3rd, and 4th trimesters of 2022 and 2023. It was published on the Colombian open data platform www.datos.gov.co and last updated in May 2026.
5028 Block 4 (Northeast) reduced radiometric point-located data was acquired in 2024 by the WA Government. The dataset consists of 415,090 line-kilometres of gamma-ray spectrometric data collected at 100m line spacing and 50m terrain clearance. It is processed using methods like NASVD filtering and tie-line levelling to highlight concentrations of potassium, uranium, and thorium.
Total magnetic intensity (TMI) data measures variations in the Earth's magnetic field caused by rock-forming minerals. This grid covers the Narryer survey area, processed with reduction to pole and first vertical derivative, with a cell size of approximately 20 meters. The data was acquired in 2024 by the WA Government, consisting of 415,090 line-kilometres of data.
Northern African and Arabian Middle Stone Age and Middle Palaeolithic lithic artefacts are analyzed using latent class modelling. The dataset includes supplementary files for a study comparing LCM with traditional typology and hierarchical clustering. It was created by Lucy Timbrell and last updated in April 2026.
The Clermont 4-mile Sheet area in Queensland exposes parts of three major structural units: the pre-Devonian Anakie Inlier, the Lower Carboniferous Drummond Basin, and the Permian-Triassic Bowen Basin. The description details lithologies, thicknesses, and unconformities, such as the 14,000 to 20,000 feet of Drummond Beds and the 850 feet of Triassic Carborough Sandstone. This geological data is provided by Geoscience Australia.
Northumberland County Council provides the Green Belt boundaries and strategic policy for Northumberland, excluding the National Park area. The dataset consolidates boundaries from former district plans and defines the revised Green Belt extent as of the adopted Local Plan from March 2022. It is served via multiple geospatial formats including Feature Server and WFS.
Lorenzo Brunetti's 2026 study investigates the potential for transitioning to a more sustainable coffee value chain for smallholder farmers in Central Kenya. The research applies a participatory and systems-based approach, using qualitative inference methods to identify leverage points for sustainability. Findings suggest current information sharing, agroecosystem diversification, and certification schemes can foster transitions, but highlight a misalignment with policy-driven yield intensification.
12,102 question-answer pairs in CommonsenseQA require real-world commonsense reasoning. 5,957 four-option multiple-choice questions in OpenBookQA are based on student-level scientific knowledge, and 12,723 questions from the USMLE comprise the MedQA-USMLE biomedical dataset. The collection, uploaded by daiyi Li to figshare under CC-BY-4.0, is packaged in a 466.7 MB RAR file.
SynthLabs Chat Final Cleaned v3 is a cleaned instruction-following chat dataset for supervised fine-tuning of language models. Each example is a conversation with explicit chain-of-thought reasoning separated from the final answer. The dataset was authored by mkurman and last updated on June 20, 2026.
A 2026 cross-sectional study surveyed 236 Israeli respondents aged 18โ50 on eating attitudes and Holocaust trauma transmission. The dataset includes responses from 130 third-generation Holocaust survivors and 106 non-third-generation participants to the EAT-26 questionnaire and a custom survey. It was authored by Goni Biran and published on figshare under a CC-BY-4.0 license.
City of Hobart provides geospatial data representing planning scheme overlays under the Hobart Interim Planning Scheme 2015. The dataset, created by HCCGISICT and last updated in April 2026, indicates the location of Council services and regulations for a specific area plan. The data is provided with a disclaimer regarding potential errors and is recommended for general guidance rather than definitive project planning.
City of Hobart Open Data provides geospatial overlays from the Hobart Interim Planning Scheme 2015. The data, last updated on 2026-04 11, is published by HCCGISICT and is intended to indicate the general location of Council services. It is available in multiple formats including CSV, KML, and GeoJSON.
A 2.6 MB document details a study where repetitive transcranial magnetic stimulation alleviated pain behaviors in a rat neuropathic pain model. The research includes behavioral testing and molecular, histological, and ultrastructural analyses of the spinal cord and sciatic nerve. Daniel Youngsuk Kim authored this dataset, which was last updated on May 7, 2026.
A mathematical analysis of the perturbed nonlinear Biswas-Milovic equation with Kudryashov's law of refractive index. The study, authored by Yuxi Zhang, derives Gaussian soliton solutions and explores chaotic dynamics via perturbation terms. The work provides a classification of traveling wave solutions and chaotic properties.
Cabo Ortegal in northern Spain is the source for this collection of reflected light microscopy images, backscattered electron images, element maps, and laser ablation ICP-MS data for Fe-Ni-Cu sulfide minerals. The data were acquired in 2021 and 2022 from dunite, harzburgite, and pyroxenite samples held at Cardiff University. It was gathered to understand concentrations and mineral forms of precious and semi-metal trace elements in sulfides.
High-precision mechanical testing data from in-situ micropillar compression experiments on synthetic forsterite bicrystals at 700ยฐC. The dataset, associated with a pre-print manuscript (DOI: 10.22541/essoar.167979601.17867144/v1), compares deformation between pillars containing low-angle (4ยฐ tilt) and high-angle (60ยฐ tilt) grain boundaries and those in the crystal interior. Data was produced under NERC Grant NE/S00162X/1 and hosted by the British Geological Survey.
Digitised magnetic records from the ten-day period from 25th August to 5th September 1859, encompassing the Carrington solar storm. The dataset is based on digital images from the BGS online archive and observatory yearbooks, scaled to SI units with quasi-minute cadence spot values. It was created by the British Geological Survey and includes data in ASCII text files and IAGA-2002 formatted files.