Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,087 datasets
Miocene to Pleistocene foraminiferal assemblages from dredge samples at two sites on the Marion Plateau, offshore Queensland. The dataset documents distinct geological histories: a northern Early-Middle Miocene platform with subaerial exposure and later infill, and a southern Late Miocene platform with a hardground and slope mounds. Faunal evidence suggests shallow-water deposition (<50m) and periods of non-deposition, aiding reconstruction of the platform's paleoenvironmental evolution.
A 2026 baseline study surveyed ten reef sites along Oman's northern coast, documenting five disease types across nine coral genera. The dataset provides the first quantitative assessment of coral diseases in Omani waters, with an overall low average disease abundance of 0.17 cases per square meter. It was authored by Thangadurai Thinesh and published under a CC-BY-4.0 license on figshare.
Australian Ocean Data Network hosts a palynological study of Aptian and Albian sediments from the Surat Basin in southeastern Queensland and northeastern New South Wales. The work documents 38 genera and 72 species of spores, 18 genera and 21 species of pollen grains, and 35 genera and 60 species of dinoflagellates, proposing several new species. It outlines a system for classifying Australian fossil dinoflagellate cysts and uses statistical records to approach environmental reconstruction.
Sentence-level text samples from annual, ESG, sustainability, and CSR reports of listed firms in China's automotive supply chain, covering 2012 to 2024. Each sentence is annotated for green strategic orientation, mixed expressions, and nine green action elements. The dataset was created by Chaohui Wang and is available under a CC-BY-4.0 license.
Yukon Digital Geology aggregates multiple geoscience data sets for the Yukon territory. The data includes syntheses of bedrock geology, glacial limits, geochronology, paleontology, mineral occurrences, oil and gas wells, and aeromagnetic images. Data is organized into 45 map tiles corresponding to the National Topographic System 1:250,000 quadrangles and is provided by the Government of Yukon.
A dataset of food consumption and water use for 280 Chinese cities from 2014 to 2022, developed using a downscaling model based on a feedforward neural network. Yu Yu created this dataset to measure food–energy–water (FEW) nexus efficiency and examine its association with cadres’ intercity transfer experience. The data shows FEW nexus efficiency increased, especially in cities led by cadres with prior intercity appointment experience.
Ab initio and classical molecular dynamic simulations of supercooled liquid iron alloys at Earth's core conditions, covering temperatures from 3800 K to 6000 K and a pressure of 360 GPa. The data includes time-to-freeze outputs and size-frequency histograms of solid-like atomic clusters, generated using VASP and LAMMPS on the ARCHER2 HPC system between June 2020 and February 2023. This data was collected by researchers from the University of Leeds and University College London as part of NERC grant NE/T000228/1 to study the Inner Core Nucleation Paradox.
Experimental data from dual tests on salt rock cores measures stress-dependent ultrasonic wave velocities, attenuation, and permeability. The dataset includes electrical resistivity tomography and permeability data for water and nitrogen from four geological salt samples. Tests were conducted at the National Oceanography Centre, Southampton, using a high-pressure, room-temperature triaxial flow-through rig.
A numerical study using the discrete element method to analyze soil-nailed slope stability under point and distributed linear surcharge loads. The dataset contains parameters and results from simulations investigating progressive failure, macroscopic stability, and micromechanical behaviors. Authored by Fengling Tan and published on figshare in May 2026.
A numerical test scheme investigates the mechanical response of soil-nailed slopes under surcharge loading using the discrete element method (DEM). The dataset likely contains results from simulations analyzing progressive failure, macroscopic stability, and micromechanical behaviors under point and distributed linear loading. Fengling Tan authored this dataset, which was last updated on May 26, 2026.
More than 4,000 user reviews from Travis County, Texas, were analyzed using three large language models to classify sentiment across categories like charging operation, accessibility, and parking. The dataset contains results from Random Forest regression models showing the influence of walkability, greenery, and amenities on user perception. Authored by Ahyoung Chang and last updated in June 2026, this 9.5 KB Excel file provides a framework for location-sensitive EV infrastructure planning.
Kevin William Richter provides a dataset supporting a systematic revision and taphonomic analysis of the pteriomorph bivalve genus Actinopteria from the Lower Devonian Ponta Grossa Formation in the Paraná Basin, Brazil. The data includes detailed shell morphometric parameters, taphonomic signatures, facies analysis, and ichnological data. It was last updated on 2026-05-19 and is shared under a CC-BY-4.0 license.
A modelled scenario dataset for the Macquarie and Cudgegong Regulated River Water Source, created by removing Plan Environmental Water (PEW) rules and Held Environmental Water (HEW) licences from a Current Conditions model. It was published by the NSW Department of Climate Change, Energy, the Environment and Water and last updated on 2026-05-13. Flow data is provided for eight key monitoring sites, including Cudgegong@YambleBridge and Macquarie@Dubbo.
A web-based interactive mapping and decision support system maintained by Geoscience Australia. It integrates curated government, state, and academic data layers on maritime boundaries, petroleum, fisheries, environment, native title, and regulation. The system is updated as of 2026-06-04.
70 nursing students participated in a pilot workshop evaluating the EASE framework for family-focused mental health practice. Data was collected via anonymous online surveys administered before training and at one-week follow-up, using participant-generated IDs. The dataset, authored by Abigail Dunn and last updated in 2026, is shared under a CC-BY-4.0 license.
Colombian dataset listing entities subject to fiscal oversight by the Auditoría General de la República. The data includes columns for municipality, department, address, and the responsible regional management office. It is published by www.datos.gov.co and was last updated on 2026-05-18.
2,402 taxon names for African reptiles, including 263 genera, 1,893 species, and 246 subspecies, have their etymologies resolved. The dataset was compiled by Peter H Uetz and published on figshare in May 2026. It covers names described until the end of 2025, including those from mainland Africa and some offshore islands.
A psychometric evaluation of a four-item mental health screening scale derived from the PHQ-9 and GAD-7. The analysis is based on data from a follow-up survey of the China Multi-Ethnic Cohort, focusing on community-dwelling adults in rural Yunnan Province. The dataset was authored by Nan Cheng and last updated on 2026-05-29.
UK Biobank data from 398,200 participants was used to evaluate the association of metabolic and inflammation vulnerability indices with incident systemic lupus erythematosus. The study, authored by Dongqi Zhou, followed participants for a median of 13.2 years and found significant associations between the indices and SLE risk. The dataset was last updated on May 8, 2026.
398,200 UK Biobank participants were followed for a median of 13.2 years to assess novel metabolic and inflammation indices as risk factors for systemic lupus erythematosus. The study, authored by Dongqi Zhou and shared under CC-BY-4.0, found a per standard deviation increase in the composite metabolic vulnerability index (MVX) was associated with a 44% higher risk of incident SLE. Results were published in a supplementary document on figshare in May 2026.