Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,979 datasets
Cabo Ortegal in northern Spain is the source for this collection of reflected light microscopy images, backscattered electron images, element maps, and laser ablation ICP-MS data for Fe-Ni-Cu sulfide minerals. The data were acquired in 2021 and 2022 from dunite, harzburgite, and pyroxenite samples held at Cardiff University. It was gathered to understand concentrations and mineral forms of precious and semi-metal trace elements in sulfides.
High-precision mechanical testing data from in-situ micropillar compression experiments on synthetic forsterite bicrystals at 700°C. The dataset, associated with a pre-print manuscript (DOI: 10.22541/essoar.167979601.17867144/v1), compares deformation between pillars containing low-angle (4° tilt) and high-angle (60° tilt) grain boundaries and those in the crystal interior. Data was produced under NERC Grant NE/S00162X/1 and hosted by the British Geological Survey.
Digitised magnetic records from the ten-day period from 25th August to 5th September 1859, encompassing the Carrington solar storm. The dataset is based on digital images from the BGS online archive and observatory yearbooks, scaled to SI units with quasi-minute cadence spot values. It was created by the British Geological Survey and includes data in ASCII text files and IAGA-2002 formatted files.
Digital Geological Map Data of Great Britain - 25k (DiGMapGB-25) Artificial version 2 identifies 167 10x10km tiles of landscape areas where the ground surface has been significantly modified by human activity. The data, provided by the British Geological Survey (BGS), includes polygons attributed with types of artificial ground such as disturbed, infilled, made, and worked ground. This version 2 release added tiles in 2012, but the data represents a snapshot in time and may become dated rapidly.
167 vector map tiles identify landscape areas attributed with geological names for bedrock geology at a 1:25,000 scale. The data is produced by the British Geological Survey (BGS) and covers selected 'classic' geological areas in Great Britain, such as Llandovery, Coniston, and the Cuillan Hills. It classifies rock types from igneous, metamorphic, and sedimentary classes, with ages ranging from the Archean eon to the Pliocene.
Great Britain's landscape areas where the ground surface has been significantly modified by human activity, categorized into types like made ground and infilled ground. The data is provided as vector polygons by the British Geological Survey and represents a snapshot in time. It was last updated on 2026-05-28.
A 1:50,000 scale digital geological map covering the bedrock of England, Wales, Scotland, and the Isle of Man. The data, created by the British Geological Survey, identifies landscape areas as polygons attributed with geological names, rock types, and ages. Bedrock formations range from the Archean eon, around 7500 million years ago, to the Pliocene, 58 million years ago.
Onshore coverage is provided for all of England, Wales, Scotland, and the Isle of Man. The data identifies linear features representing geological faults at a scale of 1:250,000, attributed by fault type (contact and non-contact). It is a generalised set of major faults, inferred from landscape evidence, created by the British Geological Survey.
Polygon data identifying landscape areas across England, Wales, Scotland, and the Isle of Man, attributed with geological names and rock type descriptions. The British Geological Survey created this dataset, which provides a generalized 1:250,000 scale view of the bedrock geology, including igneous, metamorphic, and sedimentary rocks formed from 3500 to 58 million years ago. Rock types are classified using the BGS Rock Classification Scheme, and ages are identified via the BGS Lexicon of Named Rock Units.
Geological faults across the United Kingdom, including England, Wales, Scotland, the Isle of Man, and Northern Ireland, mapped at a 1:625,000 scale. The data from the British Geological Survey represent simplified interpretations of major faults, categorized as faults at rockhead or thrust faults. This version 5 dataset is available as vector data in OGC GeoPackage format under the Open Government Licence.
AST14DEM Version 3 was decommissioned on December 15, 2025. The Terra ASTER Digital Elevation Model product is generated from stereoscopic data acquired by the Visible and Near Infrared sensor, with a time lag of approximately one minute between nadir and backward images. Validation testing indicates the new system frequently produces DEMs with accuracies better than 25 meters root mean square error.
Two lodgepole pine stands on xeric sites had total net primary productivity ranging from 780 to 960 g/m2/yr, while two on mesic sites each had 1,190 g/m2/yr. This dataset provides estimates of above- and below-ground biomass and net primary productivity for selected boreal forests in North America and Eurasia. Data was compiled from a review of published literature, with individual site records spanning a temporal range from 1965 to 1995.
A simulation-based framework predicts thermodynamic equilibrium and transport properties for pure CO2, pure n-butane, and their binary mixture. The dataset, last updated on 2026-04-28, is based on fundamental Helmholtz energy equations of state constructed from molecular simulation data. It was created by Denis Saric and shared under a CC-BY-NC-4.0 license.
Over 250,000 years of sediment data from Lake Chala, Kenya/Tanzania, including tephra layers, Ar-dates, and stable isotope measurements. The dataset comprises six tables with chronological and geochemical data generated by researchers between 2017 and 2022. These data support the reconstruction of palaeoclimate from the Lake Chala sediment sequence.
Iron isotope ratio measurements (δ57Fe) for bulk rock samples of basalts and picrites from the Galápagos mantle plume system. The dataset includes samples from three evolutionary periods: the 70- to 90-Ma plume head, 60- to 70-Ma transitional terranes, and the modern (<2 Ma) plume. Measurements were performed at the University of Cambridge using a Neptune Plus MC-ICPMS, with typical 2 SEs better than 0.02‰.
Supplementary table S1 contains data on morphological and biochemical characteristics of lime (Citrus aurantifolia Swingle) fruits. The dataset compares fruit quality attributes from two major oasis cultivation regions in Saudi Arabia: Alhasa in the East and Alula in the Northwest. It was authored by Nouf Alshareef and last updated on 2026-05-30.
Michael Jones authored a benchmark comparing 9 large language models on an 800-household test case. The dataset, last updated on June 2, 2026, records distributional fit, structural feasibility, and generation time metrics. It is a small, 5.5 KB Excel file shared under a CC-BY-4.0 license.
2026 data from the Government of Yukon provides Curie point depth (CPD) contours at 1 km intervals, demarcating 95% of Yukon into shallow and deep CPD regions. The mapping was derived from public aeromagnetic data using two methodologies and window sizes, with results broadly consistent across methods and correlated with surface heat flow measurements.
Yukon's subsurface heat flow is analyzed through Curie point depth mapping using public aeromagnetic data. The study demarcates 95% of the territory into regions of shallow or deep CPD, corresponding to higher or lower heat flow. Results are presented as a gridded dataset with 1 km interval contours.
SpeechArenaBench is a large-scale human-preference dataset for evaluating multilingual Text-to-Speech systems. It contains benchmark sentences, generated audio, and crowd-sourced pairwise preference judgments collected from native raters across 10 Indian languages. The dataset was created by ai4bharat and accompanies a paper accepted to Interspeech 2026.