Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,366 datasets
72 species of spores, 21 species of pollen grains, and 60 species of dinoflagellates are documented in this palynological study of Aptian and Albian sediments. The Australian Ocean Data Network provides stratigraphic sequences and taxonomic descriptions for marine and terrestrial microfossils from southeastern Queensland and northeastern New South Wales. Statistical analysis of abundance fractions is used for tentative environmental reconstruction.
2005 magnetic intensity data acquired by the WA Government measures variations in the Earth's magnetic field. The processed grid has a cell size of 0.0025 degrees (approximately 266m) and units are in nanoTesla. The data is quality-checked by GA geophysicists to reveal geological structures.
A 1.6 MB replication package from a 2026 study on generating requirements from code. It contains datasets from a human-in-the-loop experiment, the experimental framework implementation, prompts, and evaluation code. The package was authored by Alexander Korn and is shared under a CC-BY-4.0 license.
The Great Artesian Basin occupies 1.7 million km² across parts of Queensland, New South Wales, South Australia, and the Northern Territory. Data from the Australian Ocean Data Network describes the basin's multi-layered aquifer system, including aquifer thicknesses, hydraulic properties, and historical well discharge rates. The dataset likely contains information on the basin's structure, recharge zones, and groundwater flow.
Two 1:25 million scale maps depict Australia's gravity field. The Free-air Anomaly map is a smaller version of an unpublished overlay for a 1976 geological map, while the Gravity Map shows Bouguer anomalies on land and free-air anomalies at sea. Both maps were prepared by BMR's Geophysical Drawing Office and printed by the Division of National Mapping, incorporating data from the 1976 Gravity Map and additional marine observations from the 'Gulf Rex' vessel.
365 surface and near-surface seabed samples provide the basis for an assessment of regional lithofacies variations on the Tasmanian shelf and in eastern and western Bass Strait. The dataset likely contains information on sediment types, heavy mineral suites, and geochemical analyses, including maximum values of 10 ppm Sn and 3.6 percent phosphate. It was aggregated by the Australian Ocean Data Network and last updated in May 2026.
Scottish local authority data compiled under the Community Empowerment (Scotland) Act 2015. The register amalgamates information on asset transfer requests, common good property, and allotments, including details on land availability and food growing strategies. The dataset is provided by the Scottish Government via SpatialData.gov.scot and was last updated in May 2026.
6,013 participants from two large aging cohorts were analyzed to assess the association between a composite metabolic-nutritional index and incident cardiovascular disease. The dataset contains results from Cox models and restricted cubic spline analyses, showing a significant association for the continuous TCBI variable. Author Peijian Wang published this analysis under a CC-BY-4.0 license on figshare in April 2026.
Harmonized data from the China Health and Retirement Longitudinal Study (2011–2018) and the English Longitudinal Study of Aging (2002–2018). This dataset, authored by Peijian Wang, includes 6,013 community-dwelling adults aged 45+ and free of CVD at baseline, analyzing the association between the triglyceride–cholesterol–body weight index and incident cardiovascular disease.
Audit results from regional comptrollers for Colombian state entities, consolidated by the National Auditor General. The data includes audit ratings, counts of findings, and financial amounts for fiscal, disciplinary, administrative, and criminal infractions. Information is sourced from the SIA system and other sources, with the dataset last updated on 2026-05-18.
Marion Plateau offshore Queensland provides planktic and benthic foraminiferid data from dredge samples at two sites. The dataset likely contains thin-section analyses describing carbonate platform development, cavity infills, and borings from the Early Miocene to Pleistocene. It was contributed by the Australian Ocean Data Network and last updated in May 2026.
A geospatial map shows mineral occurrences and deposits within Australia's 200 nautical mile exclusive economic zone and extended continental shelf. The map draws together data from published and unpublished marine research surveys and government records, covering minerals like manganese nodules, heavy mineral sand, phosphorites, diamonds, tin, copper, gold, and coal. It was produced in August 2006 through a collaborative project involving Geoscience Australia, CSIRO, and State and Territory Geological Surveys.
Geoscience Australia conducted marine surveys GA-310 and GA-2476 in 2008-2009, acquiring about 26,000 line km of new gravity and magnetic data. This record describes the integration and levelling of this new data with approximately 150 previous surveys since 1960 to create a unified dataset. The processed data is intended for constraining regional tectonics, basin structure, and petroleum prospectivity.
High-resolution 2D seismic, gravity, magnetic, and multibeam bathymetry data acquired in 2006-2007 for the Capel and Faust basins offshore eastern Australia. Geoscience Australia synthesized this data to derive 2D and 3D geological information, contributing to understanding of basin structure and tectonic reactivation. The analysis was presented at the Australian Earth Sciences Convention in July 2010.
80,489,226 rows of raw English text form this multi-domain corpus engineered for pre-training BERT-style models via Masked Language Modeling. The dataset, created by 8Opt, aggregates text stripped of metadata and labels to focus purely on language. It was last updated on June 20, 2026.
12 genera and 13 species of marine invertebrate macrofauna from the upper Permian strata of eastern Australia, including one newly recognised genus. The data, published by the Australian Ocean Data Network, establishes three biostratigraphic zones and correlates formations across the Bowen and Sydney Basins. Faunal diversity is described as relatively low, likely reflecting restricted marine conditions at the end of the open sea in these basins.
9086 cores have been modelled over a total area of 4912 km2, defining the extent of the 2011 version of the Digital 3D Geological Model of the Latrobe Valley Coal Resource. The model, based on pre-mine topography, integrates a century of coal delineation work and is published by the Department of Energy, Environment and Climate Action on data_gov_au. It was last updated on 2026-04-09.
IMCRA Mesoscale V4 is a regionalisation of Australian coastal waters to the 200-meter isobath, derived from biological and physical data such as coastal geomorphology, tidal attributes, and oceanography. The dataset was compiled for Version 3.1 from the most up-to-date data available as of 7 March 1997 by Environment Australia, with information supplied by state, territory, and Commonwealth agencies. It includes named regions and their locations within provinces, with its seaward extent defined by the 200m isobath or the Australian Exclusive Economic Zone boundary.
MagicHub's dataset provides multi-stream audio for automatic speech recognition training. Each speaker's audio track is captured and labeled separately to preserve natural conversational phenomena. Recordings are in English at 16 kHz and 16-bit resolution, captured using mobile device microphones.
Government of Yukon data describes two types of precious metal occurrences in volcanic rocks near Dawson City. Lithology includes Precambrian to Paleozoic metamorphic rocks, Paleozoic ultramafic rocks, and various dikes and sediments. Both mineralization types are characterized by four distinct stages.