Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,671 datasets
Lithostratigraphy, grain sizes, and down-hole logs from Ocean Drilling Program Sites 1166 and 1167 reconstruct glacial processes in eastern Prydz Bay, Antarctica. The Australian Ocean Data Network hosts this dataset, which was last updated in May 2026. Sediment analysis indicates repeated advances and retreats of the Lambert Glacier-Amery Ice Shelf system from the upper Pliocene to mid Pleistocene.
Geoscience Australia contracted EOMAP to provide high-resolution (10m) Satellite-Derived Bathymetry (SDB) for Priority Australian Seabed Mapping Sites. The dataset covers an area within the Kimberley Region in Western Australia, including Ashmore Reef, Browse Island, Cartier Island, Clerke Reef, Cunningham Island, Mermaid Reef, Scott Reef, and Seringapatam Reef. These data provide an environmental baseline for long-term monitoring and management of Marine Parks.
A hydrogeological inventory for the Laura Basin in Australia, containing descriptive attribute information grouped into themes like location, geology, hydrogeology, and groundwater management. The dataset describes sedimentary rocks deposited between 168 and 102 million years ago, with strata reaching a maximum thickness of about 1,000 meters. It is provided by the Australian Ocean Data Network via data.gov.au.
A collection of Indian appellate judgments paired with machine-generated FIRAC annotations: facts, issues, rules/authorities, analysis, and conclusion. It is the minimal public release for the FIRAC Appeals paper and will be expanded incrementally. The dataset was created by mborcin and was last updated on 2026-06-07.
April 2024 survey data collected by EGS onboard the RV Bold Explorer for the SSCN cable route project. The dataset contains bathymetry, seabed features, and shallow geology measurements within the South-west Corner Marine Park, Australia. It is provided by the Australian Ocean Data Network and is not intended for navigational purposes.
Data and associated text from three published neuroimaging studies on language processing in the brain. The 229.2 MB ZIP file was authored by Ebrahim Feghhi and last updated in April 2026. It aggregates datasets from the Pereira 2018, Fedorenko 2016, and Blank 2014 studies.
A standardized and reformatted version of the corefud-1-4 coreference resolution dataset. The repository provides a unified document structure across multiple coreference datasets to simplify cross-dataset comparison and multilingual experimentation. It was created by lattice-nlp and last updated on June 11, 2026.
A research paper explores the nature and extent of tsunami hazard to NSW coastal communities and informs tsunami emergency planning and management. The work outlines results of recent risk scoping examining hazard sources, tsunami history, and inundation studies for selected sites. It was published by the Australian Ocean Data Network and last updated on 2026-05-05.
The Australian Ocean Data Network provides descriptive attribute information for the Money Shoal Basin, a large passive margin basin in northern Australia. The dataset groups information into themes including location, geology, hydrogeology, groundwater management, and land use. The basin's sedimentary succession spans the Mesozoic to the Cenozoic, reaching a maximum thickness of 4,500 meters.
ACT Government data showing priority enrolment areas for schools in the Australian Capital Territory for the 2026 enrolment year. The dataset is updated annually by ACTmapi, with the latest update on 2026-04-26. Enrolments for the next calendar year generally open in April.
SGI-Bench is a benchmark for evaluating Scientific General Intelligence in large language models across the full inquiry cycle. It spans 10 scientific disciplines and contains more than 1,000 expert-curated samples inspired by Science's 125 Big Questions. The dataset was created by InternScience and last updated in June 2026.
SGI-Bench is a benchmark for evaluating Scientific General Intelligence (SGI) of LLMs across the full inquiry cycle. It spans 10 scientific disciplines and contains more than 1,000 expert-curated samples inspired by Science's 125 Big Questions. The dataset was created by InternScience and was last updated on June 2, 2026.
Total magnetic intensity (TMI) data from the GA302 Capel and Faust Basins MSS survey acquired in 2006 for Geoscience Australia. The processed data measures variations in the Earth's magnetic field to reveal sub-surface geological structure and was quality-checked by GA geophysicists. The survey also collected seismic reflection, gravity, swath bathymetry, and seafloor dredge samples.
Produced between 2003 and 2008, with regular updates, this dataset provides a nationally consistent geographic classification of geological landscapes in Wales. It was devised by the former Countryside Council for Wales and is maintained by the Government Digital Service. The data include both objective and subjective information in polygon form, with associated textual descriptions to inform sustainable decision-making.
Implemented in 2014, this dataset lists retail gasoline outlets in New York State subject to the FuelNY emergency preparedness law. It likely contains station locations and compliance details to maintain fuel supply during energy emergencies like Superstorm Sandy. The data is available in multiple formats and appears on several government data platforms.
A dataset from the Australian Ocean Data Network published on 2026-05-05 presents findings on the first confirmed presence of dolomite and magnesite in living crustose coralline algae Hydrolithon onkodes. The data likely contains chemical micro-analysis results quantifying mineral phases like magnesium calcite, dolomite, and magnesite within algal skeletons. This research addresses the long-standing 'Dolomite Problem' by linking modern algal mineral formation to ancient reef dolomites.
2018 heat vulnerability index features for Metropolitan Melbourne, represented by polygons based on 2016 Australian Bureau of Statistics Statistical Area Level 1 (SA1) boundaries. The dataset is part of the Plan Melbourne Action 91 initiative, also known as Cooling & Greening or Vegetation and Urban heat mapping, and was published by the Department of Transport and Planning.
Surveys conducted from May 29 to June 19, 2017 collected seabed sediment samples in inner Darwin Harbour (GA0358) and shallow water areas in and around Bynoe Harbour (GA0359). The dataset comprises grain size measurements; the surveys were conducted by Geoscience Australia, the Australian Institute of Marine Science, and the Northern Territory Government. This work is part of a four-year (2014-2018) science program to create baseline habitat maps for marine resource management.
UNHCR Afghanistan conducted post-distribution monitoring for two cash-based intervention programs in the Eastern Region in 2020, covering cash for protection and cash for shelter. The monitoring focused on 13,792 households receiving cash for protection and 506 households receiving cash for shelter across Kunar, Laghman, Nangarhar, and Nuristan provinces. The data collection aimed to assess program efficiency, market access, cash use, unmet needs, and coping strategies to improve future intervention design.
Geoscience Australia conducted a regional mapping program addressing stratigraphic and structural exploration risk in the Triassic succession of the Roebuck Basin. The data pack comprises seismic horizon grids and isochron grids generated from the TR10.0_SB, TR17.0_SB, and J10.0_SB horizons, along with fault maps. The grids were created using 2D and 3D seismic surveys, including AGSO s110, AGSO s120, PGS New Dawn, and several 3D surveys, tied to wells via synthetic seismograms.