Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,905 datasets
ESGenius is an EMNLP 2025 Main Conference Oral benchmark for evaluating large language models on Environmental, Social, and Governance (ESG) and sustainability knowledge. The paper was nominated for the EMNLP 2025 Resource and Theme Paper Awards, Top 1%. It was authored by cy0307 and last updated on June 15, 2026.
The Australia’s Future Energy Resources project reinterpreted the geology of the Pedirka and western Eromanga basins using new seismic and biostratigraphic data. This dataset likely contains interpretations of sedimentary records from the early Paleozoic to Late Cretaceous, focusing on fluvial-lacustrine and shallow marine environments. It was produced by the Australian Ocean Data Network as part of the Exploring For The Future program.
A briefing package prepared for a Standing Committee on Public Accounts hearing on October 21, 2025. The document likely contains analysis and summaries of the Auditor General of Canada's 2025 Fall Reports. It was published by the Office of the Auditor General of Canada and last updated on the open_canada platform in May 2026.
A seamless topographic colour mapping service for the whole of Australia, including its outer islands and external territories. The map is compiled from multiple sources including Geoscience Australia, the Australian Antarctic Division, and OpenStreetMap, with data for specific islands sourced from NATMAP datasets at 1:30,000 and 1:25,000 scales. Vegetation data for the continent is aggregated from the Australian Collaborative Land Use and Management Program.
The central Great Barrier Reef Province's Cainozoic evolution is detailed using shallow, intermediate, and deep focus seismic reflection profiling. The data describes depositional episodes from the Late Cretaceous to Pleistocene, including alluvial fan, deltaic, and reef facies development. It was published by Geoscience Australia Data and last updated on 2026-05-14.
125 gravity provinces and their subdivisions have been defined and named based on land and marine reconnaissance gravity surveys across Australia and its continental margins. The dataset rationalizes boundaries and nomenclature from individual surveys into a consistent regional pattern, now that gravity coverage is virtually complete. It originates from Geoscience Australia Data and was last updated on 2026-05-14.
The Davis and Mawson Sea continental shelves between 85°E and 115°E were surveyed during the RV Polarstern Expedition PS141 from February to April 2024. The expedition collected over 7500 nautical miles of hydroacoustic data, revealing glacial landscapes including iceberg scours and grounding zone wedges. This dataset presents preliminary results from the EASI-3 project, hosted by the Australian Ocean Data Network.
A 9,824-foot deep exploratory well drilled in far southwest Queensland between December 1959 and April 1960 was found to be a dry hole. The operation, conducted by Delhi Australian Petroleum Ltd, Frome-Broken Hill Company Pty Ltd, and Santos Limited, included a comprehensive program of electric and mud logging, testing, and coring. Several hydrocarbon showings were detected between 4,400 and 5,757 feet but were deemed noncommercial, with porous zones found to be water-bearing.
415,090 line-kilometres of reduced radiometric point-located data were acquired in 2024 by the Western Australian Government. The data measures gamma-ray emissions from potassium, uranium, and thorium decay for geological and environmental applications. It has been processed with noise filtering, background corrections, and levelling techniques.
415,090 line-kilometres of reduced radiometric point-located data were acquired in 2024 by the WA Government. The data, processed with noise filtering and corrections for background radiation and height attenuation, measures concentrations of potassium, uranium, and thorium in the ground surface. It was collected at 100m line spacing and 50m terrain clearance.
Geoscience Australia provides a grid of Total Magnetic Intensity (TMI) data for the 5028 Block 3 (Northwest) region. The grid has a cell size of approximately 20 meters, with data values in nanoTesla (nT). The source data were acquired in 2024 by the WA Government and comprise 415,090 line-kilometres at 100m line spacing and 50m terrain clearance.
415,090 line-kilometres of airborne magnetic data were acquired in 2024 by the WA Government to create this geophysical grid. The Total Magnetic Intensity grid has been processed with Reduction to Pole and a first vertical derivative, resulting in units of nanoTesla per kilometer. Geoscience Australia geophysicists performed quality checks to ensure the final data is fit-for-purpose for revealing sub-surface geological structure.
A benchmark for evaluating patent novelty search systems, created by PatSnap and last updated in June 2026. Each sample contains a query patent publication number and ground truth novelty-destroying prior art references identified by examiners. The dataset is a 50% public release of an internal full evaluation set combining cross-jurisdiction and single-jurisdiction sample types.
Phanerozoic rocks in onshore Western Australia are described, focusing on sedimentary basins and associated mineral deposits. The dataset, from Geoscience Australia Data, covers geological formations from the Palaeozoic to Cainozoic eras. It details basin-specific characteristics and strata-bound mineralisation, including base metals, coal, mineral sands, evaporites, diamonds, and iron ore.
A 5.5 KB Excel database details the machine modifications and beam characteristics for ultra-high dose-rate FLASH radiation therapy. Gyu-Seok Cho created this dataset to compile information on the biological effects of FLASH beams; the maximum dose rate reported is 339.1 Gy/s. The data was last updated on April 17, 2026.
Seven mineral deposits in the Cobar-Nymagee area show strong mineralogical and chemical zoning. The deposits are contained within distal turbidite facies of the Devonian Cobar Supergroup. The evidence suggests a syn-sedimentary exhalative origin in a non-volcanic environment, related to rifting and growth faults.
Sediment cores and seismic data were collected from East Antarctica's Mac. Robertson Shelf and western Prydz Bay in 1993, 1995, and 1997. The dataset includes Paleocene and Eocene foraminifera, pollen, spores, dinoflagellates, and other fossils recovered from weakly lithified coastal plain sediments. It was published by Geoscience Australia Data.
The Browse Basin region of Australia's North West Shelf is the focus of this cruise proposal. The document outlines plans to acquire up to 3600 km of deep seismic and other geophysical data along 11 lines, tying into 18 exploration wells. The survey is part of a major regional research program by Geoscience Australia (AGSO) to determine the basin's structural framework and assess its hydrocarbon potential.
Tommaso Mario Buonocore released novel datasets of Italian pre-university and post-university medical exam questions as part of a comparative evaluation of large language models. The datasets consist of five-choice questions covering clinical and preclinical fields, stored in an XLS file of 5.5 KB. The data was published on figshare in April 2026 under a CC-BY-4.0 license.
5.5 KB of tabular data from figshare, authored by Dandan He and last updated on 2026-04-17. The dataset likely contains results from a sensitivity analysis of an improved band selection algorithm for hyperspectral images. The proposed SSGIE-KFCM method reportedly achieved over 90% average classification accuracy on the Indian Pines and Pavia University datasets.