Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,603 datasets
Australian Capital Territory stormwater sump locations owned or managed by the City and Environment Directorate. Attributes likely include location description, suburb, ownership, maintenance responsibility, asset subtype, and structural details. The dataset is maintained through a works-as-executed handover process or field audits and was last updated in April 2026.
12 genera and 13 species of marine invertebrate macrofauna are described from the upper Blenheim Subgroup and Kulnura Marine Tongue. The fauna is assigned to three biostratigraphic zones and indicates a hiatus within the Blenheim Subgroup. This dataset from the Australian Ocean Data Network appears to document faunas younger than Kungurian but not younger than Kazanian.
South-western Victoria and the contiguous part of South Australia, specifically the Portland-Nelson-Mt Gambier area, are the focus of this geological report. The dataset is a compilation of geological, bore-hole, and well data, along with a geological sketch map, created by the Australian Ocean Data Network. It was last updated on 2026-04-28.
Synthetic data for urban logistics scenarios supports the EMGRO framework study. The dataset includes customer orders, delivery locations, and pickup locations for evaluating routing efficiency and emissions. Author Metin Özşahin published the data on figshare in April 2026.
Eight seismic profiles totalling about 2000 km, plus bathymetric data, were collected in February 1992 to assess seabed morphology and offshore mineral resources. The data enabled compilation of a new 1:1,000,000 scale bathymetric map and the first sediment thickness map for the area. This work was conducted by AGSO (now Geoscience Australia) following Australia's declaration of a 200-mile Fisheries Zone around Christmas Island.
December records from 2013 to 2020 detail affiliations to Colombia's General System of Social Security in Health for the municipality of Firavitoba. The data is published by www.datos.gov.co and includes variables like age, sex, zone, and health administrator. It was last updated on 2026-05-18.
Informes de Ley Oficina de Control Interno - ESAP catalogs all legal reports generated by the Internal Control Office of the Superior School of Public Administration. The dataset details each report's evaluated period, description, format, and a direct institutional link for consultation. It is maintained by www.datos.gov.co and was last updated on 2026-05-18.
Surat Basin in southeastern Queensland and northeastern New South Wales is the focus of this palynological study of Aptian to mid-Albian sediments. The work systematically describes 38 genera and 72 species of spores, 18 genera and 21 species of pollen grains, and 35 genera and 60 species of dinoflagellates, proposing several new species. It was published by the Australian Ocean Data Network and last updated on 2026-04-28.
Oolong-Pairs is a long-context, pairwise-aggregation reasoning benchmark built on top of the oolongbench/oolong-synth dataset. Each task presents a long context of thousands of general-knowledge questions, each implicitly labeled with one of six TREC coarse categories. The dataset was created by mit-oasys and was last updated on June 1, 2026.
The Gawler Craton in central Australia contains the late Archean-Proterozoic Harris Greenstone Domain. This GeoPDF map, produced by Geoscience Australia, visualizes the domain's lithology and structure based on aeromagnetic, gravity, and drillcore interpretation. The map is georeferenced, allowing coordinate and distance retrieval, and was last updated on 2026-04-20.
Leicester City Council provides the 2021 Census boundaries for Middle Layer Super Output Areas (MSOAs) within Leicester City. The data is available in multiple formats, including GeoPackage, CSV, and JSON, suggesting it is designed for interoperability. The record was last updated in June 2026.
Polygon features delineate areas of the U.S. Outer Continental Shelf as defined by the Bureau of Ocean Energy Management. The data is provided by the U.S. Department of the Interior and was last updated in April 2026. These boundaries are intended for cartographic representation, not for official legal or area calculations.
Morphological trait data for phyllostomid bat species includes mean body weight in grams and mean forearm length in millimeters. The dataset was authored by Daryl Cruz and is hosted on figshare under a CC-BY-4.0 license. It was last updated on May 22, 2026.
An inventory of public information generated, received, and processed by Colombian government entities that has been classified as confidential or reserved. The dataset is published by datos.gov.co and was last updated on 2026-05-18. It includes metadata such as the responsible area, legal justification, classification period, and format for each classified record.
Registro activos de información is a base for managing information security risks and determining required protection levels. The dataset is hosted by www.datos.gov.co and was last updated on 2026-05-18. It contains columns such as Nombre o título de la información, Formato, Propietario, and Dependencia.
figshare hosts a dataset describing the discovery and optimization of MK-1088, a potent dual A2A/A2B adenosine receptor antagonist. Yonglian Zhang authored this publication, which was last updated on 2026-05-05. The dataset likely contains experimental results or compound properties supporting the progression of MK-1088 to human clinical studies.
Clinical cohort data from a South Korean study investigating predictors of generalization in ocular myasthenia gravis. The dataset, authored by Hyunjin Shin and last updated in June 2026, is stored in a 34.4 KB XLSX file. It likely contains patient-level variables related to antibody positivity, RNST abnormality, and thymoma status.
Sentinel-5P's TROPOMI instrument is a push-broom grating hyperspectral spectrometer with a 108-degree field of view, measuring radiance across ultraviolet-visible, near-infrared, and shortwave infrared wavelengths. Its Level-1B product provides calibrated radiance, irradiance, and engineering data, with a spatial resolution of approximately 5.5 km at nadir implemented from August 6, 2019. This dataset is processed by the Royal Netherlands Meteorological Institute (KNMI) using the netCDF-4 enhanced model.
Three sediment cores from Nara Inlet in the Whitsunday Islands, Australia, collected by the Australian Ocean Data Network. The dataset describes sediment composition and accumulation rates over the last 3000 years in a tropical mixed clastic/carbonate system. It includes measurements of terrigenous clay and carbonate components, with radiocarbon dating indicating changes in accumulation rates.
219 clonotypes and 169 αβ TCR pairs were isolated from six donors to characterize the T-cell response to a conserved SARS-CoV-2 spike protein epitope. The dataset, authored by Yoshiki Aritsu and last updated on 2026-04-27, includes analysis of peptide modifications and their impact on T-cell recognition and vaccine efficacy. It is shared under a CC-BY-4.0 license on figshare as a 1.5 MB PDF document.