Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,398 datasets
From 1982 to 1997, the Government of Alberta created a series of printed monochrome road and access maps. These maps depict features such as roads, railways, pipelines, trails, transmission lines, airfields, municipalities, bodies of water, and natural resource sites within Alberta. They are available as non-georeferenced PDF files and TIFF files on request.
The Historical Resource Base Series maps are a collection of printed monochrome road and access maps created by the Government of Alberta between 1982 and 1997. They depict features such as roads, railways, pipelines, trails, transmission lines, airfields, municipalities, bodies of water, and natural resource sites within Alberta, using National Topographic System identifiers. The maps are available as non-georeferenced PDF files and TIFF files on request.
Between 1982 and 1997, the Government of Alberta created a series of printed monochrome road and access maps. These Historical Resource Base Series maps depict features such as roads, railways, pipelines, trails, transmission lines, airfields, municipalities, bodies of water, and natural resource sites within Alberta, using the National Topographic System (NTS) map sheet identifier. The maps are available as non-georeferenced PDF files and TIFF files on request.
82 maps created between 1982 and 1997 depict infrastructure and resources in Alberta. The Government of Alberta produced these monochrome maps, which include features such as roads, railways, pipelines, trails, transmission lines, airfields, municipalities, bodies of water, and natural resource sites. They are available as non-georeferenced PDF files and TIFF files on request.
83 maps created by the Government of Alberta between 1982 and 1997 depict roads, railways, pipelines, and natural resource sites. These printed monochrome maps are named using the National Topographic System (NTS) map sheet identifier for Alberta. The series is available as non-georeferenced PDF files, with TIFF files accessible upon request.
Table 1 presents a comparative synthesis of three educational YouTube e-academies: Khan Academy, Unicoos, and Academia Play. The dataset, created by Antonio Sánchez Sillero and last updated on 2026-05-03, organizes the main features identified in an analysis across six analytical categories. It is a 236.5 KB PDF file intended to provide a structured overview of similarities and differences between the channels.
Legacy product from the Australian Ocean Data Network with no abstract available. The title indicates it contains notes compiled for an international scientific workshop on the geology, mineral resources, and geophysics of the South Pacific. The data is available in HTML and PDF formats.
MathArena provides training data generated from past ArXiv articles, together with outputs generated by the Qwen3.6-35B language model. The dataset is derived from the MathArena/arxivmath-training repository on Hugging Face. It was last updated on June 16, 2026.
Legacy product from the Australian Ocean Data Network with no abstract available. The dataset title suggests it contains morphological data of the seabed, likely related to assessing offshore heavy-mineral deposits. Its last recorded update was 2026-06-16 20:51:56.363210.
Colombian data for 2020 contains gender parity indices (IPG) and gross coverage indicators for preschool, primary, secondary, and media education, disaggregated by certified territorial entity (ETC). The dataset includes columns for male and female enrollment (MATR) and gross coverage (COBERTURA_BRUTA) across various age groups. It originates from the Colombian open data portal, www.datos.gov.co, and was last updated in May 2026.
Victoria, Australia's land use data for the 2021/22 period, created by the Department of Energy, Environment and Climate Action. The dataset provides spatially detailed information to support greenhouse gas accounting and climate mitigation strategies under the Agriculture Sector Pledge. It is part of a time series with previous releases for 2006/07, 2008/09, 2010/11, 2012/13, 2014/15, and 2016/17.
Survey responses collected before and after an educational mini-course intervention, aiming to characterize perceptions, familiarity, and usage patterns of generative AI tools in an educational context. The dataset is organized into separate 'before' and 'after' sheets with a comparison view, enabling descriptive and comparative analysis. It is a 50.4 KB XLSX file authored by Jefferson Rodrigo Speck and last updated on 2026-05-17.
Experimental artifacts from an automated Android Open Source Project (AOSP) build pipeline. The dataset, created by IRedDragonICY, hosts synthetic outputs generated to evaluate complex dependency resolution and hardware-specific patching. Its last update was recorded on 2026-06-16.
Synthetic supervised fine-tuning data teaches a small language model to respond to a single puppet-theater beat with one compact JSON object. The dataset is authored by 'build-small-hackathon' and was last updated on June 15, 2026. It is intended for hackathon prototyping, schema following, and local adapter experiments.
Action Against Hunger ACF - Regional Office for West and Central Africa ROWCA provides monthly surface water presence statistics in square kilometers for West and Central Africa. The data is calculated from November 2020 onwards at four administrative levels (Admin 0, 1, 2, 3) using 100m resolution Copernicus Land Monitoring Service imagery. Administrative boundaries are sourced from OCHA, and the dataset was last updated in April 2026.
4,798 records of Canadian libraries and archives compiled for a Royal Society of Canada expert panel report in 2014. The data includes names, institution types, locations, and establishment years, originally used to power a mapping and timeline application. Multi-branch library systems are represented by a single entry unless a branch is in a different community.
Legacy product from the Australian Ocean Data Network. The dataset describes heavy-mineral sand deposits along the Western Australian coast. The record was last updated on 2026-06-16.
RUEmoCorp is a large-scale emotion classification corpus for Roman Urdu, the informal transliterated writing style dominant in Pakistani digital communication. The dataset includes a formally annotated benchmark subset of approximately 28,000 samples and a larger raw corpus, created to address the underrepresentation of Roman Urdu in NLP research. It was authored by Muhammad Khubaib Ahmad and last updated in May 2026.
Active business registrations in Pennsylvania, broken down by county, with data extending from 1768 to the present. The dataset is provided by the Pennsylvania Department of State via data.pa.gov and includes counts of corporations, LLCs, nonprofits, and other business entities. It contains 22 columns detailing business names, addresses, types, creation dates, and associated legislative and school districts.
A generational variance dataset outputs used to validate the synthetic cluster replication fidelity of the Rosetta-Routine analysis. The dataset was authored by Bradley Mason and is available under a CC-BY-4.0 license. It was last updated on May 29, 2026.