Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,645 datasets
764 cartographic quality monochrome maps covering the provincial extent of Alberta at a 1:50,000 scale. The series is comprised of individually named maps using the National Topographic System (NTS) identifier and displays features like the Alberta Township System, hydrography, roads, pipelines, and administrative boundaries. This series is published by the Government of Alberta and is not updated on a regular basis, containing a range of publication dates.
A series of 764 monochrome cartographic maps covering the provincial extent of Alberta at a 1:50,000 scale. The maps display the Alberta Township System, hydrography, municipalities, roads, pipelines, powerlines, railways, and other geo-administrative features. The Government of Alberta produced this series, which is not regularly updated and may contain a range of publication dates.
84G 50K Map - Provincial Resource Access Map Series is a collection of 764 cartographic-quality, monochrome topographic maps covering the entire province of Alberta. The Government of Alberta produced this series, which displays infrastructure, administrative boundaries, and natural features using the National Topographic System. The series is not updated regularly and may contain maps from a range of publication dates.
ISS-RapidScat Version 1.2 provides science-quality ocean wind vectors from a Ku-band scatterometer mounted on the International Space Station. Unlike its predecessor QuikSCAT, this instrument flies at a lower altitude with a non-sun-synchronous orbit, restricting data coverage to latitudes between approximately 61°N and 61°S and providing no consistent local time of day retrieval. The dataset offers a calibrated continuation of wind data from 19 August 2015 onward, processed on a 12.5 km grid.
Performance data from the Department of Women, Aboriginal and Torres Strait Islander Partnerships and Multiculturalism regarding the Queensland Government's On-Time Payment policy. The dataset was created under Administrative Arrangements Order (No. 3) 2024, with data commencing in Quarter 3 (January to March 2025). It is published by [email protected] under a CC-BY-4.0 license.
A 2.2 KB text file identifies and normalizes semantically identical keywords that appear in different written forms. It was authored by Hua Song and last updated on 2026-05-14. The file generates an equivalence mapping table, applying normalization only when the same concept occurs in at least two distinct expressions.
Hua Song published a text prompt file on figshare in May 2026. The 2.8 KB file contains instructions designed to guide a model in determining whether an input term is an abbreviation or acronym. It aims to help distinguish shortened forms of words or phrases from general terms.
An aligned multilingual dataset for supervised fine-tuning, containing multiple-choice questions across three configurations. The dataset was created by cs-552-2026-claude-bots and last updated on May 31, 2026. It comprises 148,497 total base questions, with 133,647 rows for training and 14,850 for evaluation.
A distilled corpus created by hirundo-io, last updated on 2026-06-09. It contains short-form technical descriptions optimized to be under 1200 tokens. The dataset is structured with ShareGPT-style message pairs, including a prompt and a clean assistant answer.
Kelly Gabriela Cambero Nava published data on biodegradable polymer blends for flexible packaging. The dataset likely contains experimental results on how tannic acid modifies the stiffness, strength, and toughness of poly(butylene adipate-co-terephthalate) and plasticized cellulose acetate blends. The data was last updated on 2026-05-23.
Alberta CPR Land Sales Spatial Data contains records of agricultural land sales by the Canadian Pacific Railway to settlers in Alberta from 1883 to 1927. The dataset includes purchaser names, legal land descriptions, acreage, and cost per acre, transcribed from original ledgers by Glenbow Archives volunteers and enhanced by Archives and Special Collections at UCalgary. A non-spatial tabular version and an interactive web application for exploring the spatial data are also available.
A 2026 study by the Australian Ocean Data Network analyzed thermal maturity in Permian sandstone reservoirs of the Northern Denison Trough, Bowen Basin, Australia. The dataset includes vitrinite reflectance measurements ranging from 0.55% to 0.93% Rmax and Rock-Eval Tmax values from 421°C to 447°C from coals and shales. Research focused on evaluating CO2 storage potential and understanding diagenetic effects on reservoir quality.
A scientific description of a new ophiuroid species, Ophiomicros bathursti gen. et sp.nov., from Cenomanian (Upper Cretaceous) strata. The dataset is provided by Geoscience Australia Data and was last updated on 2026-05-26. It includes details on the fossil's morphology and its distinction from related genera.
India-based survey of 224 Generation Z respondents from eight major cities, investigating behavioral intentions toward sleep-optimized hotel accommodations. The dataset, created by Suraj Yadav and last updated in May 2026, applies the Theory of Planned Behaviour to examine factors like attitude and subjective norms. It is a 30.8 KB Excel file containing survey results.
Information on the 10 individuals utilized to generate enhancer-protein interactions (EPI). The dataset was authored by Jiafang Li and last updated on May 29, 2026. It is a small dataset, sized at 9.1 KB.
9.0 KB of RNA-seq data used to identify genes differentially expressed between patients and controls. The dataset, authored by Jiafang Li, is available as an XLSX file under a CC-BY-4.0 license and was last updated on 2026-05-29. Its small size suggests a focused analysis rather than a large-scale repository.
A dataset compiled from technical documentation files of Turkish POS (Point of Sale) software. It was generated from the documentation of four POS software packages used in the dockerli_ragli project. The dataset was created by author sonposai and was last updated on June 14, 2026.
Heliocentric trajectory data for the Phobos 2 spacecraft, calculated using the 'Mean of Date' method for the Equinox Epoch. The data is provided in Heliographic (HG), Heliographic Inertial (HGI), and Solar Ecliptic (SE) coordinate systems. The original trajectory data is sourced from NASA JPL's Horizons system and is maintained by the National Aeronautics and Space Administration, with a last update recorded in March 2026.
Cluster Generating Julia Script Example is a 4.4 KB ZIP file authored by Bradley Mason. It provides an example Julia script used to generate synthetic cell cluster data. The dataset was last updated on May 29, 2026, and is shared under a CC-BY-4.0 license.
Lalita Roy's dataset on figshare, last updated 2026-05-29, examines types of wet containers and their contribution to larval productivity. The dataset is 5.5 KB in size and is available in XLS format under a CC-BY-4.0 license. Its specific row count and column details are not provided in the metadata.