Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,370 datasets
School of Basic Midwifery, Obudu has released its 2026/2027 admission application forms. The dataset is a PDF document containing information on application procedures, available courses, and contact details for the school administration. It was authored by Refeal Mark and last updated on 2026-05-11.
A 205.0 KB PDF document outlines the application process for nursing programs at the School of Nursing, ATBU, Bauchi State, Nigeria. The file, authored by Refeal Mark and last updated on May 11, 2026, provides contact details and lists available courses for the 2026/2027 academic year. It is shared under a CC-BY-4.0 license on the figshare platform.
127.2 MB of processed proteomic and phosphoproteomic data from a study investigating nitric oxide regulation of cardiac beta-adrenergic signaling. The dataset contains quantitative protein and phosphopeptide measurements analyzed with Spectronaut and MSFragger, supporting findings on NOS1 and PKA signaling. It was authored by Sherif M. F. M. Bahriz and last updated in May 2026.
Queensland landfills and recyclers received a total of 855,000 tonnes of construction and demolition waste generated interstate in 2018–19. The dataset, published by the Queensland Department of Environment, Tourism, Science and Innovation, quantifies waste flows for disposal and recycling. It was last updated in May 2026.
Queensland's public transport sector is performing better than its target service level benchmarks. The data, published by the Queensland Department of Environment, Tourism, Science and Innovation, likely contains metrics on network reliability. It was last updated on 2026-05-27.
Queensland litter counts from the National Litter Index, which began sampling in 2005–06. The data shows Queensland has generally experienced higher average litter counts than the national average, though counts have trended downwards over time for both. It was published by the Queensland Department of Environment, Science and Innovation.
Arkansas and Missouri soybean genotypes from maturity groups III to V were evaluated across 10 environments during the 2023 and 2024 growing seasons. Rafael Goncalves Marmo created this dataset to support a classification-based genomic prediction framework for identifying high-yielding genotypes. The dataset likely contains genomic predictors and yield performance classes derived from SoySNP3K BeadChip markers.
Analysis of humanitarian accessibility in Ethiopia produced at the Woreda administrative level. The dataset is prepared by OCHA Ethiopia in consultation with the Access Working Group and field focal points, based on information from humanitarian partners and reliable sources. It depicts the general access situation during a specific reporting period, with conditions potentially changing by the time of publication.
2.5 MB of raw scheduling results and charts supporting a paper published in the Journal of Parallel and Distributed Computing. The dataset contains two ZIP files: one with CSV files of raw results for different graph structure types, and another with the charts created for the paper. It was authored by Raymond Li and last updated on 2026-05-20.
Clean Air Tracking System (CATS) records permit applications for boilers, engines, generators, and industrial work in New York City. The dataset tracks requests for registration, renewal, inspection, and amendments, linking them to specific buildings and owners. Columns suggest it contains administrative details like application status, issue dates, fuel types, and equipment models.
SupraLabs's Supra Wild Titles 130K is a dataset series for training and evaluating chat title generation models. It contains 130,000 niche and specialized conversation samples partitioned from primary title datasets. The dataset was last updated on June 20, 2026.
EMIT L1B At-Sensor Calibrated Radiance and Geolocation Data Version 1 provides raw, non-orthocorrected at-sensor radiance measurements from the EMIT instrument on the International Space Station. Each data granule covers approximately 75 km by 75 km and contains 285 spectral bands ranging from 381 to 2493 nanometers, along with observation geometry and geolocation information. The data is produced by NASA's Jet Propulsion Laboratory and targets sunlit regions between 52° N and 52° S latitude.
36.4 KB supplementary table containing the combined output from the Genomica tool's analysis of 500 orthologs. The file, authored by Salvatore Galgano and last updated in June 2026, summarizes generic linear mixed model output generated via the anova function in Genomica.
69.1 KB supplementary table from the Genomica analysis tool. The file summarizes significant comparisons from linear mixed models run on 500 orthologs from a demonstration dataset. Authored by Salvatore Galgano, this output was last updated on June 3, 2026.
126 football pitches and turf samples were tested for vertical compliance and rotational stiffness using FIFA-approved devices. The supplementary PDF files contain data that informed revisions to the FIFA Quality Programme's performance thresholds for playing surfaces. Author David James published the files under a CC BY 4.0 license in May 2026.
Plasma Science and Fusion Center Dataverse hosts a dataset by Jintao Hu, Patricia Sadde, Liangjun Shao, Philip C. Michael, and Dongkeun Park describing a novel insulated magnet design. The dataset likely contains experimental and design parameters for a REBCO magnet using Pyralux insulation and a four-tape co-winding technique. The record was last updated on June 18, 2026.
545 records form this release, combining a seed corpus with captured LoopGym trajectories for LoopNet. The dataset was created by KanakMalpani and was last updated on June 14, 2026. Records conform to the 'ln/record-v1' schema.
The Gippsland Lakes Local Coastal Hazard Assessment provides the extent of a 10% Annual Exceedance Probability (AEP) water level event, incorporating 0.2 meters of sea level rise based on hydrodynamic modelling. It was produced by the Department of Energy, Environment and Climate Action, with the dataset last updated on April 8, 2026. The hazard extent results from a combination of catchment inflows, coastal ocean levels, and wind setup.
Reporte de novedades realizadas por los afiliados al Sistema General de Seguridad Social en Salud tracks administrative changes for health system affiliates in Colombia. The dataset includes columns for origin and destination regimes, municipalities, EPS providers, and demographic details like age and sex. It is hosted on the Colombian open data portal datos.gov.co and was last updated on 2026-05-18.
NASA's SASSIE field campaign deployed two types of profiling floats with different ice-avoidance behaviors to capture the transition from summer melt to autumn ice advance in the Beaufort Sea. ALTO floats halted transmissions at near-freezing surface temperatures to survive winter, while ALAMO floats continued reporting during freeze-up at the likely cost of their survival. This dataset provides in situ temperature and salinity measurements from August-October 2022 within approximately 200 kilometers of the sea ice edge.