Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,044 datasets
Official mortality data on suicide in the department of Caldas, Colombia, provided by the National Administrative Department of Statistics (DANE). The dataset contains annual suicide rates by municipality, with records from the year 2000 to the most recent available date. It allows for analysis by sex and geographic zone to support public health awareness, prevention, and policy formulation.
A registry of beneficiaries for the 'Renta Ciudadana' social welfare program in the municipality of Arboledas, Norte de Santander, Colombia. The dataset includes columns for beneficiary demographics, enrollment status, and location. It was published via the Socrata platform on datos.gov.co and was last updated on 2026-05-18.
A 2012 legal agreement and its supplemental updates through 2020 govern the regeneration of the Dollis Valley housing estate in Chipping Barnet, London. The dataset comprises the Principal Development Agreement between Barnet Council, developer Countryside, and housing provider L&Q, outlining plans for 616 new homes. It includes 33 detailed schedules covering objectives, financial models, masterplans, tenancy agreements, and partnership governance.
A 205.0 KB PDF document provides the official application and admission form for the School of Nursing, Agbor in Delta State, Nigeria, for the 2026-2027 academic year. The dataset, authored by Refeal Mark and last updated on 2026-05-11, is licensed under CC-BY-4.0 and hosted on figshare. It details available nursing and midwifery programmes and outlines contact procedures for application.
School of Basic Midwifery, Obudu has released its 2026/2027 admission application forms. The dataset is a PDF document containing information on application procedures, available courses, and contact details for the school administration. It was authored by Refeal Mark and last updated on 2026-05-11.
A 205.0 KB PDF document outlines the application process for nursing programs at the School of Nursing, ATBU, Bauchi State, Nigeria. The file, authored by Refeal Mark and last updated on May 11, 2026, provides contact details and lists available courses for the 2026/2027 academic year. It is shared under a CC-BY-4.0 license on the figshare platform.
127.2 MB of processed proteomic and phosphoproteomic data from a study investigating nitric oxide regulation of cardiac beta-adrenergic signaling. The dataset contains quantitative protein and phosphopeptide measurements analyzed with Spectronaut and MSFragger, supporting findings on NOS1 and PKA signaling. It was authored by Sherif M. F. M. Bahriz and last updated in May 2026.
Queensland landfills and recyclers received a total of 855,000 tonnes of construction and demolition waste generated interstate in 2018β19. The dataset, published by the Queensland Department of Environment, Tourism, Science and Innovation, quantifies waste flows for disposal and recycling. It was last updated in May 2026.
Queensland's public transport sector is performing better than its target service level benchmarks. The data, published by the Queensland Department of Environment, Tourism, Science and Innovation, likely contains metrics on network reliability. It was last updated on 2026-05-27.
Queensland litter counts from the National Litter Index, which began sampling in 2005β06. The data shows Queensland has generally experienced higher average litter counts than the national average, though counts have trended downwards over time for both. It was published by the Queensland Department of Environment, Science and Innovation.
Arkansas and Missouri soybean genotypes from maturity groups III to V were evaluated across 10 environments during the 2023 and 2024 growing seasons. Rafael Goncalves Marmo created this dataset to support a classification-based genomic prediction framework for identifying high-yielding genotypes. The dataset likely contains genomic predictors and yield performance classes derived from SoySNP3K BeadChip markers.
Analysis of humanitarian accessibility in Ethiopia produced at the Woreda administrative level. The dataset is prepared by OCHA Ethiopia in consultation with the Access Working Group and field focal points, based on information from humanitarian partners and reliable sources. It depicts the general access situation during a specific reporting period, with conditions potentially changing by the time of publication.
2.5 MB of raw scheduling results and charts supporting a paper published in the Journal of Parallel and Distributed Computing. The dataset contains two ZIP files: one with CSV files of raw results for different graph structure types, and another with the charts created for the paper. It was authored by Raymond Li and last updated on 2026-05-20.
Clean Air Tracking System (CATS) records permit applications for boilers, engines, generators, and industrial work in New York City. The dataset tracks requests for registration, renewal, inspection, and amendments, linking them to specific buildings and owners. Columns suggest it contains administrative details like application status, issue dates, fuel types, and equipment models.
SupraLabs's Supra Wild Titles 130K is a dataset series for training and evaluating chat title generation models. It contains 130,000 niche and specialized conversation samples partitioned from primary title datasets. The dataset was last updated on June 20, 2026.
EMIT L1B At-Sensor Calibrated Radiance and Geolocation Data Version 1 provides raw, non-orthocorrected at-sensor radiance measurements from the EMIT instrument on the International Space Station. Each data granule covers approximately 75 km by 75 km and contains 285 spectral bands ranging from 381 to 2493 nanometers, along with observation geometry and geolocation information. The data is produced by NASA's Jet Propulsion Laboratory and targets sunlit regions between 52Β° N and 52Β° S latitude.
36.4 KB supplementary table containing the combined output from the Genomica tool's analysis of 500 orthologs. The file, authored by Salvatore Galgano and last updated in June 2026, summarizes generic linear mixed model output generated via the anova function in Genomica.
69.1 KB supplementary table from the Genomica analysis tool. The file summarizes significant comparisons from linear mixed models run on 500 orthologs from a demonstration dataset. Authored by Salvatore Galgano, this output was last updated on June 3, 2026.
126 football pitches and turf samples were tested for vertical compliance and rotational stiffness using FIFA-approved devices. The supplementary PDF files contain data that informed revisions to the FIFA Quality Programme's performance thresholds for playing surfaces. Author David James published the files under a CC BY 4.0 license in May 2026.
Plasma Science and Fusion Center Dataverse hosts a dataset by Jintao Hu, Patricia Sadde, Liangjun Shao, Philip C. Michael, and Dongkeun Park describing a novel insulated magnet design. The dataset likely contains experimental and design parameters for a REBCO magnet using Pyralux insulation and a four-tape co-winding technique. The record was last updated on June 18, 2026.