Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,440 datasets
A capital improvement program involving over 200 projects and approximately 400 miles of roadway repair in New Orleans. The dataset tracks individual projects with details on phases, costs, contractors, and geographic boundaries. Columns suggest it likely contains project timelines, financial data, and location information for municipal planning.
7,555 coastal segments of Kenya are assessed for vulnerability using eight physical and three socioeconomic indicators. The analysis, authored by Abigail Kagema and last updated in May 2026, finds 31.7% of segments fall in the High or Very High vulnerability class. It provides a transferable, open-data framework for coastal adaptation planning in Kenya and East Africa.
A collection of text corpora for agricultural risk analysis, including over 527,000 original sentences and 703,000 causal triplets. The dataset was created by WenJun Cui and last updated in May 2026. It focuses on 7 crop categories and 10 major agrometeorological disasters.
A dual-standard reverse-phase HPLC-UV method was developed for the simultaneous identification, assay, and impurities analysis of phenylboronic acid. The method uses 4-methoxyphenyl boronic acid as a model compound and a bromo-analog surrogate for quantitation, bypassing the need for a single-component primary reference standard. This 1.7 MB dataset, authored by Weijiang Ying and last updated in May 2026, is available under a CC-BY-4.0 license.
Nine cameras on NASA's MISR instrument captured each piece of Earth's surface from angles of 0, 26.1, 45.6, 60.0, and 70.5 degrees in four spectral bands. This FIRSTLOOK product provides a subset for the ARCTAS region, containing directional reflectance, albedo, FPAR, and terrain-referenced geometric parameters. It is designed for monitoring monthly trends in aerosols, clouds, and land surface cover.
A geospatial dataset from Spatial Services (DCS) defining cultural and man-made features of interest across New South Wales, Australia. It includes locations of facilities such as Police Stations, Ambulance Stations, and Hospitals, along with other cultural and utility features. The dataset is updated daily and has been aligned to the GDA2020 national geodetic standard.
A polygon feature dataset from Spatial Services NSW defining man-made cultural areas. Feature types include building dwellings and various pondage types like swimming pools and settling ponds. The dataset aligns with the national GDA2020 spatial standard and maintains positional relationships with other NSW foundational spatial data themes.
The NSW State Water Quality Assessment and Monitoring Program (SWAMP) collects monthly water quality data from approximately 140 sites across 13 regions. The program has operated since 2007, measuring parameters like electrical conductivity, temperature, turbidity, pH, dissolved oxygen, phosphorus, and nitrogen. Data is collected and archived by Water NSW according to Australian Standard AS/NZS 5667.1.1998 and analyzed at NATA-accredited laboratories.
A framework for modelling shoreline response to clustered storm events, focusing on two case study areas in southeast Australia: the Adelaide metropolitan coast and Old Bar beach. The dataset integrates coastal geomorphology and engineering approaches, using sediment compartment mapping and sub-surface data like boreholes and ground-penetrating radar profiles. This work is a contribution to the Bushfire and Natural Hazard Cooperative Research Centre project on storm surge resilience.
A geological study of the Albany Canyon complex, which extends 700 km from Cape Leeuwin to east of Esperance. The dataset, hosted by the Australian Ocean Data Network, describes canyon dimensions, structure, and a proposed evolutionary history from the Jurassic to the Middle/Late Eocene. It was last updated on 2026-06-05.
Fuzzy Extent Area is a polygon feature class defining the approximate extent of formally named landforms with indistinct boundaries in New South Wales. The dataset includes categories such as Dune Like, Flat Like, Valley Like, and Plateau Like, based on the Geographic Names Board designations. It is provided by Spatial Services and has been updated to the GDA2020 coordinate standard.
New South Wales, Australia, features a point dataset defining general cultural feature types. The dataset includes features such as swimming pools, cemeteries, communication towers, racetracks, and dam walls, captured at scales from 1:500 to 1:250,000. It is provided by Spatial Services, a business unit of the Department of Customer Service NSW, and has been updated to the GDA2020 spatial reference standard.
5.5 KB of partitioned data supporting a federated learning framework for speech emotion recognition. The dataset, authored by Mohammed Tawfik and last updated in May 2026, is associated with experiments on three corpora: EmoDB (German), RAVDESS (English), and CREMA-D. It likely contains client assignments and experimental splits for non-IID federated training and evaluation.
5.5 KB of computational efficiency metrics from the FedEmoNet framework, authored by Mohammed Tawfik and last updated in May 2026. The dataset likely contains performance metrics from a federated learning system for speech emotion recognition, evaluated on German (EmoDB) and English (RAVDESS) speech corpora. The framework achieved high accuracy on held-out test sets and was tested for cross-corpus generalization on CREMA-D.
5.5 KB of tabular data comparing SHAP and LIME explainability methods within the FedEmoNet framework. The dataset was authored by Mohammed Tawfik and last updated on May 7, 2026. It supports a federated learning system achieving over 99% accuracy on EmoDB and RAVDESS datasets.
FedEmoNet achieves 99.07% accuracy on EmoDB and 98.96% on RAVDESS using federated learning with differential privacy. The framework, authored by Mohammed Tawfik, demonstrates cross-corpus generalization with 68.15% accuracy on CREMA-D. Results were last updated in May 2026.
Mohammed Tawfik published results from ablation experiments for the FedEmoNet framework on 2026-05-07. The dataset, 5.5 KB in size, contains results from a federated learning study on speech emotion recognition using German (EmoDB) and English (RAVDESS) speech corpora. It includes performance metrics for model components like PSO feature selection, Transformer blocks, and the FedProx protocol.
Results from multivariable regressions modeling pediatric emergency department (PED) use incidence during 2023-2024. The dataset, authored by Denis Mongin and shared on figshare, analyzes the relationship between PED use and factors like distance to the PED, neighborhood socio-economic vulnerability, and pediatrician density, stratified by Canadian Triage Acuity Scale (CTAS) levels.
A meta-analysis of gene expression correlations across seven independent human substantia nigra microarray datasets, comprising 156 samples (70 controls, 86 Parkinson's disease). The dataset was created by Drake H. Harbert and last updated in May 2026. It examines the relationship between ALDH1A1 expression and dopaminergic pathway genes in Parkinson's disease.
A meta-analysis of gene expression correlations across seven independent human substantia nigra microarray datasets, comprising 156 samples (70 controls, 86 Parkinson's disease). The dataset, authored by Drake H. Harbert and last updated in May 2026, reports correlation changes between the ALDH1A1 gene and dopaminergic pathway genes in Parkinson's disease.