Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,344 datasets
A polygon feature class spatially representing property descriptions from the Valuer General's ValNet database in New South Wales, Australia. The data includes classifications for Large Rural, Rural, Semi-Rural, and Urban property types. Attributes available for point queries include Address string, House Number, and PropID.
NI Coastal Erosion High Level Risk Appraisal shows the potential for erosion of the Northern Ireland coastline. It was created by Amey Consulting with HR Wallingford for the Department for Infrastructure and DAERA in 2018 as part of a baseline study and gap analysis. The data is available in geospatial formats like GeoJSON and ESRI Shapefile.
Seasat-A Scatterometer data provides the first spaceborne global wind vector measurements from a three-month mission in 1978. Wind direction ambiguities are resolved using a global weather prediction model, and data are binned into 100 km cells per swath. This complete dataset results from reprocessing efforts led by Frank Wentz, Robert Atlas, and Michael Freilich.
Over the past 800 ka, a sequence of stranded coastal barriers in south-east South Australia preserves a record of sea-level variations. This dataset likely contains quartz SAR-OSL ages determined on quartz extracts from these dunes, comparing them with an existing chronology. The data is presented by the Australian Ocean Data Network.
A free preview subset of a larger proprietary Italian text corpus developed by Egomnia S.p.A. The dataset contains over 360 pieces of content sourced from the Progetto Talia website. The full dataset is available for purchase from the author.
Natural Resources Canada provides Energy Efficiency Report (EER) templates for regulated heating and air-conditioning equipment. Dealers must use these product-specific templates to report energy-using products to NRCan prior to first import. Completed reports must be submitted electronically through the Compliance Regulatory Energy Efficiency Database (CREED).
A bibliography compiled for the 60th anniversary of the University of Montreal's School of Architecture lists faculty publications from 1965 to 2025. The current version primarily includes print and digital monographs available at the university's library, with plans to expand to journal articles, book chapters, and theses. The dataset was last updated on June 13, 2026, by author Mélançon Bolduc, Ginette-Denyse.
A project examining the n-body problem involving three or more masses in two dimensions. It includes derivations, numerical solutions, and analysis of chaotic system behavior through plots and animations. The work was authored by Marcus Lindsey and last updated on June 27, 2026.
Spotibot (Strawberry) is a dataset supporting research on automated fruit-level screening of Botrytis cinerea in strawberries using a multi-model deep learning pipeline. The dataset is 898.4 MB in size and consists of JPG image files. It was authored by Dan Jeric Arcega Rustia and last updated on 2026-05-27.
736 samples provide mean summer and winter temperature deviations from the Holocene mean, reconstructed from fossil pollen records at five sites in the Carpathian Mountains. The dataset offers an average temporal resolution of 16.3 years over the last 12,000 years. It was created by Jon Camuera and published in 2026.
A theoretical model dataset from figshare, created by Liurui Deng and last updated in May 2026. It examines the impact of tax disparities and blockchain technology on the profits of multinational enterprises (MNEs) operating through various e-commerce entry models. The dataset, 5.5 KB in size, is provided in an XLS file format.
Ablation study results for the EAC-Agent multimodal conversational model, published by Shahid Jamil on figshare in April 2026. The dataset likely contains performance metrics from experiments comparing the model's emotion recognition and response generation capabilities against benchmarks. The proposed model achieved an accuracy of 76.27% on IEMOCAP and 67.57% on MELD for emotion recognition.
76.27% accuracy on IEMOCAP and 67.57% on MELD for emotion recognition. The dataset contains performance metrics for a multimodal conversational agent (EAC-Agent) compared against existing techniques on two benchmark datasets. It was authored by Shahid Jamil and last updated on April 17, 2026.
Shahid Jamil published results from a multimodal conversational agent model on April 17, 2026. The 5.5 KB Excel file likely contains performance metrics for emotion classification and response generation on the MELD benchmark dataset. The model incorporates text, audio, and visual features.
A 5.5 KB Excel file containing results from a multimodal conversational agent model tested on the IEMOCAP dataset. The dataset likely contains performance metrics for emotion classification and response generation, as described in the research paper by Shahid Jamil. It was last updated on April 17, 2026.
5.5 KB of tabular data contains benchmark performance statistics for the EAC-Agent multimodal conversational model. Shahid Jamil published the dataset on figshare in April 2026. The data likely includes accuracy, perplexity, BLEU, and ROUGE-L scores for emotion recognition and response generation.
Benchmark results from 2026 comparing a novel multimodal chatbot model against existing techniques. The dataset likely contains performance metrics for emotion classification and response generation, including accuracy, perplexity, BLEU, and ROUGE-L scores. It was authored by Shahid Jamil and uploaded to figshare.
Throughout 2012 and 2013, the Greater London Authority conducted a program of research to explore the impact of the London 2012 Olympic Games. The data likely contains the opinions, behaviors, and attitudes of Londoners and visitors to London, collected during and after the Games. This dataset aggregates the results from the GLA's Gamestime research.
UnityShotsBench is a multilingual, multi-cultural benchmark for evaluating multi-shot audio-video generation. Each case is a short cinematic story requiring consistent character identity, voice, and world persistence across cuts. The benchmark was released by KlingTeam in 2026 with the UnityShots research paper.
A 2018 high-level vulnerability assessment of historic assets along the Northern Ireland coast, prepared by Amey Consulting with HR Wallingford for government departments. This geospatial layer results from an Erosion Risk Appraisal stage, comparing erosion risk against asset value. The full report is published by the Department for Infrastructure and the Department of Agriculture, Environment and Rural Affairs.