Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,787 datasets
The convert/FormalPowerSeries functionality was completely rewritten for Maple 2022. It is based on Dr. Bertrand Teguia Tabuguia's PhD thesis, Power Series Representations of Hypergeometric Type and Non-Holonomic Functions in Computer Algebra, under the supervision of Prof. Wolfram Koepf at University of Kassel, Germany, May 2020. The dataset was authored by JΓΌrgen Gerhard and harvested by Borealis Dataverse.
Geoscience Australia bathymetry compilation products represent the current extent of seafloor depth data as of June 2019. The dataset consists of polygons showing the spatial coverage of each compilation, with attributes detailing data sources, product specifics, and access methods. Contributing individual survey data is referenced in a separate AusSeabed Bathymetry Holdings dataset.
Released in 2026, this dataset provides a per-cell evaluation corpus and donated session prefixes for the NeurIPS 2026 Evaluations & Datasets Track submission titled 'ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions'. The dataset is authored by contextecho2026 and is intended to support research on persona consistency in extended AI coding interactions.
AIGNLAI's DARS Routing Benchmark is a collection of scored large language model generations for studying model routing. It provides multiple observations for each query-model pair by varying inputs and decoding processes. The dataset was last updated on June 4, 2026.
423 survey responses from university students in Indonesia, collected online via Google Forms. The data measures variables like perceived prompt framing, AI bias, trust, usefulness, and decision-making using a five-point Likert scale. Author Neysa Sabrina published the dataset on figshare in April 2026, and it was analyzed using Partial Least Squares Structural Equation Modeling (PLS-SEM).
Asset acquisitions data for a council, starting from 2012. The dataset is published by the Government Digital Service on the EU open data platform and is updated annually. The specific assets, transaction details, and total volume are not described.
Raw data used for generating figures in a manuscript on reentrant superconductivity in a naturally occurring Josephson junction array. The dataset was authored by Yoram Dagan and is shared under a CC-BY-4.0 license. It was last updated on May 28, 2026.
674 porcine reproductive and respiratory syndrome virus (PRRSV) ORF5 sequences collected from 25 Chinese provinces between 2019 and 2024. The dataset, authored by Jinyong Zhang and shared on figshare, includes results of phylogenetic analysis and amino acid mutation tracking. It was last updated in April 2026.
674 porcine reproductive and respiratory syndrome virus (PRRSV) ORF5 gene sequences collected from 25 provinces in China between 2019 and 2024. The dataset, authored by Jinyong Zhang and shared on figshare, was used to analyze spatiotemporal distribution, molecular evolution, and amino acid mutations. The data was last updated on April 10, 2026.
674 porcine reproductive and respiratory syndrome virus (PRRSV) ORF5 sequences collected from 25 Chinese provinces between 2019 and 2024. The dataset, authored by Jinyong Zhang and uploaded to figshare, was used to analyze spatiotemporal distribution, molecular evolution, and amino acid mutations. Sequences were phylogenetically classified into lineages 1, 3, 5, and 8.
City of Melbourne Open Data provides modelled energy consumption projections for 2026 at a block level. The data, produced by CSIRO, is based on building attributes like age and floor area and covers both commercial and residential sectors relative to a 2011 baseline.
Modelled energy consumption data for commercial and residential buildings across the City of Melbourne municipality. The dataset is a 2016 projection based on building attributes like age and floor area, created by CSIRO for a study commissioned by IMAP Councils. It is presented at a block-level scale and excludes the industrial sector.
City of Melbourne Open Data provides modelled energy consumption projections for 2021 based on a building retrofit scenario. The dataset, created by CSIRO, covers both commercial and residential building blocks within the municipality, using building attributes rather than metered data.
City of Melbourne Open Data provides modelled energy consumption data for the municipality at a block level. The dataset is a 2016 business-as-usual projection based on building attributes like age and floor area, covering both commercial and residential sectors relative to a 2011 baseline.
A 2021 business-as-usual projection of modelled energy consumption across the City of Melbourne municipality, based on building attributes like age and floor area. The data, provided by CSIRO, covers both commercial and residential buildings at a block level, excluding the industrial sector.
2026 projection relative to a 2011 baseline provides modeled energy consumption for commercial and residential buildings across the City of Melbourne municipality. The data is modeled from building attributes like age and floor area, not metered, and is provided at a block level scale by the CSIRO.
NYC Parks Syringe Disposal Kiosks data lists locations, installation details, and capacities for syringe collection kiosks across New York City parks. The dataset is maintained by NYC Parks via an ArcGIS Online application and is updated as kiosks are installed, moved, or removed. A data dictionary is available for reference.
ODP Site 1167 provides age control indicating the bulk of the trough mouth fan was deposited prior to the Brunhes-Matuyama Boundary (780 ka). The dataset describes the stratigraphy of the Prydz Channel Fan, built by the Lambert Glacier-Amery Ice Shelf system, and is provided by Geoscience Australia Data. It was last updated on 2026-04-30.
A study by Simon Langener, uploaded to figshare in 2026, evaluating the use of Embodied Conversational Agents (ECAs) in Immersive Virtual Reality to simulate peer pressure to drink alcohol. The dataset, 120.1 MB in size, contains results from a repeated measures experiment with twenty patients with Mild to Borderline Intellectual Disability and Alcohol Use Disorder. It assesses the ECA's persuasiveness and effects on perception, emotional state, and coping behavior using actor-recorded dialogues in dominant-friendly and dominant-hostile styles.
Geoscience Australia conducted a regional mapping program addressing stratigraphic and structural exploration risk in the Triassic succession of the Roebuck Basin. The data pack comprises seismic horizon grids and isochron grids generated from the TR10.0_SB, TR17.0_SB, and J10.0_SB horizons, alongside fault maps. Seismic horizons were mapped using 2D and 3D surveys, including AGSO s110, AGSO s120, PGS New Dawn, and 3D surveys like Admiral and Beagle.