Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,756 datasets
A 55.55% reduction in unscheduled downtime was observed in a single-case study of an IoT-enabled Andon system integrated with a Business Intelligence visualization layer in a genset manufacturing business. The research provides empirical validation of an integrated Industry 4.0 intervention through a sociotechnical lens, highlighting the synergy between digital tools and human decision-making. The dataset, a 1.1 MB DOCX file authored by M.S. Narassima, was last updated on May 12, 2026.
A listing of Ontario Provincial Police host and satellite detachments organized by regional clusters and commands. The data includes General Headquarters bureaus and their associated commands, sourced from the Government of Ontario. It was last updated on June 3, 2026.
Dynamic simulation and experimental test data for a piezoelectric cantilever system. The dataset includes frequency response data for linear and nonlinear systems with single-sided stops, varying excitation levels, stop gaps, and samples with different elastic moduli. The 367.9 MB ZIP file was authored by Tianran Song and last updated on 2026-05-31.
Empirical data, metrics, and training materials from a controlled experiment evaluating the D2S approach and C2C toolset. The dataset includes individual participant metrics, IDE session logs, and System Usability Scale (SUS) questionnaire responses. It was authored by Diego Andrés Firmenich and last updated on 2026-05-29.
Yasong Yan's research dataset investigates the synergistic effect of the plant alkaloid Neferine with the antibiotic gentamicin against Escherichia coli. The data supports findings that Neferine targets the FNR and ArcA transcriptional regulators, inducing metabolic collapse and membrane disruption to overcome resistance. Results include in vitro and in vivo validation of the combination's efficacy in reducing bacterial load and improving survival.
Hubert Josien published a dataset on figshare on 2026-05-27 describing the discovery process of enlicitide (MK-0616), an orally active macrocyclic peptide therapeutic targeting PCSK9 for LDL-C reduction. The dataset likely contains information on the design-make-test-analyze (DMTA) cycle and preclinical profiling of candidate compounds. The modular fragment assembly strategy used to overcome development bottlenecks in a prior lead compound is a central feature.
A figshare-hosted research document by Elomofe Ikuyinminu, last updated in May 2026. The study assesses the effects of three commercial seaweed extracts on micronutrient uptake and remobilisation in winter barley and wheat seedlings under nutrient-limited conditions. It details experimental methods using ICP-MS analysis and reports findings on biomass increases and nutrient partitioning.
300 multiple-choice questions on high myopia generated by five large language models for a benchmarking study. The dataset was created by Ligang Jiang and last updated on May 5, 2026. It includes objective metrics and expert ratings for evaluating the models' performance in generating educational content.
300 multiple-choice questions generated by five large language models for ophthalmic education on high myopia. The dataset was created by Ligang Jiang and last updated on May 5, 2026. It includes objective metrics and expert ratings evaluating the models' performance on 60 predefined generation tasks.
May 24, 2011 data from the Pawnee single-polarization Doppler radar, collected during the Midlatitude Continental Convective Clouds Experiment (MC3E) in Oklahoma and northeastern Colorado. This dataset was designed to support the CHILL radar and NASA ER-2 aircraft instrumentation, specifically for dual Doppler wind analysis. It is provided by the National Aeronautics and Space Administration.
Official vector boundary data for England's Combined Authorities, underpinned by Ordnance Survey and Office for National Statistics intellectual property. The dataset provides ultra-generalized (500m) boundaries, clipped to the coastline (Mean High Water mark). It is available for multiple reference years, including December 2020 and December 2025, and is accessible via multiple geospatial web services.
A 1.4 MB dataset uploaded by Ling Chen on figshare in May 2026, associated with a research paper on mixed membership models. The data likely contains multivariate categorical responses, such as from political surveys or population genetics studies, used to demonstrate a novel spectral estimation method. The work addresses estimation challenges for high-dimensional polytomous data with locally dependent noise.
A 5.6 MB dataset of human preference outcomes used to evaluate large language models. The data supports a novel statistical framework for online decision-making and inference in Reinforcement Learning from Human Feedback, proposed by author Nan Lu. It was last updated on 2026-05-18 and applied to analyze model performance on the Massive Multitask Language Understanding dataset.
2.4 KB of C++ source code implements a low-pass complementary filter for orientation estimation. The module fuses high-frequency gyroscope integration with low-frequency accelerometer vectors to resolve exact orientation angles. Authored by Jamie Davis and released open-access under CC BY 4.0, it was last updated on May 30, 2026.
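As a rough illustration of the fusion this entry describes, a complementary filter can be sketched in a few lines. This is a minimal single-axis sketch in Python, not the dataset's C++ code: the function names, the roll-only geometry, and the blend factor `alpha = 0.98` are all assumptions made for the example.

```python
import math

def accel_roll(ay: float, az: float) -> float:
    """Roll angle (radians) implied by the gravity vector in the accelerometer reading."""
    return math.atan2(ay, az)

def complementary_step(angle: float, gyro_rate: float, accel_angle: float,
                       dt: float, alpha: float = 0.98) -> float:
    """One update: trust the integrated gyro short-term, the accelerometer long-term.

    alpha is a hypothetical blend factor; values near 1 weight the gyro more heavily.
    """
    gyro_angle = angle + gyro_rate * dt  # integrate angular rate over one time step
    return alpha * gyro_angle + (1.0 - alpha) * accel_angle

def run_filter(samples, dt: float, alpha: float = 0.98, initial_angle: float = 0.0) -> float:
    """Run the filter over (gyro_rate, ay, az) tuples and return the final roll estimate."""
    angle = initial_angle
    for gyro_rate, ay, az in samples:
        angle = complementary_step(angle, gyro_rate, accel_roll(ay, az), dt, alpha)
    return angle
```

For a stationary sensor held at a fixed tilt, the estimate converges toward the accelerometer-derived angle, while brief accelerometer noise is attenuated by the `1 - alpha` weight.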
A production C++ source code implementation for a high-frequency safety supervisor module. The module tracks real-time variance shifts in multi-axis inertial samples to detect structural anomalies and trigger hardware shutdowns if safety thresholds are breached. The code is released open-access under CC BY 4.0 guidelines by author Jamie Davis.
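The variance-tracking idea in this entry can be sketched as a sliding-window monitor with a latching trip flag. This is a hedged Python illustration only, not the published C++ module: the class name, window length, threshold, and the choice of population variance per axis are assumptions, and a real supervisor would trigger a hardware shutdown rather than set a flag.

```python
from collections import deque
from statistics import pvariance

class VarianceSupervisor:
    """Hypothetical monitor: trips when any axis variance exceeds a threshold."""

    def __init__(self, axes: int = 3, window: int = 50, threshold: float = 0.5):
        # One fixed-length window of recent samples per inertial axis.
        self.windows = [deque(maxlen=window) for _ in range(axes)]
        self.threshold = threshold
        self.tripped = False  # latched: stays True once set

    def update(self, sample) -> bool:
        """Add one multi-axis sample and return the (latched) trip state."""
        for window, value in zip(self.windows, sample):
            window.append(value)
        full = len(self.windows[0]) == self.windows[0].maxlen
        if full and any(pvariance(w) > self.threshold for w in self.windows):
            self.tripped = True
        return self.tripped
```

The flag is latched so that a transient return to normal variance does not silently clear a detected anomaly; whether the original module behaves this way is not stated in the listing.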
A defined series of bridged cyclooctyne structures analyzed using high-level DLPNO–CCSD(T) calculations for Strain-Promoted Azide–Alkyne Cycloaddition (SPAAC). The dataset, authored by Abdulkader Baroudi and last updated in June 2026, identifies architectures with reduced activation barriers and correlates reactivity with alkyne-localized distortion. Results establish alkyne bond angles as a predictor of reactivity and examine effects of heteroatom substitutions.
A 2023 bathymetry surface created from a contracted national reference survey in Gulf St Vincent, South Australia, for calibrating multibeam echosounders. The Australian Hydrographic Office acquired the data in September 2020 using Kongsberg EM 2040 and EA440 echosounders. Separate 1-meter resolution grids are provided for two surveyed sites in MSL, LAT, and Ellipsoid vertical datums, exported as 32-bit floating point GeoTIFFs.
A 0.5m resolution bathymetry survey of the Torres Strait was acquired for the Australian Hydrographic Office on 12 December 2021. Separate high-resolution grids are provided for two surveyed sites in multiple vertical datums (MSL, LAT, Ellipsoid). The data was processed using QPS Qimera and exported as 32-bit floating point GeoTIFFs.
SRC collected surface meteorological and radiation data from December 1993 to December 1996 across a 1000 km by 1000 km area of northern Manitoba and Saskatchewan. The dataset includes Suite A measurements from ten sites and Suite B diffuse solar and longwave measurements from five of those sites. It provides a multi-year record designed for researchers studying near-surface energy balance in the boreal forest.
VIIRS/NPP Land Surface Temperature/Emissivity Daily L3 Global 1km SIN Grid Night V001 was decommissioned on April 8, 2025. This daily, global 1-kilometer resolution dataset provides nighttime land surface temperature estimates, derived from cloud-free observations with a minimum 15% coverage threshold. It was algorithmically aligned with MODIS products to ensure continuity within NASA's Earth Observing System.