Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,925 datasets
A retrospective cohort of 16,518 pediatric emergency department encounters for acute viral respiratory illness from 2019 through 2025. The dataset, authored by Rashed A. Hasan and shared under CC-BY-4.0, includes 2,384 preterm-born children and tracks outcomes like ICU admission and hospital length of stay. It was last updated in June 2026.
Numerical simulation data support a study on the mechanical behavior of irregularly-shaped pipelines during marine hoisting operations. The dataset includes Mises stress, strain, sling tension, and parameter sensitivity results for three pipeline types, generated using a custom C++/Qt tool and OrcaFlex software. Author Zhang Wenbo published the 4.2 MB dataset under a CC-BY-4.0 license on figshare, with a last update in May 2026.
A sample of 100 high-demand course textbooks from the University of Canterbury library catalogue was compared for availability on the Anna's Archive website in May 2026. Author Dave Clemens conducted searches for matching editions, finding 64 matches. The dataset includes comments on edition availability.
Data from the SWOT satellite's Poseidon-3C altimeter, launched December 16, 2022, provides near-real-time measurements along its nadir track. The dataset includes sea surface height, significant wave height, and wind speed measurements at sampling resolutions of approximately 6 km (1 Hz) and 300 m (20 Hz). It is processed using onboard DORIS orbit ephemeris with predicted auxiliary data and has a nominal latency of less than 7 hours.
The VNP64A1 Version 1 dataset is a decommissioned monthly global product providing per-pixel burned area information at a 500-meter resolution. It uses a hybrid algorithm on VIIRS satellite imagery to identify the ordinal day of burn for each grid cell, designed to continue the MODIS burned area record. The dataset was released on a limited basis due to known quality issues near water bodies and high latitudes.
A high-fidelity behavioral state map engineered to optimize Dynamic Voltage and Frequency Scaling (DVFS) and task migration across smartphone multi-core SoCs. The dataset is formatted as a packed Q16.16 fixed-point integer matrix for real-time kernel-space scheduling and was generated via Hardware-in-the-Loop (HIL) state simulations targeting Energy-Delay Product (EDP) boundaries. It was authored by Jamie Davis and last updated on June 2, 2026.
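For readers unfamiliar with the Q16.16 format mentioned in this entry, the following is a minimal sketch of how such values are commonly encoded and multiplied: 16 integer bits and 16 fractional bits stored in one integer. The helper names (`to_q16_16`, `from_q16_16`, `q16_16_mul`) are hypothetical and are not taken from the dataset, whose actual packing layout is not described here.

```python
# Minimal Q16.16 fixed-point sketch (illustrative; not the dataset's own code).
FRACTIONAL_BITS = 16
ONE = 1 << FRACTIONAL_BITS  # 65536 represents 1.0


def to_q16_16(value: float) -> int:
    """Encode a real number as a Q16.16 integer (rounded to nearest)."""
    return int(round(value * ONE))


def from_q16_16(raw: int) -> float:
    """Decode a Q16.16 integer back to a float."""
    return raw / ONE


def q16_16_mul(a: int, b: int) -> int:
    """Multiply two Q16.16 values; the product carries 32 fractional
    bits, so shift right by 16 to return to Q16.16."""
    return (a * b) >> FRACTIONAL_BITS


if __name__ == "__main__":
    x = to_q16_16(1.5)
    y = to_q16_16(2.0)
    print(from_q16_16(q16_16_mul(x, y)))
```

Python integers do not overflow, so a real 32-bit kernel-space implementation would also need a wider intermediate for the product; that detail is omitted from this sketch.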
Three sediment cores from Nara Inlet in the Whitsunday Islands, central Great Barrier Reef, Australia. The data, hosted by the Australian Ocean Data Network, was last updated on 2026-06-04. It describes sediment composition and accumulation rates over the last 3000 years.
A 2.4 KB text file containing a highly optimized, constant-time matrix inversion kernel for 32-bit bare-metal architectures. The core, authored by Jamie Davis and last updated in June 2026, implements an analytical 3x3 inversion method using determinants and adjugate matrices. It is designed for real-time sensor fusion, inertial navigation, and Kalman filtering loops.
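The analytical method named in this entry can be sketched as follows: compute the nine cofactors, obtain the determinant by expanding along the first row, and divide the adjugate (the transposed cofactor matrix) by the determinant. This is a plain floating-point illustration under that textbook definition, not the dataset's 32-bit kernel; the function name `invert_3x3` is hypothetical.

```python
# Illustrative 3x3 inversion via determinant and adjugate
# (a textbook sketch, not the kernel distributed in the dataset).
def invert_3x3(m):
    (a, b, c), (d, e, f), (g, h, i) = m

    # Cofactors of each element, signs included.
    c00 = e * i - f * h
    c01 = -(d * i - f * g)
    c02 = d * h - e * g
    c10 = -(b * i - c * h)
    c11 = a * i - c * g
    c12 = -(a * h - b * g)
    c20 = b * f - c * e
    c21 = -(a * f - c * d)
    c22 = a * e - b * d

    # Determinant by cofactor expansion along the first row.
    det = a * c00 + b * c01 + c * c02
    if det == 0:
        raise ValueError("matrix is singular")

    # The adjugate is the transpose of the cofactor matrix.
    return [
        [c00 / det, c10 / det, c20 / det],
        [c01 / det, c11 / det, c21 / det],
        [c02 / det, c12 / det, c22 / det],
    ]


if __name__ == "__main__":
    print(invert_3x3([[1, 2, 3], [0, 1, 4], [5, 6, 0]]))
```

The operation count is fixed regardless of the input values, which is what makes this closed-form approach attractive for constant-time loops; the singularity check above is the only data-dependent branch.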
Stormwater Culverts in the Australian Capital Territory are mapped in this polyline dataset. Attributes likely include location description, suburb, ownership, maintenance responsibility, asset sub-type, material, and physical dimensions like length, width, and height. The data is captured through works-as-executed handover or field audits for assets owned or managed by the City Services directorate.
NASA/NOAA's VNP21A2 dataset is an 8-day composite Land Surface Temperature and Emissivity (LST&E) product derived from the Suomi NPP VIIRS sensor. It provides global coverage at a 1-kilometer spatial resolution, combining daytime and nighttime acquisitions into a single HDF file. The product is algorithmically compatible with MODIS data to ensure continuity in Earth observation records.
131 samples of 22 essential medicines across 8 therapeutic categories were procured from licensed retail outlets in Kerala, India. The dataset contains results from pharmaceutical quality testing and cost-effectiveness analysis conducted by Cyriac Abby Philips, last updated in June 2026. All samples met pharmacopeial standards, and generic medicines were on average 48.6% cheaper than branded equivalents.
Philipp Pably published this dataset in June 2026. It contains recorded process data from lab-scale experiments using a 5-L benchtop reactor, cultivating Escherichia coli between August 2024 and January 2026. The data supports the development and verification of model-based dissolved oxygen controllers for pulsed feeding regimes.
30 responses each from three large language models (ChatGPT 5, DeepSeek V3, Grok 4) evaluated for ADHD-related content. The dataset contains scores for content accuracy, readability (FKRE, FKGL, SMOG), lexical complexity, and response stability. Author Xingmin Han published the data on figshare in June 2026.
NASA/NOAA's Suomi NPP VIIRS VNP43MA2 Version 2 product provides Bidirectional Reflectance Distribution Function (BRDF) and Albedo quality data at a 1-kilometer resolution. It is produced daily using a 16-day temporal window weighted to the ninth day, providing quality and observation-day information for nine VIIRS moderate resolution bands and three broadbands. The dataset uses the RTLSR kernel-driven BRDF model to reconstruct surface anisotropic effects and supports the continuity of NASA's MODIS BRDF/Albedo product suite.
260 KB of supplementary material from a 2026 study on skeletal muscle regeneration in mice. The study, shared on figshare, investigates the role of Homer 2 protein in the calcineurin-NFAT pathway during muscle repair. Data includes immunofluorescence staining results for Homer 2, NFATc1, Pax7, and myogenin from mouse tibialis anterior muscles at 2, 4, and 6 days post-injury.
A 5.5 KB Excel dataset contains experimental results for Fe3O4/TiO2 nanocomposites used to degrade Reactive Yellow 145 dye. It was created by Safdar Abbas Kazmi and last updated in June 2026. The data supports a study where a Convolutional Neural Network model achieved a predictive R² of 0.91 for photodegradation efficiency.
Modelled hillslope erosion data for New South Wales, Australia, produced using the Revised Universal Soil Loss Equation (RUSLE). The dataset includes monthly and annual time-series maps from 2000 to the present, incorporating factors like rainfall erosivity, soil erodibility, slope, and ground cover. It was created by the NSW Department of Climate Change, Energy, the Environment and Water.
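As background for this entry, RUSLE estimates average soil loss as the product of its factors, A = R × K × LS × C × P. The sketch below shows that multiplication for a single cell. The function name and the sample values are hypothetical and are not drawn from the NSW dataset, which may combine or parameterize the factors differently.

```python
# Illustrative RUSLE calculation for one grid cell
# (sample inputs are made up; not values from the NSW dataset).
def rusle_soil_loss(r, k, ls, c, p=1.0):
    """Return soil loss A = R * K * LS * C * P.

    r  : rainfall erosivity factor
    k  : soil erodibility factor
    ls : combined slope length and steepness factor
    c  : cover and management factor (ground cover)
    p  : support practice factor (1.0 when no practice is applied)
    """
    return r * k * ls * c * p


if __name__ == "__main__":
    print(rusle_soil_loss(r=1000.0, k=0.03, ls=2.0, c=0.1))
```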
Hiren Jethva presents a global dataset of above-cloud aerosol optical depth (ACAOD) retrieved using the Depolarization Ratio method applied to CALIOP lidar cloudy-sky measurements. The dataset was validated against airborne HSRL-2 lidar and 4STAR sun photometer measurements from the ORACLES field campaign in 2016–2018 and compared with Aqua-MODIS and OMI satellite retrievals. The data, last updated in May 2026, is shared under a CC-BY-4.0 license.
Fire hydrant data from multiple Quebec municipalities, including Montreal, Quebec City, and Trois-Rivières. The dataset likely contains point locations and associated attributes such as installation date, operational status, owner, and jurisdiction. These records are published annually for public information but are not intended for real-time engineering use without city validation.
A 1.9 KB open-access reference dataset published by Jamie Davis in 2026. It contains the core for a deterministic, fixed-point phase compensation algorithm designed to neutralize filter delays in high-frequency tracking systems like robotic neuro-navigation and missile guidance.