Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,377 datasets
Haiyan Gao published a research article on figshare under CC-BY-4.0 license. The document presents findings from the Fujian Birth Cohort Study (FJBCS), analyzing associations between sleep variability, sleep irregularity, and gestational metabolic syndrome (GMS) in pregnant women. Data was collected from 2019 to 2021 across four sub-cohorts.
An Australian and New Zealand extract of Ookla Speedtest Open Data, containing approximately 88,000 lines of aggregated test results. The data includes average download and upload speeds, number of tests, and number of users per 600m x 600m grid cell, with centroids provided as latitude and longitude. The dataset was created by Richard Ferrers and last updated in May 2026.
The Bureau of Mineral Resources developed a reconnaissance-style mapping technique for documenting surficial cover on reefs. These maps exhibit only surficial facies and apply a simple bathymetric classification differentiating supratidal, intertidal, and subtidal zones. The data was compiled in 1982 and edited in 1983, with map resolution variable over the reef and generally decreasing with increasing water depth.
María Ángeles Ramón y Cajal Junquera donated 28 documents related to her grandfather, Nobel laureate Santiago Ramón y Cajal. The collection includes press clippings, interviews, correspondence, and lawsuits concerning the Cajal Legacy, which comprises 28,223 pieces. Santiago Giménez-Roldán authored this PDF archive, last updated in May 2026.
Natural Resources Canada's HRDEM provides elevation data for the entire Canadian territory. It is derived from airborne LiDAR in the south and satellite imagery in the north, offering Digital Terrain and Surface Models at 1 m or 2 m resolution. The data is referenced to the Canadian Geodetic Vertical Datum of 2013 (CGVD2013).
A mini-review PDF document authored by Esra Sümer-Arpak, last updated on 2026-05-28. The 64.8 KB file discusses the application of foundation models for decoding inner speech using non-invasive neuroimaging modalities like fMRI, EEG, MEG, and fNIRS. It covers architectural trends, pretraining strategies, and challenges in the field.
160 monitoring stations form the Water Quality Reference Network, with data collected from 2005 to the present. The dataset consolidates results from physicochemical and microbiological analyses of water and sediment matrices. Records include location, date, observed property, and result, with notes on quantification limits.
Daily updates from State and Territory roadworks APIs build a nationally consistent, harmonised historic database of roadworks and road closures over time across all of Australia. The National Freight Data Hub (NFDH) generates efficiencies by collecting and harmonising this data to the national level. This dataset could be used to understand, predict, and manage road damage and closures in the context of natural disasters or disruptions.
Petra J. Mudie authored this dataset, which includes taxonomic descriptions and SEM images of dinoflagellate cysts from the Mid-Pleistocene Bakunian Stage (MIS 21–11) of the Caspian Sea and MIS 19–9 of the Black Sea and Gulf of Corinth. The data is stored in a 1.6 MB DOCX file and was last updated on June 1, 2026. It focuses on cruciform cysts used to characterize salinity and biotic migrations in Eurasian basins.
A 763.6 KB PDF document authored by Maria Pia Ciano, last updated on June 1, 2026. It presents a framework derived from a systematic literature review, showing how Lean practices and Industry 4.0 technologies contribute to the sustainability, resilience, and human-centricity pillars of Industry 5.0.
Yuening Yang's dataset contains results from a multiphysics coupling simulation evaluating bolt preload in T-type electrical clamps under live-working conditions. The data supports a method validated with an average error of 5.82% against experimental results from the Zhejiang Shangjian Electric Power Testing Institute. The dataset was last updated on 2026-05-12.
A 38.0 KB Excel dataset presents results from a multiphysics coupling method for evaluating bolt preload in T-type clamps on live distribution networks. The method, proposed by author Yuening Yang, achieved an average error of 5.82% compared to experimental results from the Zhejiang Shangjian Electric Power Testing Institute. The dataset was last updated on May 12, 2026.
A figshare document by Lizhu Yuan, last updated June 1, 2026, under a CC-BY-4.0 license. The study investigates cadmium accumulation mechanisms in four plant species—ryegrass, castor bean, amaranth, and mirabis—under C14 alkane stress. It uses multivariate analyses to identify mineral element homeostasis as a central mediator of plant response and phytoremediation efficacy.
Alaska and Canada's Yukon Flats and Peace-Athabasca Delta are covered by wetland inundation maps at ~10-meter resolution. NASA produced this time series, with maps estimated every 12 days during the free-water periods from May to October across 2017 to 2019. The dataset includes detailed coverage maps with five land/water classes and derived frequency maps showing how often areas were inundated.
A GraphRAG framework integrates large language models with a spatiotemporal knowledge graph to enhance spatial reasoning. The graph explicitly models campus facilities, events, accessible entrances, and real-time parking availability. The approach, authored by Wenyu Zhang and shared under CC-BY-4.0, is implemented as a decision-support chatbot with a visual analytics interface.
Gulshat Amirkhanova's research document describes a multi-agent pipeline for generating SCORM 1.2-compliant e-learning courses from enterprise documents. The system uses large language models and retrieval-augmented generation (RAG) to automate course design and content creation. The document, last updated on 2026-06-01, is 679.8 KB in size and shared under a CC-BY-4.0 license.
A 470 KB PDF report details experiments on the combined effects of Arylsulfatase B (ARSB) and Pembrolizumab in a syngeneic mouse model of metastatic melanoma. The report, authored by Sumit Bhattacharyya and shared under a CC-BY-4.0 license, describes how ARSB treatment interacts with Pembrolizumab to potentially improve therapeutic responses. Findings suggest the combination may increase apoptosis, reduce metalloproteinases and invasiveness, and alter cytokine expression.
995 participants in an online experiment examined how media narratives shape public views of AI-assisted healthcare in China. The dataset includes experimental conditions, manipulation checks, pre-existing trust, attitudes, risk perceptions, and demographics. Fangzhou Zhou published the data on figshare in June 2026.
March 2020 and January-February 2021 bathymetry data for the South-west Corner Marine Park, collected by Geoscience Australia. The survey covers 330 km^2 offshore from Cape Naturaliste to Cape Leeuwin, producing a 5-meter resolution geotiff from processed multibeam sonar. It was funded by the National Environmental Science Program Marine Biodiversity Hub and partners to build baseline information for benthic habitats.
CARS2000 provides seasonal maps of sea salinity and other oceanographic properties for the Australian region. The data is derived from the World Ocean Atlas 98 and CSIRO Marine and NIWA archives, interpolated to a 0.1-degree grid at depths of 0, 150, 500, 1000, and 2000 meters. It was designed to improve upon the Levitus WOA98 Atlas for the region spanning 100-200E and 50-0S.