Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
41,396 datasets
205.2 KB of processed data from online tourist reviews for Con Dao, Vietnam, analyzed using Aspect-Based Sentiment Analysis and Penalty-Reward Contrast Analysis. The dataset, created by Van-Manh Pham and last updated in May 2026, quantifies Cultural Ecosystem Services dimensions like Aesthetics, Recreation and Ecotourism, and Sense of Place. It provides empirical evidence for assessing the social-ecological nexus in island tourism development.
From March 2019 to August 2022, the NSW Department of Planning and Environment conducted a multibeam sonar survey off Cape Hawke, NSW, onboard the RV Bombora. The dataset provides 5-meter resolution 32-bit GeoTIFF files of bathymetry and backscatter, intended as a baseline for mapping seabed type distribution. It was created as part of the SeabedNSW program, processed using Hypack, R2Sonic GUI, POSPac, Qimera, and FMGT software.
Áreas de Protección para la Producción de Alimentos (APPA) for the municipality of Concordia in the department of Antioquia, Colombia. The dataset, with a total area of 13,040.13 hectares, is defined by Resolution 320 of September 22, 2025, from the Ministry of Agriculture and Rural Development (MADR). It spatially delimits areas designated for agricultural, livestock, fishing, and aquaculture activities to protect the human right to food and ensure food sovereignty.
Longzhi Chen published a supplementary document on figshare on 2026-05-11 detailing the optimization and characterization of a novel chitinase enzyme. The 2.2 MB document describes a 49.43-fold enhancement in enzyme production and the immunomodulatory effects of its product, N-acetylglucosamine, in a mouse model. The data likely contains experimental results, optimization parameters, and efficacy measurements supporting the described biotechnological and immunological findings.
Enlicitide (MK-0616) is an orally active macrocyclic peptide therapeutic discovered to target PCSK9 for LDL-C reduction. The dataset, shared by Hubert Josien in May 2026, details the modular fragment assembly strategy used to overcome development bottlenecks in a prior lead compound. This approach accelerated the design-make-test-analyze cycle, delivering compounds with low picomolar potency, improved solubility, stability, and pharmacokinetics.
A 2026 qualitative study by Wei Wang explores mechanisms of change in an 8-week mindfulness program for 13 elite athletes. Data includes participants' stated intentions, practice notes, and semistructured interviews, analyzed via reflexive thematic analysis. The findings provide contextual insight for refining mindfulness-based programs to support athlete mental wellbeing.
Xinyu Du's dataset contains 1,101 Neolithic settlement sites within China's Dongting Lake Basin. The data was analyzed using the DBSCAN algorithm and ArcGIS 10.5 to delineate settlement clusters across cultural phases and examine their spatiotemporal evolution. The study provides a foundation for Neolithic cultural heritage conservation and exploring human-land relationships.
A 17.5 KB CIF file contains the crystal structure data for a lead-free perovskite single crystal. The dataset, authored by Tao Song and last updated in May 2026, supports research into high-performance, eco-friendly optoelectronic materials. The described crystal, NH4CsIn1.33Cl6 doped with Sb3+, achieved a photoluminescence quantum yield of 78.5%.
October 2023 inventory of information assets from a public administration source, likely Colombia's Ministry of Information and Communications Technologies (MINTIC). The dataset documents assets with details on their classification, legal basis, responsible areas, and retention periods. It was published by www.datos.gov.co and last updated on 2026-05-18.
Real-time atmospheric pollutant measurements from an automatic monitoring station in Piedecuesta, Colombia, operated by the CDMB. The dataset includes concentrations of PM₁₀, PM₂.₅, SO₂, NO₂, O₃, CO, and TRS, alongside meteorological parameters for the first seven months of 2025. Each data point is accompanied by a detailed validation flag indicating its quality status, as per Colombian environmental regulations.
Principal statistics for Canada's mineral industries, including metal ore and non-metallic mineral mining and quarrying. The data covers revenue, expenses, employee counts, and inventory levels, compiled by Statistics Canada. The dataset was last updated on June 3, 2026.
The Great Cumbung Swamp in Eastern Australia is the terminus of the low-gradient Lachlan River. The dataset likely contains a detailed scientific description of the swamp's three depositional environments: the Lachlan channel, Phragmites Marsh, and overflow areas. It was published by the Australian Ocean Data Network and last updated in May 2026.
11.7 GB of decomposed motor unit spike trains and corresponding force measurements from two human participants performing controlled finger contractions. The dataset, created by Farah Baracat and last updated in April 2026, was generated from high-density intramuscular electromyography recordings using the swarm-contrastive-decomposition framework.
483.3 MB of data from a study evaluating the multi-strain probiotic NeuralliTM-CORE in a dextran sulfate sodium (DSS)-induced colitis mouse model. The dataset includes results on disease activity, body weight, histopathology, cytokine profiling, and 16S microbiota analysis. It was authored by Fu-Sheng Deng and last updated on 2026-05-19 under a CC-BY-4.0 license.
NeuralliTM-CORE_part 3 is a dataset from a study evaluating a multi-strain probiotic formulation in a dextran sulfate sodium (DSS)-induced colitis mouse model. The dataset, 432.6 MB in size and authored by Fu-Sheng Deng, was last updated on 2026-05-19. It likely contains results from cytokine profiling and microbiota analysis related to colitis severity and microbial resilience.
A Spanish secondary school study analyzes the role of self-awareness work in improving classroom attention for 26 students in a first-year vocational training program. The research uses a mixed-methods design with questionnaires, participant and non-participant observation, Likert scales, and activity evaluation rubrics across four sessions. The dataset, authored by Maria Babí and last updated in June 2026, is a 764.7 KB PDF document.
Experimental data from a study evaluating the multi-strain probiotic NeuralliTM-CORE in a dextran sulfate sodium (DSS)-induced colitis mouse model. The dataset includes measurements of disease activity, body weight, histopathology, cytokine levels, and microbiota analysis. It was authored by Fu-Sheng Deng and last updated on 2026-05-19.
Maria Babí's research dataset analyzes the role of self-awareness work in improving classroom attention for 1st-year vocational training students in a public secondary school. The mixed-methods study involved 26 students and implemented a four-session program using Likert questionnaires, participant observation, and activity evaluation rubrics. The data was last updated on figshare on June 2, 2026.
NOVAGene introduces a nonlinear method to improve weighted gene co-expression network analysis for low-variability datasets, such as those from sepsis studies. The approach was evaluated using a local sepsis dataset and is documented in a 14.8 MB PDF file. Junelle Rey C. Bacong authored this work, which was last updated on May 12, 2026.
A mouse model dataset evaluating the protective effects of the NeuralliTM-CORE probiotic formulation against dextran sulfate sodium-induced colitis. The data likely contains measurements of disease activity, body weight, histopathological damage, cytokine levels, and microbiota composition. The dataset was authored by Fu-Sheng Deng and last updated on May 19, 2026.