Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
42,817 datasets
Data from 943 patients collected over 12 years in the DiabMemory system, supplemented by 100 synthetic notes, were analyzed for physical activity information extraction. The dataset was created by Fabian Wiesmüller and last updated in May 2026. It includes pseudonymized free-text notes from a diabetes telehealth platform.
Survey data from a study of 1,229 participants who used the Headspace mindfulness app during a public health deployment. The data includes two survey time points measuring distress, loneliness, mental health stigma, and use of other online mental health tools. The dataset was authored by Judith Borghouts and last updated in May 2026.
A field study dataset comparing concentrations of mineral-associated organic carbon (MAOC) and particulate organic carbon (POC) in desertifying grasslands of Inner Mongolia, China. The data includes mean MAOC values of 15.66 g/kg and 14.99 g/kg at two depths in the typical steppe, and 12.10 g/kg and 11.64 g/kg in the desert steppe, alongside POC concentrations. Authored by Hao Peng and published on figshare in May 2026.
Cupos otorgados para licencias de cannabis psicoactivo tracks the quotas granted for cultivating psychoactive cannabis plants in Colombia. The data includes counts for initial applications and granted quotas for both ordinary and supplementary license types. Information is available from 2017 through the first quarter of 2025, sourced from the Colombian open data portal www.datos.gov.co.
World Bank data on energy and mining for Australia, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset covers energy production, use, dependency, and efficiency metrics. It was last updated on 2026-04-27 and is provided under a CC-BY-4.0 license.
Portuguese wine regions provided 397 yeast strains for a study of their biocontrol potential against common grape phytopathogenic fungi. The dataset, authored by Marcos Esteves and last updated in May 2026, contains results from time-course monitoring of mold growth inhibition. All tested yeasts displayed antagonistic activity against at least one of the four fungal targets: Aspergillus, Botrytis, Rhizopus, and Penicillium.
A dataset of cell identifiers used in a scientific machine learning (SciML) project to model voltage-gated potassium (Kv) ion channels. The data, published by Domas Linkevicius on figshare in April 2026, supports a unified Hodgkin-Huxley-like model fitted to recordings from 20 different Kv types.
A structured dataset of paired design alternatives, consisting of baseline solutions and AI-generated variants developed at multiple levels of representational definition. The dataset, created by Francesco Sica and last updated in May 2026, supports a two-phase comparative framework linking representational variability to economic outcomes for sustainable architectural and urban design.
A 223.1 KB PDF document authored by Aliyah J. Ross and last updated on May 7, 2026, presents experimental data on fear behaviors in mice. The study investigates the role of the plasma membrane monoamine transporter (PMAT) in fear conditioning, generalization, and extinction, using PMAT wildtype and heterozygous male and female mice. It is shared under a CC-BY-4.0 license on the figshare platform.
European farms and a zoo in Czech Republic and Italy provided specimens for this dataset. It contains morphological, biometrical, and molecular data for 58 Trichuris tenuis nematodes collected from alpacas, llamas, and guanacos. Simona Rejnková published the integrative taxonomic study on figshare in May 2026.
1246 experimental solvation free energy measurements from the Minnesota Solvation database were used to parametrize a Poisson–Boltzmann surface area model for 27 organic solvents. The dataset likely contains calculated solvation free energies, partition coefficients, and blood-brain barrier permeability predictions, supporting the development of computationally efficient ADMET descriptor models. The dataset was created by Taoyu Niu and last updated on May 5, 2026.
An expanded Poisson–Boltzmann surface area (PBSA) solvation model for calculating solvation free energies (SFEs) and partition coefficients (logP) across diverse organic solvents. The dataset includes parameters derived from 1246 experimental SFE measurements from the Minnesota Solvation (MNSol) database and a predictive model for blood–brain barrier (BBB) permeability (logBB). The work was authored by Taoyu Niu and published on figshare in May 2026.
526 English articles on occupational therapy for disorders of consciousness were analyzed using bibliometric methods. The dataset, created by Jinqin Zhang, includes knowledge maps generated with CiteSpace focusing on authors, institutions, countries, and keywords. It was last updated on May 7, 2026.
Geoscience Australia conducted marine surveys in Jervis Bay, New South Wales, across 2007, 2008, and 2009. Data includes a family per sample matrix generated by aggregating species-level information from infauna sampling. Surveys mapped seabed bathymetry and characterized benthic environments using sediment sampling, underwater video, and photography.
Data acquired during 2017 and 2018 includes reflected light microscopy images, backscattered electron images, and laser ablation ICP-MS metadata for Fe-Ni-Cu sulfide minerals. Samples are dunites, harzburgites, and pyroxenites from the upper mantle and lower crust of the Kohistan arc system in northern Pakistan. The data were gathered under the From Arc Magma to Ore System (FAMOS) Project to understand trace element concentrations in sulfide minerals.
NERC Grant NE/S00162X/1 data includes nanoindentation, high-angular resolution electron backscatter diffraction (HR-EBSD), and transmission electron microscopy data from synthetic forsterite bicrystals. The data were collected on two samples with high- and low-angle grain boundaries at room temperature. These data support the published manuscript 'The Role of Grain Boundaries in Low-Temperature Plasticity of Olivine Revealed by Nanoindentation' (DOI: 10.1029/2023JB026763).
Data from brine and CO2 flow-through experiments on three sandstones with varying porosity and clay content. Geophysical and transport properties were measured before, during, and after CO2 exposure in a high-pressure laboratory setup at the National Oceanography Centre, Southampton in 2022. The dataset was produced by the British Geological Survey as part of the OASIS, EHMPRES, and FOCUS projects.
High-resolution oxygen isotope data from planktic foraminifera in sediment cores collected offshore Montserrat during IODP Expedition 340. The dataset provides an age framework for the upper ~300,000-year sections of three cores, with sampling resolution estimated at every 2000 years. This work was funded by NERC grant NE/K002724/1 and involves collaboration with scientists from the University of Massachusetts and Florida State University.
A case report details an 89-year-old male patient who developed myopathy with elevated creatine kinase (2,103 U/L) after concomitant use of atorvastatin and almonertinib. The report, authored by Jifang Zhou and published on figshare under CC-BY-4.0, describes the diagnosis and resolution after drug discontinuation. It concludes with recommendations for alternative cholesterol-lowering therapies in patients taking almonertinib.
Data from data.bts.gov provides the monthly Twenty-foot Equivalent Unit (TEU) capacity of container ships calling at U.S. ports from January 2020 to September 2023. The dataset represents the total potential container slots available on ships that docked, offering a supply-side view of maritime trade. It was last updated on 2026-05-28 15:56:03.