Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,943 datasets
Alaa Al-Haddad's proof-of-concept study provides a dataset of 160 AI-generated responses to 80 patient prompts across four cosmetic dentistry domains. The dataset, last updated in 2026, includes comparative evaluations of two large language models using a 20-point rubric assessing readability, practicality, empathy, structure, and safety. The data supports the feasibility of a multi-metric framework for evaluating patient-facing AI communication.
315 valid survey responses from small and medium-sized textile and garment enterprises in Ghana's Western Region. The data was gathered via simple random sampling and analyzed using Partial Least Square Structural Equation Modelling (PLS-SEM 4.0). The study was authored by Kweku Safo-Ankama and published on figshare in May 2026.
Satellite-derived river ice roughness maps for selected Canadian regions in the current calendar year, produced by Natural Resources Canada. The dataset is updated in near real-time, typically within 4 hours of new satellite imagery acquisition, to support emergency response. Coverage is not comprehensive nationwide.
A metabolomics study from 2026 analyzed biological samples from 18 beef steers rotationally grazing toxic, novel, or endophyte-free tall fescue pastures. The dataset includes untargeted high-resolution metabolomics and targeted volatile fatty acid analysis results from urine, saliva, plasma, rumen fluid, and feces. It was authored by Ignacio M. Llada and shared under a CC-BY-4.0 license.
2022-23 data provides an overview of income and tax status for Australian individuals, companies, partnerships, trusts, and funds. The dataset offers a cross-sectional view of the Australian tax system for that financial year, with a historical entry also noted for 2011-12. It is published by the Australian Taxation Office and available in multiple tabular and geospatial formats.
TLabel Format is a unified annotation framework for tactile manipulation data. It includes validation data from two sensors: Daimon-Infinity (94 episodes, 6 tasks) and PaXini PXCap (15 episodes, 7 tasks). The dataset was published by Xi Luo on figshare in June 2026.
Experimental data from a study on how physical effort influences speech adaptation to auditory perturbations. The dataset includes acoustic (formant frequencies) and physiological (EMG) measurements from 21 native French speakers performing a speech task under two effort conditions. The data was published by Elodie Ronayette on figshare under a CC-BY-4.0 license in May 2026.
Geoscience Australia Data provides a dataset on Earth's rotation rate and length-of-day (LOD) variations since 1962. The data is used to analyze the 18.6-year lunar nutation signal and long-term trends, with extrapolations suggesting LOD may vary between -1 ms to +1 ms until at least 2050. The associated research article was published in 2026.
An unbalanced panel dataset of 298 Chinese prefecture-level cities from 2000 to 2023, created by Fanke Sheng. It measures urban ecological resilience using an entropy-weighted composite index of 14 environmental indicators to evaluate the impact of China's 2014 National New Energy Demonstration City pilot policy.
A meta-analysis synthesizing 94 effect sizes from 54 independent samples, totaling 36,583 participants, to examine the link between upward social comparison on social media and psychological outcomes. The research was conducted by Yuqing Lei and published on figshare in May 2026. It found an average correlation of r = 0.330, with the strongest link to social-evaluative negative emotions.
Textual data addressing research questions on multi-objective optimization paradigms in energy-aware production scheduling. The dataset was authored by Andreas Nearchou and last updated on 2026-05-30. It is a small dataset of 19.4 KB, available in DOCX and TXT formats.
7,547 doctor profiles from the Chinese online medical platform haodf.com were collected using Python. The dataset includes calculated facial feature values from doctor images, such as attractiveness, smile, and gender, linked to offline conversion rates. It was created by Xue Zhang and last updated in June 2026.
7547 doctor profiles, including images and ratings, were scraped from the Chinese platform haodf.com. The dataset likely contains calculated facial feature values and offline conversion metrics. It was authored by Xue Zhang and last updated in June 2026.
7547 doctor profiles were scraped from the Chinese online medical platform haodf.com. The dataset includes calculated facial feature values and was used to study the impact of appearance on offline service conversion rates. It was authored by Xue Zhang and last updated in June 2026.
Xue Zhang published regression results on 2026-06-03. The dataset contains facial feature analysis for 7,547 doctors from the Chinese online medical platform haodf.com. It likely includes regression results exploring the relationship between facial features and user conversion rates.
54 Chinese charitable foundations were analyzed using fsQCA to explore fundraising effectiveness. Zhe Zhu published this dataset on figshare in June 2026. The study identifies key conditions like digital transparency infrastructure and program activity intensity.
54 Chinese charitable foundations were analyzed using fsQCA to explore fundraising effectiveness. The study identifies digital transparency infrastructure and program activity intensity as key conditions for high fundraising benefits. Zhe Zhu authored this dataset, which was last updated on June 3, 2026.
An analysis of 54 Chinese charitable foundations explores factors influencing fundraising effectiveness using fsQCA. The dataset likely contains variables related to digital transparency, program activity intensity, professional management, and political connections. It was authored by Zhe Zhu and last updated on June 3, 2026.
54 Chinese charitable foundations were analyzed using the fsQCA method to explore factors influencing fundraising effectiveness. The dataset, authored by Zhe Zhu and last updated in June 2026, identifies two core configuration paths leading to high fundraising benefits. It is a small dataset, 5.5 KB in size, shared under a CC-BY-4.0 license on figshare.
54 Chinese charitable foundations analyzed using fsQCA to explore factors influencing fundraising effectiveness. The dataset likely contains variables such as digital transparency infrastructure, program activity intensity, professional management level, and political connections. The study was authored by Zhe Zhu and uploaded to figshare.