Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
42,901 datasets
A study by Zhongqun Guo employed microbially induced carbonate precipitation (MICP) to solidify ion-adsorbed rare earth mining tailings. The dataset, last updated in May 2026, contains experimental results measuring unconfined compressive strength up to 770 kPa and heavy metal leaching concentrations. It investigates the effects of treatment method, cementation solution concentration, and initial heavy metal contamination levels.
Longitudinal data from 6, 9, and 12-month-old infants, collected via eye-tracking to study audiovisual speech processing. The dataset contains results from General Linear Mixed Models analyzing preferential looking behavior in infants at typical and elevated likelihood of autism. It was authored by Elena Capelli and last updated on 2026-05-12.
A dataset split for benchmarking multimodal sentiment analysis models, published by Wenyan Xiao on 2026-05-12. It contains textual, acoustic, and visual information aligned for emotion interpretation. The data is stored in an XLS file with a size of 5.5 KB.
Solvencia Consolidada determines consolidated solvency and capital adequacy ratios for financial supervision in Colombia. Data originates from official reports like CUIF and F-239, with values recorded in Colombian pesos. The dataset is hosted by datos.gov.co and was last updated on 2026-05-18.
403 survey responses from American audiences evaluating four English translation strategies for Li Bai's classical Chinese poem 'Yu Jie Yuan'. The dataset was created by Hongjing Chang and last updated in May 2026. Quantitative and qualitative data were collected via an online questionnaire on the Prolific and Qualtrics platforms.
403 American participants recruited via Prolific evaluated four English translations of Li Bai's "Yu Jie Yuan" using an adapted Reader Response Questionnaire. The dataset, created by Hongjing Chang and last updated in May 2026, contains quantitative and qualitative responses analyzed with t-tests, ANOVA, and thematic analysis. Results indicated a preference for rhymed translation, offering empirical support for translation strategies in cross-cultural communication.
211 working parents in dual-earner families provided survey data during the COVID-19 pandemic. The dataset, created by Selda Coşkuner Aktaş and last updated in May 2026, examines relationships between time-based spousal support, self-efficacy, time demands, and work-family conflict. It likely contains variables derived from the survey questions described in the study.
52 patients diagnosed with Blastocystis spp. infection were studied at the Central University of Venezuela. Clinical and epidemiological data were collected, and diagnosis involved direct stool examination, culture, morphology evaluation, parasite load, and stool consistency. DNA was extracted for PCR amplification of the 18S rRNA gene and subtyping via RFLP.
New York State's Division of Mineral Resources provides data on active and historical mining permits. The dataset includes details on permit status, location, acreage, commodities, inspection dates, and financial security. Columns suggest it tracks the lifecycle of mining operations from initial permit to reclamation.
A research document describing an in vitro study investigating the HSP90 inhibitor onalespib as a combination therapy to overcome cisplatin resistance. The document, authored by Anja Charlotte Lundgren Mortensen and last updated in May 2026, details methods assessing cell viability, proliferation, migration, apoptosis, and DNA damage response in ovarian and head and neck cancer cell models. Results indicate onalespib enhances cisplatin efficacy in a dose-dependent manner.
Over 3,000 sediment samples from Geoscience Australia's MARS database provide a regional synthesis of the non-reefal seabed, which comprises 95% of the Great Barrier Reef Marine Park area. This dataset combines surface sediment data with geomorphic features to characterize seabed habitats, updating models since the 1960s-1980s. It reveals regional trends and local-scale sedimentary patterns, including the patchy distribution of sand and concentrations of gravel and mud.
Tagging data for 22,573 North Sea cod (1961-2015) and 53,489 plaice (1957-2005) includes information on fish location, maturation, sea surface temperature, length, weight, and tagging/recapture times. The dataset was generated for the EU H2020 Pandora project and includes both conventional marker-ID tags and, for cod from 1999 onward, electronic Data Storage Tags (DSTs). It supports analysis of how environmental factors like temperature affect fish fertility and population dynamics.
Simulation study comparing statistical methods for analyzing time-to-event endpoints when treatment effects are delayed, violating proportional hazards assumptions. Evaluates type I error and power of weighted logrank tests, combinations, and regression-based alternatives.
Records of land invasion and property usurpation crimes in Colombia, based on the country's Penal Code (Law 599 of 2000). The dataset includes columns for location, date, and description of the criminal conduct. It is published by www.datos.gov.co and was last updated on May 19, 2026.
A single, biopsy-proven case report of a young male patient with combined central and peripheral demyelination (CCPD). The report details a rare presentation where severe optic neuritis was the initial manifestation, and the patient tested positive for anti-neurofascin 155 (anti-NF155) antibodies. Authored by Yujing Peng and shared under a CC-BY-4.0 license on figshare in May 2026, the document is 1013.7 KB in size.
A research document from figshare authored by Ce Cao, last updated in May 2026. It details a study investigating the pro-regenerative effects of Icariin (ICA) on endothelial cells in a rat model of heart failure. The work employs bioinformatics, molecular docking, and experimental validation to propose a novel signaling mechanism.
Outdoor measurement data from Golden, Colorado, USA, for a four-terminal gallium arsenide/silicon tandem solar mini-module deployed from October 2019 to January 2021. The dataset includes current-voltage characteristics, spectral irradiance, meteorological data, and pre-deployment lab characterization. It was produced by the Department of Energy to support a published performance modeling and degradation analysis framework.
October 2019 to January 2021 outdoor measurement data for a four-terminal gallium arsenide/silicon tandem solar mini-module deployed in Golden, Colorado. The dataset includes current-voltage characteristics, spectral irradiance, meteorological data, and pre-deployment lab characterization. It was created by the Department of Energy to support performance modeling and degradation analysis for tandem photovoltaic devices.
Geoscience Australia maintains the Australian Marine Spatial Information System (AMSIS), a web-based interactive mapping and decision support tool. AMSIS integrates curated data from government, state, and academic sources to visualize competing interests in Australia's marine jurisdiction. The system contains many layers of information displayed in themes including Maritime Boundaries, Petroleum, Fisheries, Environment, Native Title, and Regulation.
Five newly described fungal species from the Hydnellum genus, collected from the Changbai and Dabie Mountains in China. The data includes morphological characteristics, species distribution, host information, and phylogenetic analysis files. The dataset was authored by Yonglan Tuo and last updated in April 2026.