Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,759 datasets
A small dataset of 5.5 KB contains test accuracy results from a computational modeling study of pragmatic language emergence. The study, authored by Kristina Kobrock and shared on figshare in May 2026, models context-based and utility-based pragmatic reasoning in a multi-agent framework. Results show that languages emerging in a shared context become more efficient and ambiguous.
Kristina Kobrock created a dataset modeling pragmatic mechanisms in referential communication. The dataset likely contains lexicon sizes or related metrics from computational simulations of language emergence. It was last updated on 2026-05-26.
A 9.5 KB Excel file published by Kristina Kobrock on figshare in May 2026. The dataset likely contains results from a computational model simulating pragmatic mechanisms in referential communication and categorization. It models the efficiency tradeoff between speaker and listener utilities in a multi-agent framework of emergent communication.
138.1 KB of legacy citation and reference records connected to the Samuel & Audrey Media Network. The dataset includes records for public academic citations, media mentions, tourism-sector references, public profiles, and interviews. It is a legacy version archived by Samuel Jeffery on figshare in May 2026.
Primers and probes used in a molecular study of Pteropine orthoreovirus (PRV) virulence determinants. The dataset was created by Hayato Harima and published on figshare under a CC-BY-4.0 license on May 26, 2026. It contains oligonucleotide sequences designed for analyzing the Nachunsulwe-57 and Miyazaki-Bali/2007 strains.
Hayato Harima's dataset on figshare contains experimental data from a study of Pteropine orthoreovirus virulence. The data likely includes measurements from recombinant monoreassortant viruses derived from strains Nachunsulwe-57 and Miyazaki-Bali/2007, tested in laboratory mice. The dataset was last updated on 2026-05-26 and is licensed under CC-BY-4.0.
A research dataset from figshare presents experimental and simulation results for chemically propelled micromotors. The data, authored by Jinwei Lin and last updated in May 2026, demonstrates a structural encoding strategy enabling programmable, time-variable motion in polystyrene–Au–Pt Janus micromotors. The 95.9 MB ZIP file likely contains data on motion speed, orientation, and transition times controlled by layer parameters.
Colombia's hydrographic zoning units for environmental land-use planning, updated to a 1:100,000 scale for the 2022 National Water Study (ENA2022). The dataset was created by IDEAM based on HydroSHEDS data and 1:100,000 scale base cartography from IGAC in 2016. It represents the analysis units used in the ENA2022, excluding the marine-coastal areas from the 2013 version due to a lack of official data at the target scale.
December 2018 land suitability assessment for commercial pea (Pisum sativum L.) cultivation in the Nariño department of Colombia. The map results from applying a methodology that integrates biophysical, socioeconomic, and socio-ecosystem components under contract Interadministrativo No 222 of 2017 between UPRA and the University of Nariño. It classifies zones into high, medium, low, unsuitable, and legally excluded categories for agricultural planning.
A land suitability map for commercial bean (Phaseolus vulgaris) cultivation in the Nariño department of Colombia, produced in December 2018. The dataset results from a land evaluation methodology applied at a 1:100,000 scale, incorporating biophysical, socioeconomic, and socio-ecosystem components. It was developed under contract No. 222 of 2017 between UPRA and the University of Nariño.
A land suitability map for commercial Kikuyo grass (Pennisetum clandestinum) cultivation in the Nariño department of Colombia, produced at a 1:100,000 scale. The dataset results from a 2017 inter-administrative contract between UPRA and the University of Nariño, validated in December 2018. It classifies areas into High, Medium, Low, and Unsuitable aptitude categories based on biophysical, socioeconomic, and socio-ecosystem components.
A land suitability assessment map for climbing bean (Phaseolus vulgaris L.) cultivation in the Cundinamarca department of Colombia. The dataset results from a methodology applied under contract 268 of 2016, integrating biophysical, socioeconomic, and socio-ecosystem components at a 1:100,000 scale. It categorizes land into high, medium, low, unsuitable, and legally excluded zones for commercial bean production during the first agricultural semester.
A re-evaluation of the armoured dinosaur Pinacosaurus based on dozens of new specimens. The dataset supports a cladistic analysis distinguishing species like P. grangeri and P. hilwitnorum. It was authored by Paul Penkalski and last updated on 2026-05-12.
A land suitability map for commercial Dominico Hartón banana cultivation in the Cundinamarca department of Colombia at a 1:100,000 scale. The dataset results from a 2016 inter-administrative contract between UPRA and the Cundinamarca Governor's Office, incorporating biophysical, socioeconomic, and socio-ecosystem components. It classifies areas into high, medium, low, and non-suitable aptitude categories, as well as legal exclusions.
December 2018 land suitability assessment for commercial soybean (Glycine max L.) cultivation in the Meta department of Colombia, produced by the UPRA and the Governor's Office of Meta. The dataset contains a general-scale (1:100,000) zoning map based on biophysical, socioeconomic, and socio-ecosystem components. It classifies areas into suitability categories from 'High Aptitude' to 'Legal Exclusion' zones.
A land suitability map at a 1:100,000 scale for commercial Valencia orange (Citrus sinensis L) cultivation in the Cesar department of Colombia. The dataset categorizes areas into high, medium, low, unsuitable, and legally excluded aptitude based on physical, socio-ecosystem, and socio-economic criteria. It was last updated on 2026-05-18 and originates from the Colombian open data portal www.datos.gov.co.
A land suitability map for Keitt mango cultivation in Colombia's Cesar Department, produced using a 1:100,000 scale assessment methodology. The dataset categorizes land into high, medium, low, and unsuitable aptitude zones based on physical, socio-ecosystem, and socio-economic criteria. It was last updated on 2026-05-18.
Benchmark calculations for a cluster-in-molecule double-hybrid density functional approach (CIM-DHDF) developed to enable efficient calculations for large molecular systems. The dataset likely contains statistical error metrics from performance tests, including reaction barrier heights for 6 large hydrogen-transfer reactions containing up to 1443 atoms. The data was uploaded by Zhigang Ni to figshare on 2026-05-05.
A 2018 land evaluation map for commercial avocado production in Colombia's Nariño Department, created at a 1:100,000 scale. The dataset results from a 2017 inter-administrative contract between UPRA and the University of Nariño, incorporating biophysical, socioeconomic, and socio-ecosystem components. It classifies land into five aptitude categories, from 'High' to 'Legal Exclusion' zones.
A collection of 1,035 scientific publications from the Web of Science Core Collection, focused on active intervention in inner ear hair cell repair and regeneration. The dataset was compiled by Xiaoyi Zhang using a specific search query and exclusion criteria, covering publications from 2011 to 2025. It is available under a CC-BY-4.0 license and was last updated in May 2026.