Kristina Kobrock created a dataset modeling pragmatic mechanisms in referential communication. The dataset likely contains lexicon sizes or related metrics from computational simulations of language emergence. It was last updated on 2026-05-26.
Use Cases
- Analyzing efficiency tradeoffs between speaker and listener utilities based on the described simulation framework.
- Studying the impact of shared context on language ambiguity and informativeness as described in the model.
- Comparing lexicon characteristics between languages that emerged with and without contextual information during training.
- Investigating the role of utility-based pragmatics, modeled with the Rational Speech Acts framework, on linguistic efficiency.
Strengths
- Dataset is openly licensed under CC-BY-4.0.
- The underlying research investigates specific factors (context-based and utility-based reasoning) in language emergence.
- File size is 5.5 KB, indicating a focused and manageable dataset.
Limitations
- Row count is unknown, which may limit suitability assessment.
- Column-level documentation is absent; field semantics must be inferred after download.
- The dataset's small size (5.5 KB) suggests limited scope and likely contains only summary metrics rather than detailed interaction logs.
Provenance
- Source
- figshare
- Collection Method
- Likely generated from computational simulations of multi-agent communication.
- Freshness
- Last updated 2026-05-26 17:46:45; freshness should be verified.