85 languages are represented in a typological database of bivalent verbs and their argument encoding frames. The dataset includes spreadsheets for predicates, languages, valency patterns, and language statistics, compiled by Sergey Say at the Institute of Linguistic Studies. It likely contains data on verb argument frames, translations, and cumulative properties of language valency systems.
Use Cases
- Cross-linguistic analysis of verb argument frames based on the predicate questionnaire.
- Studying the distribution of valency patterns across languages based on the valency_patterns_main.csv file.
- Calculating statistical properties of language valency systems based on the language_stats.csv spreadsheet.
- Mapping linguistic data to genealogical information based on the languages.csv file.
Strengths
- Dataset covers 85 languages.
- Includes multiple structured files for predicates, languages, patterns, and statistics.
- Detailed documentation is available at the referenced website.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- Institute of Linguistic Studies
- Collection Method
- Data gathered via a questionnaire study of bivalent verbs.