Kartmaan's French Dictionary is derived from the French Wiktionary, containing nearly 900,000 distinct word forms. It provides structured definitions, usage examples, and linguistic metadata, formatted for both SQLite and Parquet applications.
Use Cases
- Train language models on structured definitions and usage examples for semantic understanding tasks.
- Analyze linguistic metadata to study patterns in conjugated verb forms and other word inflections.
- Build an offline query tool using the SQLite format to look up definitions and examples for nearly 900,000 words.
Strengths
- Contains nearly 900,000 distinct word forms, providing broad lexical coverage.
- Includes structured definitions and usage examples for each entry.
- Available in both SQLite and Parquet formats to support different application pipelines.
Limitations
- The dataset's size and specific column structure are not detailed in the input.
- Potential limitations regarding the completeness or update frequency of the source Wiktionary are unknown.
Provenance
- Source
- French Wiktionary
- Collection Method
- Derived from the French Wiktionary project.
- Time Range
- null
- Freshness
- null
- Geography
- null