KitaKo Multimodal Dataset contains 110,000 images paired with 548,000 parallel captions. The captions are provided in three languages: English, Filipino, and Taglish. The dataset's author, organization, and last update date are unknown.
Use Cases
- Train image captioning models based on the 548,000 parallel captions.
- Develop multilingual NLP systems leveraging the English, Filipino, and Taglish text.
- Benchmark machine translation performance between English and Filipino/Taglish.
- Analyze linguistic features of Taglish, a hybrid language, using the provided captions.
Strengths
- 110,000 images provide a substantial visual corpus.
- 548,000 parallel captions offer a significant amount of aligned text data.
- Captions are provided in three distinct languages: English, Filipino, and Taglish.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Last update date is unknown; freshness unverified.
- Row count is unknown, which may limit suitability assessment.