Codemix_TTS: Text-to-Speech Data for Code-Mixed Language
Available on 1 platform
Description
The Kaggle dataset 'codemix_tts' appears, judging by its title, to contain audio data for text-to-speech synthesis on code-mixed language. The available metadata does not state the number of audio samples or the languages covered. The dataset is hosted on Kaggle, but its author, organization, and last update date are unknown.
Use Cases
Training a TTS model for code-mixed language utterances (inferred from domain, verify after download)
Benchmarking speech synthesis quality on linguistically mixed input (inferred from domain, verify after download)
Studying phonetic and prosodic features in code-switched speech (inferred from domain, verify after download)
Strengths
Published on Kaggle, a platform with established data sharing infrastructure.
Limitations
Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file size are unknown, which makes it hard to assess suitability before download.
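Because file counts, formats, and sizes are undocumented, a quick inventory of the downloaded files is a reasonable first verification step. The following is a minimal sketch that assumes the dataset has already been downloaded and unpacked into a local directory. The function name summarize_dataset and the directory layout are hypothetical; nothing here reflects the dataset's actual structure.

```python
from collections import Counter
from pathlib import Path


def summarize_dataset(root):
    """Count files per extension and sum their sizes under `root`.

    `root` is assumed to be the local folder where the dataset was
    unpacked; the real layout of codemix_tts is not documented.
    """
    counts = Counter()
    total_bytes = 0
    for path in Path(root).rglob("*"):
        if path.is_file():
            # Normalize case so ".WAV" and ".wav" are counted together.
            counts[path.suffix.lower() or "<none>"] += 1
            total_bytes += path.stat().st_size
    return {"files_by_extension": dict(counts), "total_bytes": total_bytes}
```

Calling summarize_dataset on the unpacked folder returns a dictionary with per-extension file counts and the total size in bytes, which helps confirm whether audio files and any accompanying transcript files are actually present.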
Provenance
Source
Kaggle
Collection Method
Method of data gathering is unknown.
Time Range
Temporal coverage is unknown.
Freshness
Last update date is unknown; freshness unverified.
Geography
Spatial coverage is unknown.
License
License is unknown; users must verify permissions before use.