Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Vaja-Thai is a unified Thai speech dataset containing 289,916 audio samples totaling 554.6 hours for Text-to-Speech research. The dataset was created by dubbing-ai and last updated in April 2026. All audio is resampled to 24 kHz WAV format and combines multiple quality-filtered sources.
License for the combined dataset is unknown; individual source licenses like CC-BY-NC-SA-3.0 may apply.