114,036 preprocessed Indonesian speech samples totaling approximately 4GB of data. The dataset includes WAV audio recordings sampled at 16,000 Hz paired with corresponding text transcriptions.
Use Cases
- Train an Indonesian Text-to-Speech (TTS) model using the audio and text features
- Fine-tune Automatic Speech Recognition (ASR) systems by mapping the audio recordings to their text transcriptions
- Analyze phonetic distribution in Indonesian speech using the provided text labels
Strengths
- Contains 114,036 individual speech examples
- Audio files are provided in WAV format with a standardized sampling rate of 16,000 Hz
- Includes a text column containing the transcription for each corresponding audio entry