630 speakers from 8 major American English dialect regions each reading 10 phonetically rich sentences. The dataset includes high-quality audio recordings accompanied by time-aligned phonetic and word transcriptions for acoustic-phonetic research.
Use Cases
- Train automatic speech recognition models using the time-aligned .phn and .wrd transcription files
- Conduct dialect classification studies based on the 8 American English dialect labels assigned to speakers
- Perform acoustic-phonetic research by analyzing phoneme durations and spectral characteristics using the .phn segment boundaries
Strengths
- 630 speakers representing 8 distinct American English dialect regions
- 6,300 total utterances with 10 phonetically rich sentences per speaker
- Includes time-aligned phonetic (.phn), word (.wrd), and sentence (.txt) transcription files
- High-quality 16kHz waveform audio recordings for each utterance