Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
CORAA v1.1 contains 290.77 hours of Brazilian Portuguese audio with transcriptions, segmented into over 400,000 audio files. The dataset is compiled from five distinct speech projects, including academic recordings and TEDx talks, and is validated for automatic speech recognition research.
Full dataset description, including detailed validation methods and license information, is only available on the external Hugging Face dataset page.