Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Featuring multi-speaker, high-quality transcribed audio data for the Sinhala language, consisting of wave files and a TSV file. The data was manually quality checked and was collected by Google in Sri Lanka and contributed by the Path to Nirvana organization.
The dataset consists of wave files and a TSV file, with a 'si_lk.lines.txt' file containing FileID (which includes UserID) and Transcription. Specific file formats and sizes are unknown.