Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A test audio dataset for the ADLIB language-aware ASR benchmark framework for Japanese. It contains 247 test cases with audio from 3 speakers, focusing on the DevTerm (software development terminology) domain. Reference transcripts and term annotations are provided in a separate JSONL file within the project's GitHub repository.
The dataset's audio files and reference data (test_cases.jsonl) are stored separately; full usage requires cloning the GitHub repository and following installation steps.