Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Voicebench Ja contains 4 subsets created by applying speech synthesis to samples from three Japanese text benchmarks: Elyza-tasks-100, M-IFEval, and JamC-QA. The dataset was constructed by SB Intuitions using their internal TTS model and JVS corpus audio prompts to quantitatively evaluate performance gaps between audio and text inputs for language models. It was last updated on March 30, 2026.
License is unknown; the full description is truncated and requires visiting the Hugging Face page.