Available on 1 platform
A metadata-only reference dataset of 1,000 test samples for benchmarking Thai speech recognition. Created by typhoon-ai, it provides audio IDs and human transcriptions derived from the Gigaspeech2 corpus and does not include the audio itself. Each audio_id links to the original Gigaspeech2 dataset, from which the corresponding audio file must be retrieved. Last updated 2026-05-18.
License is unknown; users must verify terms before use.
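
Because the dataset carries only metadata, a benchmark run first has to pair each audio_id with an audio file obtained separately from Gigaspeech2. The following is a minimal sketch of that pairing step, not the dataset's official loader. It assumes each row is a dict with an `audio_id` key and a `transcription` key (the transcription field name is a guess), and that local Gigaspeech2 audio files are named after their audio_id with a `.wav` extension (also an assumption). The helper names `index_audio` and `pair_samples` are hypothetical.

```python
from pathlib import Path


def index_audio(root: Path, suffix: str = ".wav") -> dict[str, Path]:
    """Map file stem -> path for every audio file under root.

    Assumption: the file stem equals the audio_id used in the metadata.
    """
    return {p.stem: p for p in root.rglob(f"*{suffix}")}


def pair_samples(rows, audio_index):
    """Pair metadata rows with local audio files.

    Returns (matched, missing): matched is a list of (path, transcription)
    tuples, missing is a list of audio_ids with no local file.
    """
    matched, missing = [], []
    for row in rows:
        path = audio_index.get(row["audio_id"])
        if path is None:
            missing.append(row["audio_id"])
        else:
            # "transcription" is an assumed field name, not confirmed by the source.
            matched.append((path, row["transcription"]))
    return matched, missing
```

In use, `index_audio(Path("gigaspeech2_audio"))` would build the lookup from a local copy of the corpus (the directory name is a placeholder), and `pair_samples(rows, index)` would report which of the 1,000 samples still lack audio before any evaluation starts.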