Available on 1 platform
This dataset contains 2.28 million unique audio clips, organized into streams for training audio-language models and agents. It is divided into six task-specific subsets, including streams for audio understanding and real-time automatic speech recognition. It was created by zhifeixie and last updated on June 3, 2026.
The license is unknown, which may restrict commercial use.