Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
This audio-text dataset provides paired audio signals and descriptive captions for the first Audiocaption task, released by RicherMans in 2024. It serves as a benchmark for automated audio description systems and includes baseline code for performance evaluation.
Users should refer to the GitHub repository for the baseline implementation and data loading scripts.