DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Audio Evaluation Datasets for Speech and Understanding Tasks | DataSalon

Home Speech & AudioAudio Evaluation Datasets for Speech and Understanding Tasks

Speech & Audio

Audio Evaluation Datasets for Speech and Understanding Tasks

Name: Audio Evaluation Datasets for Speech and Understanding Tasks
Creator: XiaomiMiMo
Published: 2025-09-17T02:39:06
Keywords: Regionus, Licensemit

by XiaomiMiMo·Updated 8mo ago

Available on 1 platform

Description

A 2025 collection of multiple audio datasets compiled by XiaomiMiMo for the MiMo-Audio-Eval toolkit. It includes datasets for automatic speech recognition, text-to-speech, and audio understanding tasks such as AISHELL1, LibriSpeech, and SeedTTS.

Use Cases

Benchmark automatic speech recognition models using the AISHELL1 and LibriSpeech datasets.
Evaluate text-to-speech synthesis systems with the SeedTTS dataset.
Test audio understanding and reasoning models on the MMAU and MMSU datasets.

Strengths

Compiled by XiaomiMiMo, a major technology organization.
Includes established benchmark datasets like AISHELL1 and LibriSpeech.
Updated in September 2025, indicating recent maintenance.

Limitations

Specific dataset sizes, row counts, and column structures are unknown.
The composition and balance of the included datasets are not detailed.
File formats and data accessibility details are unspecified.

Provenance

Source: XiaomiMiMo
Collection Method: Collection of existing audio datasets for an evaluation toolkit.
Freshness: Last updated on 2025-09-18.

Users should review the full description on the Hugging Face dataset page for details on included datasets and potential usage terms.

Regionus Licensemit

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

64 downloads

7 likes

0 views

Dataset Info

Author: XiaomiMiMo
Created: Sep 17, 2025
Updated: Sep 18, 2025
Last synced: May 4, 2026

Access

Community

64 downloads

7 likes

0 views

Dataset Info

Author: XiaomiMiMo
Created: Sep 17, 2025
Updated: Sep 18, 2025
Last synced: May 4, 2026

Audio Evaluation Datasets for Speech and Understanding Tasks

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info