DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Long-TTS-Eval: Long-form Text-to-Speech Evaluation Benchmark | DataSalon

Home Speech & AudioLong-TTS-Eval: Long-form Text-to-Speech Evaluation Benchmark

Speech & Audio

Long-TTS-Eval: Long-form Text-to-Speech Evaluation Benchmark

Name: Long-TTS-Eval: Long-form Text-to-Speech Evaluation Benchmark
Creator: wcy1122
Published: 2025-09-29T17:16:52
Keywords: Size Categories1 Kn10 K, Task Categoriestext To Speech, Librarypolars, Languagezh, Languageen, Voice Cloning, Modalitytext, Librarymlcroissant, Speech Generation, Librarydatasets, Benchmark, Librarypandas, Parquet, Regionus, Arxiv250925131, Arxiv240318814, Long Form, Licenseapache 20, Multimodal

by wcy1122·Updated 8mo ago

Available on 1 platform

Description

This benchmark contains evaluation data for long-form Text-to-Speech (TTS) and speech-audio understanding tasks in English and Chinese. It is designed to test the capabilities of omni-modal large language models in generating personalized, long-horizon speech and interpreting complex audio signals.

Use Cases

Benchmark the accuracy of long-form TTS systems using the provided English and Chinese text inputs
Evaluate the audio understanding performance of omni-modal LLMs against the speech comprehension tasks
Test the consistency of personalized voice cloning over long-horizon speech generation

Strengths

Includes evaluation samples for both English and Chinese language processing
Focuses on long-form TTS tasks to measure performance in extended speech synthesis
Provides test cases for speech and audio understanding within the MGM-Omni framework

Multimodal Parquet Size Categories1 Kn10 K Task Categoriestext To Speech Librarypolars Languagezh Languageen Voice Cloning Modalitytext Librarymlcroissant Speech Generation Librarydatasets Benchmark Librarypandas Regionus Arxiv250925131 Arxiv240318814 Long Form Licenseapache 20

Related Datasets

Quality Score

D35

Description

Source

Reputation

Quality Score

D35

Description

Source

Reputation

Access

Community

239 downloads

10 likes

0 views

Dataset Info

Author: wcy1122
Created: Sep 29, 2025
Updated: Oct 6, 2025

Access

Community

239 downloads

10 likes

0 views

Dataset Info

Author: wcy1122
Created: Sep 29, 2025
Updated: Oct 6, 2025

Long-TTS-Eval: Long-form Text-to-Speech Evaluation Benchmark

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info