DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

DisfluencySpeech: Studio-Quality Labeled Disfluent English Speech | DataSalon

Home Speech & AudioDisfluencySpeech: Studio-Quality Labeled Disfluent English Speech

Speech & Audio

DisfluencySpeech: Studio-Quality Labeled Disfluent English Speech

Name: DisfluencySpeech: Studio-Quality Labeled Disfluent English Speech
Creator: amaai-lab
Published: 2024-05-30T09:19:54
Keywords: Size Categories1 Kn10 K, Librarypolars, Librarydask, Modalityaudio, Modalitytext, Librarymlcroissant, Librarydatasets, Parquet, Regionus, Arxiv240608820, Licenseapache 20

by amaai-lab·Updated 1y ago

Available on 1 platform

Description

Nearly 10 hours of studio-quality English speech recordings from a single speaker recreate expressive utterances from the Switchboard-1 Telephone Speech Corpus. These recordings feature labeled paralanguage and disfluencies across three different data components to simulate realistic informal conversations.

Use Cases

Train expressive text-to-speech (TTS) models capable of generating natural disfluencies from text inputs
Develop prosody modeling systems using the studio-quality audio and corresponding Switchboard-derived transcripts
Evaluate speech recognition systems on their ability to handle informal, disfluent speech patterns in high-fidelity environments

Strengths

Nearly 10 hours of single-speaker studio-quality audio recordings
Derived from the Switchboard-1 Telephone Speech Corpus to capture realistic informal speech patterns
Includes labeled paralanguage and disfluency markers for expressive synthesis
Provides three different data components to support predictive synthesis of paralanguage from text

Parquet Size Categories1 Kn10 K Librarypolars Librarydask Modalityaudio Modalitytext Librarymlcroissant Librarydatasets Regionus Arxiv240608820 Licenseapache 20

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

253 downloads

19 likes

0 views

Dataset Info

Author: amaai-lab
Created: May 30, 2024
Updated: Jun 27, 2024
Last synced: May 8, 2026

Access

Community

253 downloads

19 likes

0 views

Dataset Info

Author: amaai-lab
Created: May 30, 2024
Updated: Jun 27, 2024
Last synced: May 8, 2026

DisfluencySpeech: Studio-Quality Labeled Disfluent English Speech

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info