Speech-to-Text Transcription Dataset with Acoustic Features

Available on 1 platform

Description

A multimodal dataset for speech recognition tasks. According to its listing, it contains acoustic features relevant to speech-to-text transcription. Its origin, size, and temporal coverage are not documented.

Use Cases

Train automatic speech recognition models, in line with the dataset's speech-to-text transcription focus.
Analyze acoustic features for speech signal processing, since the description indicates such features are included.
Develop multimodal AI systems that integrate audio with other modalities, as suggested by the 'multimodal' label.
Benchmark speech recognition algorithms, in line with the dataset's stated purpose.
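
For the benchmarking use case, word error rate (WER) is a common metric for comparing a model's transcript against a reference. The sketch below is a minimal illustration, not something the listing prescribes: the function name `word_error_rate` is hypothetical, whitespace tokenization is an assumption, and real evaluations usually normalize case and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()  # assumption: whitespace tokenization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

A value of 0.0 means the hypothesis matches the reference word for word. Values can exceed 1.0 when the hypothesis contains many inserted words.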

Strengths

Focuses on a core AI task: speech-to-text transcription.
Includes multimodal and acoustic feature data, which suggests a richer representation than raw audio alone.

Limitations

Row count is unknown, which makes it hard to assess suitability before download.
Column-level documentation is absent; field semantics must be inferred after download.
Last update date is unknown, so freshness cannot be verified.
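
Because row count and column semantics are undocumented, a first pass after download could summarize whatever columns are present. The sketch below assumes the data arrives as a CSV file with a header row, which the listing does not confirm. The function name `summarize_columns` and the sample column names (`utt_id`, `transcript`, `mfcc_0`) are hypothetical.

```python
import csv
import io


def summarize_columns(handle, sample_size=1000):
    """Count non-empty and numeric values per column in a CSV sample."""
    reader = csv.DictReader(handle)
    summary = {
        name: {"non_empty": 0, "numeric": 0, "example": None}
        for name in (reader.fieldnames or [])
    }
    for index, row in enumerate(reader):
        if index >= sample_size:
            break
        for name, value in row.items():
            # Skip overflow fields (key None) and missing or empty values.
            if name not in summary or not value or not value.strip():
                continue
            stats = summary[name]
            stats["non_empty"] += 1
            if stats["example"] is None:
                stats["example"] = value
            try:
                float(value)
                stats["numeric"] += 1
            except ValueError:
                pass
    return summary


# Hypothetical sample; the real file layout is unknown.
sample = io.StringIO("utt_id,transcript,mfcc_0\nu1,hello world,-12.5\nu2,,3.0\n")
for column, stats in summarize_columns(sample).items():
    print(column, stats)
```

Columns whose values are mostly numeric are candidates for acoustic features, while free-text columns are candidates for transcripts. Either guess still needs checking against the actual data.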

Related Topics

Audio, Multimodal, Acoustic Features, Speech Recognition

Related Datasets

Quality Score

D19

Description: 18
Source: 17
Reputation: 18
Access: 31

Community

0 views

Dataset Info

Last synced: May 1, 2026
