DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Hue Voice Dataset: Transcripts with MFA Alignment | DataSalon

Home Speech & AudioHue Voice Dataset: Transcripts with MFA Alignment

Speech & Audio

Hue Voice Dataset: Transcripts with MFA Alignment

Available on 1 platform

Description

A dataset of speech transcripts, likely aligned using the Montreal Forced Aligner (MFA) tool. The dataset is published on Kaggle, but details on its size, creation date, and specific source are not provided in the metadata. The title suggests it contains phonetic or word-level alignment data for audio recordings.

Use Cases

Training or evaluating automatic speech recognition (ASR) models (inferred from domain, verify after download)
Developing or testing forced alignment algorithms for audio-text synchronization (inferred from domain, verify after download)
Analyzing phonetic or prosodic features in speech data (inferred from domain, verify after download)

Strengths

Published on Kaggle, a platform with established data sharing infrastructure.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, file formats, and column definitions are unknown.
License, author, and last update date are unknown.

Provenance

Source: Kaggle
Collection Method: Likely involves forced alignment processing of audio recordings.
Time Range: null
Freshness: Last updated date is unknown; freshness unverified.
Geography: null

null

Text Audio Mfa Speech Recognition Audio Transcripts Phonetic Alignment

Related Datasets

Quality Score

D15

Description

Source

Reputation

Quality Score

D15

Description

Source

Reputation

Access

Community

0 views

Access

Community

0 views

Hue Voice Dataset: Transcripts with MFA Alignment

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Community