DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

YODAS: 369,510 Hours of YouTube Speech and Captions | DataSalon

Home Speech & AudioYODAS: 369,510 Hours of YouTube Speech and Captions

Speech & Audio

YODAS: 369,510 Hours of YouTube Speech and Captions

Name: YODAS: 369,510 Hours of YouTube Speech and Captions
Creator: espnet
Published: 2024-02-10T21:00:10
Keywords: Arxiv240600899, Regionus

by espnet·Updated 2y ago

Available on 1 platform

Description

369,510 hours of speech audio and text captions sourced from YouTube, released by the espnet team in 2024. The dataset pairs audio utterances with either user-uploaded (manual) or system-generated (automatic) captions.

Use Cases

Training automatic speech recognition (ASR) models using audio utterances and caption pairs
Benchmarking speech-to-text alignment using the manual vs. automatic caption subsets
Large-scale acoustic pre-training for speech foundation models

Strengths

369,510 hours of speech data
Includes both manual and automatic caption subsets
CC BY 3.0 license for open usage

Limitations

Manual captions are user-uploaded and may not be human-verified
Audio quality is subject to YouTube compression and varied recording environments
Specific column names and file formats are not detailed in the primary metadata

Provenance

Source: YouTube via espnet
Collection Method: Scraped and extracted from YouTube
Freshness: Last updated June 2024.
Geography: Global

A newer version, YODAS2, is available which provides unsegmented audio and a higher sampling rate of 24k. Users should be aware that 'manual' captions only indicate user-upload status, not necessarily human transcription.

Arxiv240600899 Regionus

Related Datasets

Quality Score

D40

Description

Source

Reputation

Quality Score

D40

Description

Source

Reputation

Access

Community

86.1K downloads

137 likes

0 views

Dataset Info

Author: espnet
Created: Feb 10, 2024
Updated: Jun 10, 2024
Last synced: Jun 3, 2026

Access

Community

86.1K downloads

137 likes

0 views

Dataset Info

Author: espnet
Created: Feb 10, 2024
Updated: Jun 10, 2024
Last synced: Jun 3, 2026

YODAS: 369,510 Hours of YouTube Speech and Captions

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info