DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

ATC ASR: 1,000-10,000 Utterance Pairs for Aviation Speech Recognition | DataSalon

Home Speech & AudioATC ASR: 1,000-10,000 Utterance Pairs for Aviation Speech Recognition

Speech & Audio

ATC ASR: 1,000-10,000 Utterance Pairs for Aviation Speech Recognition

Name: ATC ASR: 1,000-10,000 Utterance Pairs for Aviation Speech Recognition
Creator: jacktol
Published: 2025-05-28T17:58:36
Keywords: Size Categories1 Kn10 K, Librarypolars, Librarydask, Modalityaudio, Modalitytext, Librarymlcroissant, Librarydatasets, Parquet, Regionus

by jacktol·Updated 9mo ago

Available on 1 platform

Description

Comprising between 1,000 and 10,000 audio-transcript pairs for Air Traffic Control speech recognition, compiled by user jacktol in 2025. It merges the UWB ATC Corpus and the ATCO2 1-Hour Test Subset into a fine-tuning-ready format. The records consist of cleanly segmented 16kHz .wav files paired with text utterances.

Use Cases

Fine-tuning ASR models for aviation-specific terminology and callsigns
Evaluating speech-to-text accuracy on 16kHz ATC audio segments
Training acoustic models to handle high-noise cockpit and tower communication environments

Strengths

Derived from two real-world ATC corpora (UWB and ATCO2)
Cleanly segmented at the utterance level for immediate fine-tuning
Standardized 16kHz audio sampling rate across all files

Limitations

Small scale (1K-10K records) compared to general-purpose ASR datasets
Domain-specific vocabulary is restricted to aviation and air traffic control terminology
Potential for high background noise or radio interference typical of ATC environments

Provenance

Source: UWB ATC Corpus and ATCO2 1-Hour Test Subset
Collection Method: Curated and segmented from existing real-world ATC corpora
Freshness: Last updated August 2025.

The dataset is provided in Parquet format and is compatible with the Hugging Face datasets library, Polars, and Dask.

Parquet Size Categories1 Kn10 K Librarypolars Librarydask Modalityaudio Modalitytext Librarymlcroissant Librarydatasets Regionus

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

541 downloads

11 likes

0 views

Dataset Info

Author: jacktol
Created: May 28, 2025
Updated: Aug 25, 2025
Last synced: Jun 8, 2026

Access

Community

541 downloads

11 likes

0 views

Dataset Info

Author: jacktol
Created: May 28, 2025
Updated: Aug 25, 2025
Last synced: Jun 8, 2026

ATC ASR: 1,000-10,000 Utterance Pairs for Aviation Speech Recognition

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info