Speech Commands

Name: Speech Commands
Creator: google
Published: 2022-03-02T23:29:22
Keywords: Source Datasetsoriginal, Language Creatorscrowdsourced, Languageen, Size Categories100 Kn1 M, Task Categoriesaudio Classification, Licensecc By 40, Regionus, Task Idskeyword Spotting, Multilingualitymonolingual, Annotations Creatorsother, Arxiv180403209

by googleUpdated 2y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

64,727 one-second .wav audio files containing 30 to 35 distinct spoken English words and background noise. The collection includes ten core directional and action commands alongside auxiliary words and a dedicated _silence_ class for noise simulation.

Use Cases

Train a keyword spotting model to recognize specific directional inputs using the core command labels and the audio data
Develop an out-of-vocabulary detection system by leveraging the is_unknown feature to classify non-command words
Build a noise-resilient speech classifier by incorporating the _silence_ class recordings into the training pipeline

Strengths

64,727 audio files in .wav format, each exactly one second long
Includes 10 core command labels such as 'Yes', 'No', 'Up', 'Down', 'Left', 'Right', 'On', 'Off', 'Stop', and 'Go'
Features an is_unknown boolean flag to differentiate between primary commands and auxiliary words like 'Bed', 'Bird', or 'Marvin'
Contains a _silence_ class consisting of environmental recordings and mathematical noise simulations

Source Datasetsoriginal Language Creatorscrowdsourced Languageen Size Categories100 Kn1 M Task Categoriesaudio Classification Licensecc By 40 Regionus Task Idskeyword Spotting Multilingualitymonolingual Annotations Creatorsother Arxiv180403209

Related Datasets

Quality Score

D38

Description

49

Source

36

Reputation

33

Access

22

Community

2.8K downloads

59 likes

0 views

Dataset Info

Author: google
Created: Mar 2, 2022
Updated: Jan 18, 2024
Last synced: Jun 3, 2026

Access

22

Community

2.8K downloads

59 likes

0 views

Dataset Info

Author: google
Created: Mar 2, 2022
Updated: Jan 18, 2024
Last synced: Jun 3, 2026

Speech Commands

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info