DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

VoxCeleb1 Short Utterances for Speaker Recognition | DataSalon

Home Speech & AudioVoxCeleb1 Short Utterances for Speaker Recognition

Speech & Audio

VoxCeleb1 Short Utterances for Speaker Recognition

Name: VoxCeleb1 Short Utterances for Speaker Recognition
Creator: s3prl
Published: 2022-07-11T12:21:08
Keywords: Speaker Verification, Audio Classification, Speech Processing, Audio, Regionus, Celebrity Voices

by s3prl·Updated 3y ago

Available on 1 platform

Description

Voxceleb1 Too Short Utts contains audio segments from the original VoxCeleb1 dataset. The dataset was created by s3prl and last updated on Hugging Face in July 2022. It focuses on utterances below a certain duration threshold.

Use Cases

Train speaker embedding models using short-duration audio utterances.
Benchmark speaker verification systems on challenging, brief speech segments.
Analyze the impact of utterance length on speaker recognition accuracy.
Develop models robust to variable-length audio inputs for real-world applications.

Strengths

Derived from the well-known VoxCeleb1 dataset containing over 100,000 utterances.
Focuses on a specific, challenging subset of short-duration speech data.

Limitations

Specific row count and audio duration thresholds are unknown.
Limited to celebrity speech data, which may not represent general population voices.
Data is several years old, with the last update in 2022.

Provenance

Source: VoxCeleb1 dataset.
Collection Method: Subset extraction of short utterances from the original VoxCeleb1 audio files.
Time Range: null
Freshness: Last updated in 2022.
Geography: null

null

Audio Speaker Verification Audio Classification Speech Processing Regionus Celebrity Voices

Related Datasets

Quality Score

D20

Description

Source

Reputation

Quality Score

D20

Description

Source

Reputation

Access

Community

11 downloads

0 views

Dataset Info

Author: s3prl
Created: Jul 11, 2022
Updated: Jul 11, 2022
Last synced: Apr 30, 2026

Access

Community

11 downloads

0 views

Dataset Info

Author: s3prl
Created: Jul 11, 2022
Updated: Jul 11, 2022
Last synced: Apr 30, 2026

VoxCeleb1 Short Utterances for Speaker Recognition

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info