Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
VoxCeleb2 contains over 1 million audio-visual utterances from 6,112 celebrities, extracted from YouTube videos. This large-scale speaker identification dataset includes MP4 video files and associated metadata for training and development. It was updated in early 2026 by user Oldi451.
The dataset is distributed as a multipart archive (vox2_dev_mp4_part*) and requires 7zip (7z) for extraction. It is licensed under the MIT license.