MSDWild is a dataset for evaluating multimodal analysis on tasks such as multimodal speaker diarization, multimodal speaker localization, and audio-visual lip synchronization. It is hosted on Hugging Face by the author 'taocode' and was last updated on April 29, 2024. A sample can be viewed in the associated GitHub repository.
The dataset is intended solely for research purposes.