Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
ViSpeR is a large-scale dataset for Visual Speech Recognition (VSR) covering four widely spoken languages: Arabic, Chinese, French, and Spanish. It was created to address the scarcity of publicly available VSR data for non-English languages and is described as larger in size compared to other datasets in its domain. The dataset and models are hosted by the author 'tiiuae' and were last updated on April 17,我们发现一个错误,请关闭当前工具,通过描述错误来反馈给我们。
License is unknown, which may impose restrictions on commercial or research use.