Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
AISHELL8-RealScene is a multimodal dataset of conversational Mandarin speech recorded in real-world settings. It contains 102.19 hours of audio from 171 foreground speakers across 5 different locations. The dataset was created by SMIIP-lab and includes synchronized near-field and 8-channel far-field audio with multi-view facial video.
License is listed as CC BY-NC-SA 4.0 in the description, which restricts commercial use and requires share-alike terms.