Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Japanese audio data contains 266 hours of speech processed by Scribe v1 for automatic speech recognition and classified using Facebook's audio aesthetics model as a prefilter. The dataset is derived from the Japanese portion of the Emilia Yodas collection and is licensed under CC BY 4.0. It includes text transcriptions and aesthetic scores for audio events.
The creator notes the dataset is at a 'v1' stage and invites collaboration via a provided Discord link. Full transaction timestamps from Scribe v1 are available under a CC BY 4.0 NC license from a separate location.