Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Neapolitan-Spoken-Corpus (NSC) is the first publicly available speech corpus for benchmarking Automatic Speech Recognition systems on the Neapolitan dialect. It includes 141 sentence-level audio recordings with gold-standard orthographic transcriptions. The dataset was created by anonymous-nsc-author to address the lack of computational resources for dialectological research.
License is listed as 'cc By Nc 40' on the platform, indicating a Creative Commons Attribution-NonCommercial 4.0 license.