Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A synthetic medical speech dataset contains 101,475 audio-text pairs totaling 184.1 hours of 16 kHz mono speech. It was generated by IntelMedica using the Kokoro-82M TTS system with 19 voices across three English accent groups, focusing on clinical and nursing terminology. The dataset version was noted in April 2026.
License details are unknown and should be verified before use. It is a companion to a larger v1 dataset with different scope. Full description requires visiting the Hugging Face dataset page.