ViMedCSS provides 24.3 hours of Vietnamese medical speech across 11,832 training utterances and was developed for the LREC 2026 conference. Each recording contains at least one English medical term embedded in Vietnamese speech, which makes the dataset suitable for code-switching automatic speech recognition (ASR).
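For a rough sense of scale, the two figures above imply an average utterance length. The sketch below assumes the 24.3 hours are spread across exactly the 11,832 utterances quoted, which the description does not state explicitly; the function name is illustrative only.

```python
# Figures taken from the dataset description.
TOTAL_HOURS = 24.3
NUM_UTTERANCES = 11_832


def mean_utterance_seconds(total_hours: float, num_utterances: int) -> float:
    """Return the average utterance duration in seconds.

    Assumption: the total duration covers exactly `num_utterances` recordings.
    """
    if num_utterances <= 0:
        raise ValueError("num_utterances must be positive")
    return total_hours * 3600.0 / num_utterances


if __name__ == "__main__":
    avg = mean_utterance_seconds(TOTAL_HOURS, NUM_UTTERANCES)
    print(f"{avg:.2f} s per utterance")  # about 7.39 s under the assumption above
```

Under that assumption the recordings are short, sentence-length clips of roughly seven seconds each.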
The dataset is licensed under CC BY 4.0 and distributed in Parquet format, so reading it requires a Parquet-capable library such as Hugging Face Datasets or Polars.
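As a minimal sketch of reading the Parquet files with Polars, the example below assumes the files have already been downloaded into a local directory. The directory layout and the helper names `find_parquet_files` and `load_frame` are hypothetical, and no column names are assumed because the schema is not described here.

```python
from pathlib import Path


def find_parquet_files(root: str) -> list[Path]:
    """Return every *.parquet file under `root`, sorted for a stable order."""
    return sorted(Path(root).rglob("*.parquet"))


def load_frame(root: str):
    """Read all Parquet files under `root` into one Polars DataFrame.

    Assumption: every file shares the same schema, so the frames can be
    stacked vertically. Polars is imported lazily so that the file-discovery
    helper stays usable without it.
    """
    import polars as pl  # third-party: pip install polars

    files = find_parquet_files(root)
    if not files:
        raise FileNotFoundError(f"no Parquet files found under {root!r}")
    return pl.concat([pl.read_parquet(f) for f in files], how="vertical")


if __name__ == "__main__":
    # "data/vimedcss" is a placeholder path, not the dataset's real location.
    frame = load_frame("data/vimedcss")
    print(frame.schema)  # inspect the actual columns before relying on any name
    print(frame.height, "rows")
```

Printing the schema first is deliberate: it avoids guessing at column names for the audio and transcript fields.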