Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
21,421 cleaned Georgian speech samples totaling 35 hours were curated by NMikka from Mozilla Common Voice 19.0 in 2026. The collection features 24 kHz mono WAV audio from 12 speakers specifically filtered for speech synthesis and recognition tasks.
The dataset is released under a CC-0 license and is provided in Parquet format via Hugging Face.