Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
The customer_service_persian_diarization_dataset is a synthetic multi-speaker speech dataset designed for training and evaluating speaker diarization models in Persian (Farsi). It contains approximately 80 hours of audio, built using utterances from a customer service dataset and processed through a synthesis framework to simulate realistic conversational dynamics. The dataset was created by atiyehghm and was last updated on the platform in February 2026.
License is unknown; terms of use must be verified before application.