Available on 1 platform
SADA (Saudi Audio Dataset for Arabic) is a large-scale Arabic speech corpus designed to support AI model development for Arabic speech processing. It contains over 667 hours of transcribed Arabic audio recordings, primarily featuring various Saudi dialects, and was curated in a collaboration involving the National Center for Artificial Intelligence. The dataset was last updated on the platform in May 2025.
The license is unknown; users must verify the licensing terms before use.