Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,908 datasets
An open-source collection of 1,093 fully diacritized Arabic speech recordings, crowd-sourced from native speakers via Nahw.ai. The dataset contains audio recordings resampled to 16 kHz paired with their fully diacritized transcriptions. It was created by NahwAI and last updated on 2026-04-21.
A collection of conversational turns with audio recordings and transcripts. The dataset includes columns for conversation identifiers, speaker agents, prompts sent to a Gemini Live model, spoken transcripts, and audio durations. It was created by ShiniChien and last updated on May 18, 2026.