Available on 1 platform
This dataset contains 48,078 audio-text pairs totaling 89.63 hours of Hindi speech, collected from YouTube and transcribed using auto-generated captions. It is heavily skewed toward a single speaker, who contributes 76.9% of the content. Created by user somu9 and last updated on June 8, 2026.
The license is unknown, so commercial use may be restricted.
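
The headline figures in the description imply an average clip length and a rough per-speaker split, which can help when judging whether the data suits a given speech task. The following is a minimal sketch of that arithmetic only. It assumes the 76.9% share is measured by audio duration (the listing does not say whether it counts duration or clips), and the function and constant names are hypothetical, not part of the dataset.

```python
# Figures taken from the dataset description.
NUM_PAIRS = 48_078
TOTAL_HOURS = 89.63
# Assumption: the 76.9% share is by audio duration, not by clip count.
TOP_SPEAKER_SHARE = 0.769


def mean_clip_seconds(total_hours: float, num_pairs: int) -> float:
    """Average clip length in seconds."""
    return total_hours * 3600 / num_pairs


def split_hours(total_hours: float, top_share: float) -> tuple[float, float]:
    """Hours from the dominant speaker and from all other speakers."""
    top = total_hours * top_share
    return top, total_hours - top


avg = mean_clip_seconds(TOTAL_HOURS, NUM_PAIRS)
top_hours, rest_hours = split_hours(TOTAL_HOURS, TOP_SPEAKER_SHARE)

print(f"mean clip length: {avg:.2f} s")
print(f"dominant speaker: {top_hours:.1f} h, all others: {rest_hours:.1f} h")
```

Under that assumption, clips average about 6.7 seconds, and roughly 20.7 of the 89.63 hours come from speakers other than the dominant one.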