Available on 1 platform
This dataset contains 1 hour and 30 minutes of audio clips extracted from public video footage of Xi Jinping. It is intended for fine-tuning text-to-speech models and was uploaded by KritiAI on June 8, 2025. It includes scripts for processing the audio files with Whisper and for preparing the data for the Bert-VITS2 framework.
Usage requires installing Python dependencies and using the Whisper API. The license is unknown, which may restrict how the dataset can be used.