Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
180 rights-cleared short scripted clips form a benchmark for subtitle translation and localization workflows. The package contains 540 timestamped source subtitle segments, 1,080 aligned translation rows, and 540 SRT files across English, Spanish, and Chinese (Simplified). Authored by imaz regi for the AI Translate Video project, this synthetic dataset was last updated in May 2026.
License is CC-BY-4.0. The dataset contains no third-party video or audio assets, only text and subtitle files.