Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A bilingual dataset combining equal numbers of Welsh and English speech recordings from Common Voice version 18. The Welsh recordings were sourced from the train_all and other_with_excluded splits of the Welsh Common Voice corpus. An equal number of recordings were taken from the official English Common Voice version 18 training set, prioritizing those tagged with a British Isles accent. The dataset was created by author 'techiaith' and last updated on 2024-11 06.
License information is unknown and should be verified on the dataset page.