Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Amphion released the INTP dataset in late 2024, providing 250,000 synthetic speech preference pairs totaling over 2,000 hours of audio. The collection spans English and Chinese languages across diverse scenarios including regular speech, repeated phrases, and code-switching contexts for speech intelligibility research.
Released under the CC BY-NC 4.0 license, which prohibits commercial use. Users should refer to the associated Arxiv publications for specific methodology on how the preference pairs were generated.