Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MusicSem is a multimodal dataset containing 35,977 entries of paired text and audio. It includes a withheld test set of 480 entries for leaderboard evaluation. The dataset was curated by Rebecca Salganik, Teng Tu, Fei-Yueh Chen, Xiaohao Liu, Kaifeng Lu, Ethan Luvisia, Zhiyao Duan, Guillaume Salha-Galvan, Anson Kahng, Yunshan Ma, and Jian Kang.
License details are not fully specified in the provided input; the full MIT license description is on the dataset page.