This dataset contains 6,000 multimodal question-answer pairs presented in the EchoInk-R1 research paper. It was created by harryhsing and last updated on the Hugging Face platform in May 2025. It is designed for exploring audio-visual reasoning in multimodal large language models via reinforcement learning.
The license is unknown; users must verify the terms before use. The associated training and inference code is hosted in a separate GitHub repository.