Sign in to view source links and access this dataset
Description
7,418 professionally curated samples link film clips with high-quality music, visual descriptions, and main melodies. Proposed in the FilmComposer project, this dataset aims to advance research in music production and video-to-music generation. Author apple-jun uploaded it to Hugging Face on April 27, 2025.
Use Cases
Train video-to-music generation models based on the paired film clips and music.
Develop models for automatic music description based on the provided music description text.
Research cross-modal alignment between visual scenes and musical themes using the visual and music descriptions.
Generate or analyze main melodies for film scenes based on the included main melody data.
Strengths
Contains approximately 7,418 samples, providing a substantial corpus for model training.
Each sample includes multiple aligned modalities: film clip, music, visual description, and music description.
Described as professional, versatile, and high-quality, suggesting a focus on production-grade content.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is approximate ('about 7,418'), and exact size, file formats, and sample data are unknown.
License terms are unclear and require contacting an individual email address for use.
Provenance
Source
Hugging Face dataset uploaded by author apple-jun.
Collection Method
Likely curated for the FilmComposer project; specific gathering method is unknown.
Time Range
null
Freshness
Last updated 2025-04-27 14:46:21; freshness should be verified.
Geography
null
License terms are not standard; users must fill out a 'Terms of Use' and email [email protected].