VideoMind-SFT contains 481,000 video-annotation pairs and a 210,000-record Grounder subset, released by yeliudev in early 2026. Videos are provided in two versions: the original files, and a compressed copy at 3 FPS and 480p with the audio removed, intended for efficient model training.
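The dataset description does not say how the compressed copies were produced. As a rough illustration only, the sketch below shows one way a video could be brought to a comparable target (3 FPS, 480p, no audio) with the `ffmpeg` command-line tool. The function names `build_compress_cmd` and `compress` are hypothetical, and reading "480p" as a frame height of 480 pixels with the aspect ratio preserved is an assumption.

```python
import subprocess


def build_compress_cmd(src, dst, fps=3, height=480):
    """Build an ffmpeg argument list for a low-FPS, 480p, audio-free copy.

    Hypothetical helper: the dataset's real preprocessing is not documented
    in the source, so these settings are assumptions.
    """
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it exists
        "-i", str(src),       # input video
        # Resample to `fps` frames per second, then scale to the target
        # height; -2 lets ffmpeg pick an even width that keeps the aspect ratio.
        "-vf", f"fps={fps},scale=-2:{height}",
        "-an",                # drop all audio streams
        str(dst),
    ]


def compress(src, dst, fps=3, height=480):
    """Run ffmpeg with the command above; raises if ffmpeg exits non-zero."""
    subprocess.run(build_compress_cmd(src, dst, fps, height), check=True)
```

Calling `compress("input.mp4", "output.mp4")` requires `ffmpeg` to be installed and on the `PATH`; `build_compress_cmd` alone can be used to inspect the command without running it.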
The dataset is licensed under the BSD 3-Clause License.