Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
RSVLM-SFT is a remote sensing instruction-tuning dataset released by FelixKAI in 2026 for training the MF-RSVLM vision-language model. It contains image-text pairs for modality alignment and instruction following, although the specific record count is not disclosed in the metadata.
Requires a CLIP vision encoder and Vicuna-7B LLM for implementation; licensed under Apache 2.0.