Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Mobile3M consists of approximately 1,000 image-based records captured from Android Cuttlefish Emulators for pre-training Mobile Vision Language Models (MobileVLM). Released by Xiaomi Corporation in late 2024, the data supports research into mobile-specific vision-language tasks and UI interaction.
The dataset is licensed under CC BY-NC-SA 4.0, which prohibits commercial use. It is specifically designed for use with the MobileVLM framework.