Pearl-vdr-ar-train-preprocessed is a dataset of culturally aligned Arabic triplets for training multimodal embedding models. Created by Omartificial-Intelligence-Space, it contains query text, image, and hard-negative samples. Samples are categorized by topic (for example, Music and Landmarks) and anchored to countries such as Algeria and Saudi Arabia. It was last updated on HuggingFace in April 2026.
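As a rough illustration of the structure described above, the sketch below models one triplet as a small record and filters records by topic and country. The class name `Triplet`, the function `filter_triplets`, every field name, and the sample rows are hypothetical placeholders chosen for illustration; the dataset's actual column names may differ and should be checked on the HuggingFace dataset page.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Triplet:
    """One training example. All field names here are assumptions, not the real schema."""
    query: str                                   # query text
    image: str                                   # ID or path of the matching (positive) image
    hard_negatives: List[str] = field(default_factory=list)  # IDs of confusable images
    topic: str = ""                              # e.g. "Music", "Landmarks"
    country: str = ""                            # e.g. "Algeria", "Saudi Arabia"


def filter_triplets(rows: List[Triplet],
                    topic: Optional[str] = None,
                    country: Optional[str] = None) -> List[Triplet]:
    """Keep rows matching the given topic and/or country; None disables that filter."""
    return [
        r for r in rows
        if (topic is None or r.topic == topic)
        and (country is None or r.country == country)
    ]


# Made-up placeholder rows, not taken from the dataset.
rows = [
    Triplet("query-1", "img-1", ["img-7", "img-9"], "Music", "Algeria"),
    Triplet("query-2", "img-2", ["img-4"], "Landmarks", "Saudi Arabia"),
    Triplet("query-3", "img-3", [], "Landmarks", "Algeria"),
]

landmarks = filter_triplets(rows, topic="Landmarks")
algerian_landmarks = filter_triplets(rows, topic="Landmarks", country="Algeria")
```

Keeping the topic and country on each record makes it straightforward to build per-country or per-topic training and evaluation subsets.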
The license is unknown, so users must verify permissions before use. For the full description, see the HuggingFace dataset page.