SeaDoc is a challenging visual document retrieval dataset for Southeast Asian languages, introduced in the paper 'Scaling Language-Centric Omnimodal Representation Learning'. It is designed to evaluate and enhance language-centric omnimodal embedding frameworks in low-resource settings. The dataset is hosted by LCO-Embedding and was last updated on 2026-02-06.
The license is unknown; verify the terms of use before using the dataset.