Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
F2LLM-v2 is a dataset for training and evaluating multilingual text embedding models, created by researchers Ziyin Zhang, Zihan Liao, Hang Yu, Peng Di, and Rui Wang. The dataset page includes a citation note for a 2026 arXiv paper and a caution about potential contamination with MTEB evaluation splits. It was last updated on Hugging Face in June 2026.
License is unknown; terms of use must be verified before application.