A pretraining dataset for European Portuguese, created by filtering the FineWeb2 dataset down to documents whose URLs belong to Portuguese domains. Each document is classified into one of nine categories and scored for educational quality. The dataset was created by duarteocarmo and was last updated on 2026-02-22.
The license is unknown; verify the terms of use before using the dataset.
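
The URL-based filtering step described above can be sketched roughly as follows. This is a minimal illustration, not the author's actual pipeline: it assumes that "Portuguese domains" means hostnames under the `.pt` top-level domain and that each document is a dict with a `url` field. The function names `is_portuguese_domain` and `filter_by_domain` are hypothetical.

```python
from urllib.parse import urlsplit


def is_portuguese_domain(url: str, suffix: str = ".pt") -> bool:
    """Return True if the URL's hostname ends with `suffix`.

    Assumption: a "Portuguese domain" is any hostname under the .pt TLD.
    """
    try:
        host = urlsplit(url).hostname or ""
    except ValueError:
        # Malformed URLs (e.g. a broken IPv6 literal) are treated as non-matching.
        return False
    # Drop a trailing dot from fully qualified hostnames before comparing.
    return host.rstrip(".").endswith(suffix)


def filter_by_domain(docs, url_field: str = "url"):
    """Keep only documents whose URL passes the domain check.

    `url_field` is an assumed column name, not one confirmed by the dataset.
    """
    return [d for d in docs if is_portuguese_domain(d.get(url_field, ""))]


# Toy documents for illustration only.
docs = [
    {"url": "https://www.example.pt/artigo", "text": "..."},
    {"url": "https://example.com.br/pagina", "text": "..."},
    {"url": "https://example.com/pt/page", "text": "..."},
]
kept = filter_by_domain(docs)
```

Matching on the hostname rather than the full URL string avoids false positives such as a `/pt/` path segment on a non-Portuguese site.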