Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
664 tokens per sample on average, according to the provided example. This corpus was used to train the JetonCount model and contains token-level statistics derived from the FineWeb-Edu dataset. It was created by the author 'fromziro' and last updated on June 22, 2026.
License is unknown; users must verify terms of use before application.