Available on 1 platform
This dataset contains 1.3M+ source code files from approximately 4,700 top-ranked GitHub developers. It was curated by ronantakizawa and updated in February 2026. The collection spans 80+ programming languages, including Python, Rust, and Go, and covers the period from 2015 to 2025.
Data is provided in Parquet format and is licensed under MIT; users may need to join with the 'github-top-developers' dataset for additional developer metadata.
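The join mentioned above might look roughly like the following sketch. It uses only the Python standard library and small in-memory rows so that it runs as-is; in practice the rows would be loaded from the Parquet files with a reader such as pandas or pyarrow. The column names (`developer`, `path`, `language`, `followers`) and the helper `left_join` are hypothetical and are not taken from the dataset's actual schema.

```python
def left_join(files, developers, key="developer"):
    """Left-join file rows with developer metadata on a shared key.

    Every file row is kept; rows without a matching developer simply
    carry no extra metadata fields.
    """
    # Index the metadata once so each lookup is O(1).
    by_key = {dev[key]: dev for dev in developers}
    joined = []
    for row in files:
        meta = by_key.get(row[key], {})
        # File fields win if a column name appears in both tables.
        joined.append({**meta, **row})
    return joined


# Hypothetical rows standing in for the source-file dataset.
files = [
    {"developer": "alice", "path": "src/main.rs", "language": "Rust"},
    {"developer": "bob", "path": "app.py", "language": "Python"},
    {"developer": "carol", "path": "main.go", "language": "Go"},
]

# Hypothetical rows standing in for 'github-top-developers'.
developers = [
    {"developer": "alice", "followers": 1200},
    {"developer": "bob", "followers": 950},
]

joined = left_join(files, developers)
for row in joined:
    print(row)
```

A left join is used here so that files whose developer is missing from the metadata table are not silently dropped; whether that is the right choice depends on the analysis.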