Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A metadata-only dataset of GitHub commits designed for large-scale AI and software engineering research, created by adhyanshaa and last updated on June 5, 2026. It aims to solve the problem of training models on open-source software history without managing large volumes of raw code. The dataset is accompanied by the GitScope CLI tool for exploration.
License is unknown, which may restrict commercial or redistribution use.