Tokenized commit history extracted from 53 GitHub repositories. The dataset includes full commit diffs, processed for text analysis tasks. The author, collection method, and time range are not specified.
The 'unfiltered' label suggests the data is raw and potentially noisy, and is likely to need significant cleaning before use. License terms for the source repositories and for the derived dataset are unknown.