Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
175,706 automated software builds from two open-source CI/CD platforms spanning 10 years, collected by Lalit Narayan Mishra. The data supports a study on temporal data leakage in machine learning models for predicting build success or failure. Models using only pre-build features achieved 82.73% accuracy on TravisTorrent (2013-2017) and 83.30% on GHALogs (2023).
Data is provided in XLS (Excel) format. License is CC-BY-4.0.