Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Colauntangle Post-cutoff dataset contains 300 synthesized tangled commits collected from open-source Java and C# projects. The benchmark is constructed by combining pairs of atomic commits using git cherry-pick, following the methodology of the original Flexeme benchmark. Each data point contains a tangled commit and its corresponding atomic commits, which serve as ground-truth decomposition answers.
Data is provided in a ZIP file format (24.2 MB). License is CC-BY-4.0.