A fork of the AIDev dataset containing all commits and repositories. The dataset is associated with a paper from arXiv and GitHub repositories. It was last updated on March 18, 2026.
Use Cases
- Analyze programming language usage patterns based on repository commits.
- Study productivity metrics like PR merge rates and turnaround times.
- Investigate developer activity and collaboration patterns across repositories.
Strengths
- Includes all commits and repositories from the original AIDev project.
- Linked to a peer-reviewed arXiv paper and open-source GitHub repository.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
Provenance
- Source
- GitHub repository https://github.com/SAILResearch/AI_Teammates_in_SE3
- Collection Method
- Fork of the AIDev dataset.
- Freshness
- Last updated 2026-03-18 19:50:29