Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A 2026 replication package by Zijie Huang for research on code smell detection. It includes the MLCQ benchmark with 14,739 annotations from 522 repositories, 1,840 developer evaluations, and 40 qualitative interview transcripts. The package contains datasets, source code for a Java subsystem, model implementations, and analysis scripts.
Requires Python 3.10+ and specific dependencies; the CodeBERT-Fusion model requires PyTorch with CUDA support and ~6GB VRAM. License is CC-BY-4.0.