A balanced collection of CT-scan images for lung cancer classification. The dataset includes images labeled as Benign, Malignant, and Normal. It was sourced from Kaggle, but the author, organization, and specific collection details are unknown.
Use Cases
- Train image classification models based on the three provided diagnostic categories.
- Benchmark model performance for medical imaging tasks based on the balanced class structure.
- Develop computer-aided diagnosis (CAD) tools based on CT-scan imagery.
- Perform data augmentation research based on a medical imaging dataset.
Strengths
- The dataset is described as balanced across the Benign, Malignant, and Normal classes.
Limitations
- Description metadata is limited; actual data quality requires manual inspection after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.