A cleaned and machine learning-ready version of the UCI Heart Disease dataset. The original data likely contains clinical and diagnostic features for predicting the presence of heart disease. The dataset was prepared for binary classification tasks.
Use Cases
- Train binary classifiers to predict heart disease presence based on clinical features.
- Benchmark feature engineering and model selection techniques for structured medical data.
- Study the relationship between patient attributes and heart disease outcomes for educational purposes.
Strengths
- Data is described as 'cleaned and machine learning-ready', suggesting it has undergone preprocessing for modeling.
- The dataset is derived from the well-known UCI Heart Disease repository, a standard benchmark in the field.
Limitations
- Row count and specific column definitions are unknown, which limits suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- UCI Machine Learning Repository (inferred from title and description).
- Collection Method
- Cleaned and prepared from the original UCI source.