Diabetes_Dataset from openml contains 9 medically relevant parameters for diabetes research. The dataset, sourced from the Pima Indians, includes features such as Pregnancies, Glucose, and BMI, with an Outcome column indicating diabetes presence. It is licensed under CC0-1.0 for open use.
Use Cases
- Predict diabetes likelihood based on parameters like Glucose, Insulin, and BMI.
- Identify patterns and significant contributing factors to diabetes from clinical features.
- Analyze interactions between factors such as pregnancy, age, and BMI in the context of diabetes.
- Support preventative healthcare planning and patient education through statistical analysis.
Strengths
- Includes 9 specific clinical parameters relevant to diabetes research.
- Outcome column provides a clear binary classification target for modeling.
- Released under the permissive CC0-1.0 license.
Limitations
- Row count is unknown, which may limit suitability assessment.
- Last update date is unknown; freshness unverified.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- Pima Indians
- Collection Method
- Likely contains clinical measurements from a specific population group.
- Time Range
- null
- Freshness
- Last updated date is unknown.
- Geography
- null