India Diabetes Patient Dataset for Prediction Modeling
Available on 1 platform
Sign in to view source links and access this dataset
Description
India Diabetes Patient Dataset is a synthetic healthcare dataset designed for diabetes prediction and machine learning benchmarking. The dataset was sourced from Kaggle, but its author, organization, and specific creation date are unknown. Its exact size, number of rows, and column-level details are not provided in the available metadata.
Use Cases
Benchmarking machine learning algorithms for binary classification based on the described purpose of diabetes prediction.
Developing synthetic data generation techniques for healthcare based on the dataset's stated synthetic nature.
Training models for early risk detection of diabetes based on the described patient dataset context.
Studying feature importance for clinical prediction tasks based on the dataset's focus on diabetes.
Strengths
Dataset is explicitly designed for machine learning benchmarking, indicating a focus on model development.
The description specifies a clear domain (healthcare) and target condition (diabetes).
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
Kaggle
Collection Method
Synthetic data generation, as stated in the description.
Geography
India
License is unknown; terms of use must be verified before application.