1 million rows of synthetic data designed for predicting gym membership churn. The dataset likely contains features related to customer demographics and behavior patterns. It originates from Kaggle, but specific authorship and update details are unknown.
Use Cases
- Train churn classification models based on synthetic customer behavior patterns.
- Benchmark predictive algorithms on a large-scale, structured churn problem.
- Analyze feature importance for churn drivers based on simulated demographic and usage data.
Strengths
- Large-scale synthetic dataset with 1 million rows.
- Designed specifically for the churn prediction task.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Data is synthetic, which may limit real-world applicability and generalizability.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- Kaggle
- Collection Method
- Synthetically generated.