A synthetic dataset designed for e-commerce risk analysis tasks. It likely contains features for classifying orders, such as fraud detection and return prediction. The dataset is hosted on Kaggle, but specific details about its creation, size, and update history are unknown.
Use Cases
- Build fraud detection models based on synthetic order features.
- Train multi-class classifiers for order risk categories.
- Predict customer returns based on synthetic transaction data.
- Benchmark risk classification algorithms on a synthetic e-commerce dataset.
Strengths
- Dataset is explicitly designed for fraud detection and return prediction tasks.
- Synthetic nature may allow for controlled experimentation without privacy concerns.
Limitations
- Row count and dataset size are unknown, which limits suitability assessment.
- Column-level documentation is absent; field semantics must be inferred after download.
- Description metadata is limited; actual data quality requires manual inspection.
Provenance
- Source
- Kaggle
- Collection Method
- Synthetic generation