Cafe Sales: 10,000 Synthetic Records for Data Cleaning Training | DataSalon

Computer Graphics & Simulation

Available on 1 platform

Description

10,000 rows of synthetic cafe sales data designed for data cleaning training. The data likely contains intentionally introduced errors or inconsistencies that simulate real-world messy data. The author, organization, and license are unknown.

Use Cases

Practice identifying and correcting inconsistencies in deliberately messy synthetic data.
Develop data cleaning pipelines for sales data, using cafe sales as the domain.
Benchmark data cleaning algorithms on a controlled, synthetic dataset.
Train data wrangling skills on a dataset whose errors are likely intentional.
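
As a rough illustration of the cleaning-pipeline use case, the sketch below normalizes placeholder tokens and coerces numeric fields for a single record. The listing provides no column documentation, so the column names (item, quantity, unit_price) and the placeholder tokens are hypothetical; replace them with whatever the downloaded file actually contains.

```python
from typing import Optional

# Hypothetical placeholder tokens; inspect the real file to find the ones it uses.
PLACEHOLDERS = {"", "error", "unknown", "n/a", "nan", "null"}


def clean_text(value: Optional[str]) -> Optional[str]:
    """Return a stripped string, or None if the value is a placeholder."""
    if value is None:
        return None
    stripped = value.strip()
    return None if stripped.lower() in PLACEHOLDERS else stripped


def clean_number(value: Optional[str]) -> Optional[float]:
    """Parse a numeric field, returning None when it cannot be parsed."""
    text = clean_text(value)
    if text is None:
        return None
    try:
        return float(text)
    except ValueError:
        return None


def clean_row(row: dict) -> dict:
    """Clean one record; 'item', 'quantity', 'unit_price' are assumed names."""
    cleaned = {
        "item": clean_text(row.get("item")),
        "quantity": clean_number(row.get("quantity")),
        "unit_price": clean_number(row.get("unit_price")),
    }
    qty, price = cleaned["quantity"], cleaned["unit_price"]
    # Derive the total only when both inputs survived cleaning.
    if qty is not None and price is not None:
        cleaned["total"] = round(qty * price, 2)
    else:
        cleaned["total"] = None
    return cleaned


# Made-up sample records, not taken from the dataset.
sample = [
    {"item": " Coffee ", "quantity": "2", "unit_price": "2.50"},
    {"item": "UNKNOWN", "quantity": "ERROR", "unit_price": "3.00"},
]
cleaned_rows = [clean_row(r) for r in sample]
```

Mapping unparseable values to None rather than dropping the row keeps the record count stable, which makes it easier to measure how many fields each cleaning rule affected.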

Strengths

10,000 rows provide a substantial volume for training exercises.
Synthetic nature allows for controlled introduction of specific data quality issues for targeted practice.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.
The dataset is synthetic and may not reflect the complexity of real-world cafe sales data.
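
Because field semantics have to be inferred after download, a quick per-column profile can be a useful first step. The following is a minimal sketch using only the Python standard library; the in-memory CSV and its column names are invented for illustration and say nothing about the actual file's schema.

```python
import csv
import io
from collections import Counter


def is_number(text: str) -> bool:
    """Return True if the text parses as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def profile_columns(reader: csv.DictReader) -> dict:
    """Summarize each column: row count, blanks, numeric count, top values."""
    stats = {}
    for row in reader:
        for name, raw in row.items():
            if name is None:
                # Extra fields beyond the header row; skipped in this sketch.
                continue
            value = (raw or "").strip()
            col = stats.setdefault(
                name, {"rows": 0, "blank": 0, "numeric": 0, "values": Counter()}
            )
            col["rows"] += 1
            if not value:
                col["blank"] += 1
                continue
            if is_number(value):
                col["numeric"] += 1
            col["values"][value] += 1
    return {
        name: {
            "rows": col["rows"],
            "blank": col["blank"],
            "numeric": col["numeric"],
            "top_values": col["values"].most_common(3),
        }
        for name, col in stats.items()
    }


# Invented sample; for the real data, pass a DictReader over the opened file.
sample_csv = "item,quantity,payment\nCoffee,2,Card\nTea,,Cash\nCoffee,ERROR,\n"
profile = profile_columns(csv.DictReader(io.StringIO(sample_csv)))
```

A column whose non-blank values are mostly numeric but include a few recurring strings is a likely candidate for injected error tokens, which the top_values list helps surface.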

Provenance

Collection Method: Synthetic generation.

Related Topics

Tabular, Cafe Sales, Training, Synthetic Data, Data Cleaning, Synthetic

Quality Score

D21

Dataset Info

Last synced: May 1, 2026
