A synthetic dataset of healthcare records. The free version offers a 2,000-row sample, while a premium version contains 100,000 rows. The author, organization, and specific data collection details are unknown.
Use Cases
- Train predictive models for patient outcomes based on synthetic medical variables.
- Benchmark data anonymization and privacy-preserving techniques using synthetic records.
- Develop and test healthcare analytics algorithms without using real patient data.
Strengths
- A free 2,000-row sample is available for initial evaluation.
- The premium version scales to 100,000 rows, which suggests a larger volume of data.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The data is synthetic, which may limit its realism and applicability to real-world scenarios.
- Row count for the free sample is specified, but the full dataset's exact size beyond the premium claim is unknown.
Provenance
- Collection Method
- Synthetically generated.