This dataset contains 2000+ synthetic vulnerabilities inspired by Common Vulnerabilities and Exposures (CVE) entries. It is designed for training AI, machine learning, and Security Operations Center (SOC) systems. It originates from Kaggle, but the author, organization, and specific creation details are unknown.
Use Cases
- Training AI models for vulnerability detection based on synthetic CVE-inspired patterns.
- Benchmarking machine learning classifiers for cybersecurity based on labeled vulnerability examples.
- Simulating SOC training scenarios using generated vulnerability data.
- Developing synthetic data pipelines for security testing that follow the same CVE-inspired approach.
Strengths
- Contains over 2000 entries, providing a substantial volume for training.
- Specifically designed for AI, ML, and SOC training applications.
Limitations
- Exact row count is unknown (only "2000+" is stated), which may limit suitability assessment.
- Column-level documentation is absent; field semantics must be inferred after download.
- Last update date is unknown; freshness unverified.
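Because the exact row count and column semantics are undocumented, a first step after download is to profile the file. The following is a minimal sketch using only the Python standard library. The function name `profile_columns` and the sample column names (`id`, `severity`, `description`) are hypothetical and chosen for illustration; the dataset's real schema is not known.

```python
import csv
import io
from collections import Counter


def profile_columns(csv_text, sample_size=3):
    """Count rows and summarize each column of a CSV supplied as text.

    Returns (row_count, profile), where profile maps each column name to
    its non-empty count, distinct-value count, and a few sample values.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    non_empty = Counter()
    distinct = {name: set() for name in columns}
    row_count = 0
    for row in reader:
        row_count += 1
        for name in columns:
            # Missing trailing fields come back as None; treat them as empty.
            value = (row.get(name) or "").strip()
            if value:
                non_empty[name] += 1
                distinct[name].add(value)
    profile = {
        name: {
            "non_empty": non_empty[name],
            "distinct": len(distinct[name]),
            "samples": sorted(distinct[name])[:sample_size],
        }
        for name in columns
    }
    return row_count, profile


# Toy input with made-up columns; NOT the dataset's actual schema.
SAMPLE = (
    "id,severity,description\n"
    "SYN-0001,HIGH,Buffer overflow in parser\n"
    "SYN-0002,,SQL injection in login form\n"
    "SYN-0003,HIGH,\n"
)

rows, summary = profile_columns(SAMPLE)
print("rows:", rows)
for column, stats in summary.items():
    print(column, stats)
```

To apply this to the downloaded file, read its contents as text (for example with `open(path, newline="", encoding="utf-8").read()`, where `path` is wherever the file was saved) and pass the result to `profile_columns`. Columns with few distinct values are candidates for labels or categories, while columns with mostly unique values are more likely identifiers or free-text descriptions.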
Provenance
- Source: Kaggle
- Collection Method: Synthetic generation, likely inspired by real CVE entries.