A malware analysis dataset containing sequences of API calls. It is hosted on Kaggle, but the available metadata does not state the number of samples, the collection period, or the original authors. The content likely consists of behavioral logs recorded during malware execution.
Use Cases
- Train a sequence model to classify malicious vs. benign software based on API call patterns (inferred from domain, verify after download)
- Analyze the progression of malware execution through its system interactions (inferred from domain, verify after download)
- Benchmark anomaly detection algorithms on API call logs (inferred from domain, verify after download)
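The classification use case can be sketched as follows. This is a minimal illustration, not the dataset's documented schema: it assumes each sample can be read as an ordered list of API call names with a binary label, which must be verified after download. The toy sequences and the helper names `ngram_counts`, `cosine`, and `classify` are hypothetical, and a 1-nearest-neighbour comparison over bigram counts stands in for a real sequence model.

```python
from collections import Counter
from math import sqrt

def ngram_counts(calls, n=2):
    """Count contiguous n-grams in one ordered sequence of API call names."""
    return Counter(tuple(calls[i:i + n]) for i in range(len(calls) - n + 1))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Hypothetical toy data; real sequences and labels must come from the downloaded files.
train = [
    (["CreateFile", "WriteFile", "CloseHandle"], "benign"),
    (["OpenProcess", "VirtualAllocEx", "WriteProcessMemory", "CreateRemoteThread"], "malicious"),
]

def classify(calls, train, n=2):
    """Label a sequence by its most similar training sequence (1-nearest neighbour)."""
    query = ngram_counts(calls, n)
    return max(train, key=lambda item: cosine(query, ngram_counts(item[0], n)))[1]

prediction = classify(["OpenProcess", "VirtualAllocEx", "WriteProcessMemory"], train)
```

N-gram counts keep some ordering information while staying cheap to compute, which makes them a reasonable baseline before trying a recurrent or transformer-based model on the real data.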
Strengths
- Published on Kaggle, a platform with established data sharing and community features.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count, file formats, and license are unknown, so suitability for a given project cannot be assessed before download.
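Because the metadata leaves the row count, columns, and file format unverified, a first step after download is to inspect the files directly. The sketch below assumes the download contains at least one CSV file with a header row, which is itself unconfirmed. The path `api_calls.csv` and the helper `summarize_csv` are hypothetical.

```python
import csv

def summarize_csv(path, preview_rows=3):
    """Return column names, data-row count, and a few leading rows of a CSV file."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])  # empty list if the file has no rows at all
        preview, row_count = [], 0
        for row in reader:
            if row_count < preview_rows:
                preview.append(row)
            row_count += 1
    return {"columns": header, "row_count": row_count, "preview": preview}

# Hypothetical usage; replace the path with the file actually downloaded.
# summary = summarize_csv("api_calls.csv")
# print(summary["columns"], summary["row_count"])
```

Streaming the file row by row avoids loading it into memory, which matters here because the file size is not known in advance.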