Over 15,000 Android applications form this malware dataset. It contains staged static features extracted from the apps, with a focus on certificate information to aid detection models. The dataset was sourced from Kaggle, but details on its author, organization, and last update are unknown.
Use Cases
- Train machine learning classifiers to detect malware based on static features.
- Benchmark new certificate-informed malware detection algorithms.
- Analyze the relationship between app certificates and malicious behavior.
- Develop feature engineering techniques for Android security datasets.
Strengths
- Dataset includes over 15,000 Android applications.
- Features are staged and include certificate information, which is a specific focus mentioned in the title.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- Kaggle
- Collection Method
- Static features were extracted from Android applications.