A 2008 dataset of firewall logs, curated by the TabArena team for evaluating predictive models on independent and identically distributed tabular data. The intended task is classification, and the original data is from a 2018 study on classifying firewall log files with a multiclass support vector machine.
Use Cases
- Benchmarking classification algorithms based on firewall log features mentioned in the description
- Evaluating model performance on independent and identically distributed (IID) tabular data as specified in the study
- Researching network intrusion detection patterns based on packet filtering data
- Developing predictive models for firewall log analysis as referenced in the original paper
Strengths
- Dataset is licensed under CC BY 4.0, permitting sharing and adaptation
- Original source and reference are provided for citation and verification
Limitations
- Column-level documentation is absent; field semantics must be inferred after download
- Row count, file formats, and sample data are unknown, which may limit suitability assessment
- Last update date is unknown; freshness unverified
Provenance
- Source
- https://doi.org/10.24432/C5131M
- Collection Method
- Curated from original firewall log files for a machine learning study.
- Time Range
- 2008
- Freshness
- Dataset Year is 2008; last updated is unknown.
- Geography
- null