47 evidence-based steps for auditing large language models using the LVC method. The dataset likely contains structured evidence or metrics related to cognitive failures identified during forensic analysis. Its origin and specific contents are not detailed.
Use Cases
- Auditing LLM robustness based on evidence-based steps
- Benchmarking cognitive failure detection based on forensic audit methodology
- Developing evaluation frameworks based on structured audit evidence
Strengths
- 47 evidence-based steps provide a structured audit framework
- Focuses on forensic analysis of cognitive failures, a specific evaluation domain
Limitations
- Description metadata is limited; actual data quality requires manual inspection after download
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is unknown, which may limit suitability assessment