CMAT 2026: Audit Logs for LLM Metacognitive Resilience

Available on 1 platform

Sign in to view source links and access this dataset

Description

Audit logs and metrics for evaluating metacognitive resilience in frontier large language models. The dataset is hosted on Kaggle, but the author, organization, and specific collection date are unknown. The description suggests it is designed for auditing and evaluating AI model behavior.

Use Cases

Benchmarking LLM resilience under adversarial prompting based on the described audit logs
Analyzing latent space tightness and model consistency using the provided metrics
Training models for improved metacognitive capabilities based on the evaluation framework

Strengths

Focus on frontier LLM evaluation, a specialized and active research area.
Data is structured as audit logs and metrics, which suggests a systematic evaluation approach.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.

Provenance

Source: Kaggle

License is unknown; users must verify terms before use.

Tabular Ai Safety Audit Logs Llm Evaluation

Related Datasets

Quality Score

D19

Description

18

Source

17

Reputation

18

Access

31

Community

0 views

Dataset Info

Last synced: Apr 11, 2026

Access

31

Community

0 views

Dataset Info

Last synced: Apr 11, 2026

CMAT 2026: Audit Logs for LLM Metacognitive Resilience

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info