NLA Thought Anchors Answer Rollouts: Sentence-Level Neural Network Activation Scores

Name: NLA Thought Anchors Answer Rollouts: Sentence-Level Neural Network Activation Scores
Creator: Realmbird
Published: 2026-05-31T15:53:55
Keywords: Machine Learning, Neural Network Activations, Model Interpretability, Tabular, Natural Language Arithmetic

by Realmbird, updated 1mo ago

Description

The dataset covers 1319 GSM8K test examples, each with 40 sampled NLA descriptions. The descriptions were split into sentences, and each sentence was scored with the kitft/nla-qwen2.5-7b-L20-ar model. The dataset was created by Realmbird and last updated on 2026-05-31.
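
The splitting rule is not documented on this page. As a minimal sketch of what sentence-level splitting can look like, the following assumes a simple punctuation-based rule; the function name `split_sentences` and the regex are illustrative assumptions, not the dataset author's method.

```python
import re

# Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
# The dataset's actual splitting rule is not documented on this page.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def split_sentences(description: str) -> list[str]:
    """Split one description into sentence strings, dropping empty pieces."""
    parts = _SENTENCE_BOUNDARY.split(description.strip())
    return [p for p in parts if p]


if __name__ == "__main__":
    example = "The model adds the two prices. It then halves the total!"
    for i, sentence in enumerate(split_sentences(example)):
        print(i, sentence)
```

A rule like this leaves decimals such as "3.5" intact because no whitespace follows the period, but it will split on abbreviations such as "e.g. "; a real pipeline may use a more careful tokenizer.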

Use Cases

Analyzing sentence-level contributions to model reasoning based on NLA description scores.
Studying the relationship between residual stream activations and generated text descriptions.
Evaluating the consistency of thought anchor descriptions across different problem instances.
Training or benchmarking models for neural network activation analysis.

Strengths

Contains 1319 distinct test examples from the GSM8K benchmark.
Each example includes 40 sampled NLA descriptions, providing a substantial sample size for analysis.
Activation extraction is focused on a specific token (the first digit after the answer prompt), offering a precise target.
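
To make the target position concrete, here is a minimal sketch that finds the first digit following an answer prompt in a rollout string. The prompt text `"The answer is "` and the function name `first_digit_offset` are assumptions for illustration; this page does not state the actual prompt, and the real extraction presumably works on token positions rather than character offsets.

```python
from typing import Optional

# Hypothetical prompt text; the dataset's actual answer prompt is not given on this page.
ANSWER_PROMPT = "The answer is "


def first_digit_offset(text: str, prompt: str = ANSWER_PROMPT) -> Optional[int]:
    """Return the character offset of the first ASCII digit after `prompt`, or None."""
    start = text.find(prompt)
    if start == -1:
        return None  # prompt not present in the text
    for offset in range(start + len(prompt), len(text)):
        if text[offset] in "0123456789":
            return offset
    return None  # prompt present, but no digit follows it


if __name__ == "__main__":
    rollout = "She sells 16 - 7 = 9 eggs. The answer is 18."
    position = first_digit_offset(rollout)
    print(position, rollout[position] if position is not None else None)
```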

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
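Total row count is not stated: 1319 examples with 40 descriptions each gives 52,760 descriptions, but the number of sentence-level rows is not given, which may limit suitability assessment.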
Description metadata is limited; actual data quality requires manual inspection after download.
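
Because field semantics have to be inferred after download, a first pass usually summarizes each column's observed types, null count, and a sample value. The sketch below uses only the standard library and assumes the rows are already loaded as a list of dictionaries; the column names in the usage example (`example_id`, `sentence`, `score`) are hypothetical placeholders, not the dataset's actual schema.

```python
from collections import defaultdict
from typing import Any


def summarize_columns(rows: list[dict[str, Any]]) -> dict[str, dict[str, Any]]:
    """Collect observed value types, a null count, and one sample value per column."""
    summary: dict[str, dict[str, Any]] = defaultdict(
        lambda: {"types": set(), "nulls": 0, "sample": None}
    )
    for row in rows:
        for name, value in row.items():
            info = summary[name]
            if value is None:
                info["nulls"] += 1
                continue
            info["types"].add(type(value).__name__)
            if info["sample"] is None:
                info["sample"] = value  # keep the first non-null value seen
    return dict(summary)


if __name__ == "__main__":
    # Hypothetical rows; the real column names must be read from the downloaded files.
    rows = [
        {"example_id": 0, "sentence": "It adds the prices.", "score": 0.12},
        {"example_id": 0, "sentence": "Then it halves them.", "score": None},
    ]
    for name, info in summarize_columns(rows).items():
        print(name, sorted(info["types"]), info["nulls"], repr(info["sample"]))
```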

Provenance

Source: Realmbird on Hugging Face.
Collection Method: Activations extracted from the kitft/nla-qwen2.5-7b-L20-av model and scored with the kitft/nla-qwen2.5-7b-L20-ar model.
Freshness: Last updated 2026-05-31 15:53:58; freshness should be verified.

Quality Score

Overall: D37
Description: 42
Source: 36
Reputation: 38
Access: 26

Community

Downloads: 9
Likes: 1
Views: 0

Dataset Info

Author: Realmbird
Created: May 31, 2026
Updated: May 31, 2026
Last synced: Jun 7, 2026
