OpenSeeker teacher-generation runs for Claude Sonnet 4.6 and three OpenAI GPT teacher backbones. The dataset, authored by Datasearch, was last updated on May 3, 2026. It contains aggregate metrics and per-task model predictions organized in JSON and TSV files.
Use Cases
- Compare aggregate performance metrics across different teacher model backbones.
- Analyze per-task prediction results and validation flags for model behavior.
- Benchmark Claude Sonnet 4.6 against OpenAI GPT models on specific generation tasks.
Strengths
- Includes results for four distinct model backbones: Claude Sonnet 4.6 and three OpenAI GPT models.
- Provides both aggregate summary metrics and detailed per-task records for granular analysis.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- Datasearch
- Collection Method
- Results from OpenSeeker teacher-generation runs.
- Time Range
- null
- Freshness
- Last updated 2026-05-03 00:15:57; freshness should be verified.
- Geography
- null