forreview43's companion dataset supports an anonymous NeurIPS submission on AI-versus-human rubric evaluation. The dataset pairs with a separate code repository to reproduce every headline number from the paper without re-running API-backed stages. The dataset was last updated on May 8, 2026.
Use Cases
- Reproducing headline results from an AI-versus-human rubric study based on the companion data.
- Validating evaluation metrics without re-running API calls based on the provided data.
- Comparing AI and human performance on rubric-based tasks using the paired dataset and code.
Strengths
- Designed for full reproducibility of a NeurIPS submission's headline numbers.
- Pairs with a documented code repository containing runnable scripts and validators.
- Last updated timestamp of 2026-05-08 06:01:58 is provided.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count, file formats, and license are unknown, which may limit suitability assessment.
- The dataset's content and structure are described at a high level only.
Provenance
- Source
- forreview43 on Hugging Face.
- Collection Method
- Likely generated as part of an anonymous NeurIPS submission study.
- Time Range
- Publication timeframe likely aligns with NeurIPS 2026 review cycle.
- Freshness
- Last updated 2026-05-08 06:01:58.
- Geography
- null