Available on 1 platform
Genomic-NIAH is a needle-in-a-haystack benchmark dataset created by HuggingFaceBio to measure the long-context retrieval capabilities of genomic language models. It plants random (KEY, VALUE) DNA pairs within real genome sequences and challenges models to retrieve the VALUE when given the surrounding context and the KEY. The dataset was last updated on May 19, 2026.
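The planting and retrieval setup described above can be illustrated with a small sketch. This is not the dataset's actual construction code: the function names (`random_dna`, `plant_needle`, `retrieve_value`), the 16-base KEY and VALUE lengths, the placement of the VALUE immediately after the KEY, and the use of a random background in place of a real genome sequence are all assumptions made for illustration.

```python
import random

DNA_ALPHABET = "ACGT"


def random_dna(length, rng):
    """Return a random DNA string of the given length (hypothetical helper)."""
    return "".join(rng.choice(DNA_ALPHABET) for _ in range(length))


def plant_needle(context, key, value, position):
    """Insert KEY immediately followed by VALUE into the context at `position`.

    Assumption: the pair is planted contiguously; the real dataset may
    use a different layout.
    """
    return context[:position] + key + value + context[position:]


def retrieve_value(haystack, key, value_len):
    """Naive string-matching baseline, not a language model.

    Finds the first occurrence of KEY and returns the `value_len` bases
    that follow it, or None if KEY is absent. It is only correct when
    KEY occurs exactly once in the haystack.
    """
    idx = haystack.find(key)
    if idx == -1:
        return None
    start = idx + len(key)
    return haystack[start:start + value_len]


rng = random.Random(0)
background = random_dna(10_000, rng)  # stand-in for a real genome sequence
key = random_dna(16, rng)             # assumed KEY length
value = random_dna(16, rng)           # assumed VALUE length
position = rng.randrange(len(background) + 1)

haystack = plant_needle(background, key, value, position)
retrieved = retrieve_value(haystack, key, value_len=len(value))
print("exact match:", retrieved == value)
```

An exact string search makes the task trivial, which is the point of the sketch: it fixes what a correct answer looks like. A genomic language model under evaluation would instead receive the haystack and the KEY as input and would have to generate the VALUE itself.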
The license is unknown; verify the terms of use before using the dataset.