Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Cleanlab created this dataset for benchmarking large language models on structured output tasks. The dataset's description indicates it is specifically designed for evaluating LLM performance on extracting structured information from text. It was last updated on December 3,我们发现了一个错误。
License is unknown; terms of use must be verified before application.