Available on 1 platform
LiveBench is a benchmark for large language models that limits test set contamination by releasing new questions monthly. Questions are drawn from recently released datasets, arXiv papers, news articles, and IMDb movie synopses. It was created by 'livebench' and last updated in April 2025.
This summary does not list column names or describe the data structure; refer to the dataset page for the full description and latest details.