Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
140,000 user votes comparing two language models on a conversation, collected via the LM Arena platform. Each row contains a single vote with the full conversation history and metadata like the winning model and evaluation session. The dataset was created by lmarena-ai and last updated in August 2025.
License is unknown; users must verify terms of use before downloading.