LMSYS Chatbot Arena
ai_benchmark
Overview
Developed byLMSYS Org (UC Berkeley)
Open source✓ Open Source
Use casecrowdsourced LLM ranking through anonymous side-by-side comparisons with Elo ratings
Also see
Alternative to
Knowledge graph stats
Claims7
Avg confidence97%
Avg freshness99%
Last updatedUpdated yesterday
Trust distribution
100% unverified
Governance
Not assessed
LMSYS Chatbot Arena
product — also known as: Chatbot Arena
Crowdsourced platform for evaluating LLMs through anonymous randomized pairwise battles
Compare with...used by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| OpenAI | ○Unverified | High | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| LMArena | ○Unverified | High | Fresh | 1 |
evaluates
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| overall LLM quality via human preference judgments | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| crowdsourced LLM ranking through anonymous side-by-side comparisons with Elo ratings | ○Unverified | High | Fresh | 1 |
first released
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 2023 | ○Unverified | High | Fresh | 1 |
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| LMSYS Org (UC Berkeley) | ○Unverified | High | Fresh | 1 |