BigBench
ai_benchmark
Overview
Open source✓ Open Source
Use caseprobing LLM capabilities across 200+ diverse tasks beyond standard benchmarks
Also see
Alternative to
Knowledge graph stats
Claims7
Avg confidence97%
Avg freshness99%
Last updatedUpdated yesterday
Trust distribution
100% unverified
Governance
Not assessed
BigBench
concept — also known as: BIG-Bench, BIG-bench
Collaborative benchmark with 200+ tasks probing LLM capabilities beyond standard benchmarks
Compare with...alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MMLU | ○Unverified | High | Fresh | 1 |
used by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| ○Unverified | High | Fresh | 1 |
evaluates
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| broad cognitive abilities including reasoning, translation, and understanding | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| probing LLM capabilities across 200+ diverse tasks beyond standard benchmarks | ○Unverified | High | Fresh | 1 |
first released
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 2022 | ○Unverified | High | Fresh | 1 |
created by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Google and 450+ researchers | ○Unverified | High | Fresh | 1 |