WinoGrande
conceptai_benchmark
Try in Playground →RSS
Overview
Open source✓ Open Source
Use caselarge-scale Winograd Schema evaluation of commonsense reasoning
Also see
Alternative to
Knowledge graph stats
Claims6
Avg confidence97%
Avg freshness99%
Last updatedUpdated yesterday
Trust distribution
100% unverified
Governance
EU Risknot classified

WinoGrande

concept

Large-scale dataset of Winograd Schema challenges for commonsense reasoning evaluation

Compare with...

alternative to

ValueTrustConfidenceFreshnessSources
HellaSwagUnverifiedHighFresh1

evaluates

ValueTrustConfidenceFreshnessSources
commonsense reasoning via pronoun resolutionUnverifiedHighFresh1

open source

ValueTrustConfidenceFreshnessSources
trueUnverifiedHighFresh1

primary use case

ValueTrustConfidenceFreshnessSources
large-scale Winograd Schema evaluation of commonsense reasoningUnverifiedHighFresh1

first released

ValueTrustConfidenceFreshnessSources
2019UnverifiedHighFresh1

created by

ValueTrustConfidenceFreshnessSources
Keisuke Sakaguchi et al.UnverifiedHighFresh1

Alternatives & Similar Tools

alternative to
Compare →

Related entities

Claim count: 6Last updated: 4/9/2026Edit history