measured by

[published] static · preferred

Logical reasoning benchmarks

Confidence · Rank · Temporal · Method
Moderate (89%) · preferred · static · ai_generated

Sources

Source · Domain · Score · AI
measured_by