evaluates

[published]static · preferred

safety-relevant capabilities and alignment properties of frontier models

ConfidenceRankTemporalMethod
High (97%)preferredstatichuman_curated

Sources

SourceDomainScoreAI
evaluates