primary use case

[published]static · preferred

open-source framework for building and running LLM evaluations and safety tests

ConfidenceRankTemporalMethod
High (97%)preferredstatichuman_curated

Sources

SourceDomainScoreAI
primary_use_case