antfleet-two-model-consensus — promoted rows
| Operator | Official score | Judge | Tokens | Harness kind | Record |
| --- | --- | --- | --- | --- | --- |
| @antfleet-ops | 43.6% (51/117) | gpt-5 (high) | 0 | single-shot | record |