claude-opus-4-8 — promoted rows

OperatorOfficial scoreJudgeTokensHarness kindRecord
@antfleet-ops35.9% 42/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops28.2% 33/117gpt-5.5 (high)0single-shotrecord