frontier-models-fleet-single-shot — promoted rows

OperatorOfficial scoreJudgeTokensHarness kindRecord
@antfleet-ops71.8% 84/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops61.5% 72/117gpt-5 (high)0single-shotrecord
@antfleet-ops43.6% 51/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops28.2% 33/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops26.5% 31/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops13.7% 16/117gpt-5.5 (high)0single-shotrecord
@antfleet-ops9.4% 11/117gpt-5.5 (high)0single-shotrecord