03RUNS

runsv2-smoke-20260425T154553Z

openaicompletedv2 smoke

Gpt 5

openai/gpt-5
Headline
100.0/100
CI [100.0, 100.0]
Cost
$0.0000
0 judge calls
Cases
6/6≥ 70%
0 failing
Per-band breakdown
  • MEDIUM
    n=3 · ci [100.0, 100.0]
    100.0
  • TRIVIAL
    n=1 · ci [100.0, 100.0]
    100.0
  • EASY
    n=2 · ci [100.0, 100.0]
    100.0
Cluster radar
Latency & throughput
  • Total elapsed
  • Avg judge latency
  • Judge calls0
  • Total spend$0.0000
  • CompletedSat, 25 Apr 2026 15:45:53 GMT