RUNS
v2-smoke-20260425T170433Z
openai · completed · v2 smoke
GPT-5 Nano
openai/gpt-5-nano
Headline
67.3/100
CI [35.1, 92.2]
Cost
$0.0000
0 judge calls
Cases
8/12 ≥ 70%
2 failing
Per-band breakdown
- HARD: 92.9 · n=7 · CI [78.6, 100.0]
- EXPERT: 52.1 · n=5 · CI [20.0, 84.1]
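The report does not say how the per-band intervals are computed. As a rough illustration only, the sketch below shows one common way to get a band mean with a 95% interval: a percentile bootstrap over per-case scores. The function name `bootstrap_ci`, the resample count, the seed, and the scores themselves are all hypothetical and are not taken from this run.

```python
import random
from statistics import mean


def bootstrap_ci(scores, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean of per-case scores.

    Assumption: the dashboard's CI method is unknown; this is one
    plausible technique, not the tool's documented behaviour.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(scores)
    # Resample the cases with replacement and record each resample's mean.
    means = sorted(mean(rng.choices(scores, k=n)) for _ in range(n_resamples))
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = int((1 - alpha / 2) * n_resamples) - 1
    return means[lo_idx], means[hi_idx]


# Hypothetical per-case scores on a 0-100 scale (NOT the data behind the report).
hard_scores = [100.0, 95.0, 90.0, 100.0, 85.0, 80.0, 100.0]
band_mean = mean(hard_scores)
ci_low, ci_high = bootstrap_ci(hard_scores)
print(f"HARD: {band_mean:.1f} · n={len(hard_scores)} · CI [{ci_low:.1f}, {ci_high:.1f}]")
```

With only 5 to 7 cases per band, any interval of this kind will be wide, which is consistent with the spread shown for the EXPERT band.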
Cluster radar
Latency & throughput
- Total elapsed: n/a
- Avg judge latency: n/a
- Judge calls: 0
- Total spend: $0.0000
- Completed: Sat, 25 Apr 2026 17:04:33 GMT
Q 0.976 · LAT p50 1.2s · p95 2.7s · p99 5.1s · JUDGE gpt-4.1-mini · Q-DEPTH 0 · $/EVAL $0.00014
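For readers unfamiliar with the p50/p95/p99 notation in the status line, the following is a minimal sketch of the nearest-rank percentile method applied to a list of latencies. The helper `percentile` and the sample latencies are hypothetical; the report does not state which percentile definition the dashboard uses.

```python
import math


def percentile(values, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = max(1, math.ceil(p / 100 * len(ordered)))  # 1-based rank
    return ordered[rank - 1]


# Hypothetical request latencies in seconds (illustrative, not measured data).
latencies_s = [0.8, 0.9, 1.0, 1.1, 1.2, 1.3, 1.5, 1.9, 2.7, 5.1]
summary = {f"p{p}": percentile(latencies_s, p) for p in (50, 95, 99)}
print(" · ".join(f"{name} {value}s" for name, value in summary.items()))
```

With so few samples, p95 and p99 both collapse onto the slowest request, so tail percentiles from a small smoke run should be read as rough indicators.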