03RUNS
runsv2-smoke-20260425T160855Z
openaicompletedv2 smoke
Gpt 5 Nano
openai/gpt-5-nanoHeadline
90.8/100
CI [84.7, 100.0]Cost
$0.0000
0 judge callsCases
11/12≥ 70%
0 failingPer-band breakdown
- MEDIUM91.7n=6 · ci [75.0, 100.0]
- TRIVIAL100.0n=1 · ci [100.0, 100.0]
- EXPERT85.3n=1 · ci [85.3, 85.3]
- EASY100.0n=4 · ci [100.0, 100.0]
Cluster radar
Latency & throughput
- Total elapsed—
- Avg judge latency—
- Judge calls0
- Total spend$0.0000
- CompletedSat, 25 Apr 2026 16:08:55 GMT
Q0.976LATp50 1.2s · p95 2.7s · p99 5.1sJUDGEgpt-4.1-miniQ-DEPTH0$/EVAL$0.00014