Blog 8 log
scripted_deterministic_scripted.console.log
scripted_deterministic_scripted.console.log / 2.6 KB / 36 lines
SRE-Zero Full Eval Plan
+-----------------------------------------------------------------------------+
| Kind | Baseline | Model | Episodes | Output |
|---------------+----------+-------------------+----------+-------------------|
| deterministic | scripted | deterministic/sc� | 5 | scripted_episode� |
+-----------------------------------------------------------------------------+
[11:18:30] START run=1/1 baseline=scripted run_all_eval.py:242
model=deterministic/scripted episodes=5
[11:18:31] END run=1/1 baseline=scripted run_all_eval.py:295
model=deterministic/scripted score=93.198
success=1.000 errors=0
output=D:\SRE-Zero\notes\runs\managed\blog-mistr
al-small-easy-agent-styles-2026-06-14\outputs\sc
ripted_episodes5.json
full sweep �
1/1 scripted | deterministic/scripted | load_balancer_tls_cert_expired 11/1� �
SRE-Zero Baseline Marks
+-----------------------------------------------------------------------------+
| Basel� | Model | Marks | Succe� | Reward | Evide� | Inval� | Steps | Erro� |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| scrip� | deter� | 93.2 | 1.00 | 0.941 | 1.00 | 0.00 | 4.73 | 0 |
+-----------------------------------------------------------------------------+
Wrote records and marks to
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\
target_summaries\scripted_deterministic_scripted.summary.json
SRE-Zero Marks by Difficulty
+-----------------------------------------------------------------------------+
| | | | | | | Root | Correct |
| Diffic� | Baseli� | Model | Marks | Success | Eviden� | Cause | Fix |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy | script� | determ� | 93.2 | 1.00 | 1.00 | 1.00 | 1.00 |
+-----------------------------------------------------------------------------+
Wrote run log to
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\
logs\scripted_deterministic_scripted.run.log