Blog 8 log
random_deterministic_random.console.log
SRE-Zero Full Eval Plan
+-----------------------------------------------------------------------------+
| Kind | Baseline | Model | Episodes | Output |
|---------------+----------+-------------------+----------+-------------------|
| deterministic | random   | deterministic/ra… | 5        | random_episodes5… |
+-----------------------------------------------------------------------------+
[11:18:28] START run=1/1 baseline=random run_all_eval.py:242
model=deterministic/random episodes=5
END run=1/1 baseline=random run_all_eval.py:295
model=deterministic/random score=5.667
success=0.000 errors=0
output=D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\outputs\random_episodes5.json
full sweep
1/1 random | deterministic/random | load_balancer_tls_cert_expired 11/11 ep…
SRE-Zero Baseline Marks
+-----------------------------------------------------------------------------+
| Basel… | Model  | Marks | Succe… | Reward | Evide… | Inval… | Steps | Erro… |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| random | deter… | 5.7   | 0.00   | 0.005  | 0.05   | 0.09   | 3.11  | 0     |
+-----------------------------------------------------------------------------+
Wrote records and marks to
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\target_summaries\random_deterministic_random.summary.json
SRE-Zero Marks by Difficulty
+-----------------------------------------------------------------------------+
| | | | | | | Root | Correct |
| Diffic… | Baseli… | Model   | Marks | Success | Eviden… | Cause   | Fix     |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy    | random  | determ… | 5.7   | 0.00    | 0.05    | 0.02    | 0.07    |
+-----------------------------------------------------------------------------+
Wrote run log to
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\logs\random_deterministic_random.run.log