Blog 8 log

scripted_deterministic_scripted.console.log / 2.6 KB / 36 lines

                            SRE-Zero Full Eval Plan                            
+-----------------------------------------------------------------------------+
| Kind          | Baseline | Model             | Episodes | Output            |
|---------------+----------+-------------------+----------+-------------------|
| deterministic | scripted | deterministic/sc… |        5 | scripted_episode… |
+-----------------------------------------------------------------------------+
[11:18:30] START run=1/1 baseline=scripted                  run_all_eval.py:242
           model=deterministic/scripted episodes=5                             
[11:18:31] END run=1/1 baseline=scripted                    run_all_eval.py:295
           model=deterministic/scripted score=93.198                           
           success=1.000 errors=0                                              
           output=D:\SRE-Zero\notes\runs\managed\blog-mistr                    
           al-small-easy-agent-styles-2026-06-14\outputs\sc                    
           ripted_episodes5.json                                               
full sweep
1/1 scripted | deterministic/scripted | load_balancer_tls_cert_expired 11/1…
                            SRE-Zero Baseline Marks                            
+-----------------------------------------------------------------------------+
| Basel… | Model  | Marks | Succe… | Reward | Evide… | Inval… | Steps | Erro… |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| scrip… | deter… |  93.2 |   1.00 |  0.941 |   1.00 |   0.00 |  4.73 |     0 |
+-----------------------------------------------------------------------------+
Wrote records and marks to 
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\
target_summaries\scripted_deterministic_scripted.summary.json
                         SRE-Zero Marks by Difficulty                          
+-----------------------------------------------------------------------------+
|         |         |         |       |         |         |    Root | Correct |
| Diffic… | Baseli… | Model   | Marks | Success | Eviden… |   Cause |     Fix |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy    | script… | determ… |  93.2 |    1.00 |    1.00 |    1.00 |    1.00 |
+-----------------------------------------------------------------------------+
Wrote run log to 
D:\SRE-Zero\notes\runs\managed\blog-mistral-small-easy-agent-styles-2026-06-14\
logs\scripted_deterministic_scripted.run.log