Blog 7 log

scripted_deterministic_scripted.console.log

                            SRE-Zero Full Eval Plan                            
+-----------------------------------------------------------------------------+
| Kind          | Baseline | Model             | Episodes | Output            |
|---------------+----------+-------------------+----------+-------------------|
| deterministic | scripted | deterministic/sc… |        5 | scripted_episode… |
+-----------------------------------------------------------------------------+
[22:35:37] START run=1/1 baseline=scripted                  run_all_eval.py:225
           model=deterministic/scripted episodes=5                             
           END run=1/1 baseline=scripted                    run_all_eval.py:278
           model=deterministic/scripted score=93.198                           
           success=1.000 errors=0                                              
           output=D:\SRE-Zero\notes\runs\managed\blog-qwen-                    
           easy-agent-styles-2026-06-13\outputs\scripted_ep                    
           isodes5.json                                                        
full sweep
1/1 scripted | deterministic/scripted | load_balancer_tls_cert_expired 11/1…
                            SRE-Zero Baseline Marks                            
+-----------------------------------------------------------------------------+
| Basel… | Model  | Marks | Succe… | Reward | Evide… | Inval… | Steps | Erro… |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| scrip… | deter… |  93.2 |   1.00 |  0.941 |   1.00 |   0.00 |  4.73 |     0 |
+-----------------------------------------------------------------------------+
Wrote records and marks to
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\target_summaries\scripted_deterministic_scripted.summary.json
                         SRE-Zero Marks by Difficulty                          
+-----------------------------------------------------------------------------+
|         |         |         |       |         |         |    Root | Correct |
| Diffic… | Baseli… | Model   | Marks | Success | Eviden… |   Cause |     Fix |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy    | script… | determ… |  93.2 |    1.00 |    1.00 |    1.00 |    1.00 |
+-----------------------------------------------------------------------------+
Wrote run log to
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\logs\scripted_deterministic_scripted.run.log