Blog 7 log

open_source_qwen_qwen3.6-35b-a3b.console.log

open_source_qwen_qwen3.6-35b-a3b.console.log / 18.8 KB / 272 lines

                            SRE-Zero Full Eval Plan                            
+-----------------------------------------------------------------------------+
| Kind | Baseline    | Model                | Episodes | Output               |
|------+-------------+----------------------+----------+----------------------|
| llm  | open_source | qwen/qwen3.6-35b-a3b |        1 | open_source_qwen_qw� |
+-----------------------------------------------------------------------------+
[22:35:39] START run=1/1 baseline=open_source               run_all_eval.py:225
           model=qwen/qwen3.6-35b-a3b episodes=1                               
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=2/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=3/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=3/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
[23:07:37] END run=1/1 baseline=open_source                 run_all_eval.py:278
           model=qwen/qwen3.6-35b-a3b score=31.419                             
           success=0.182 errors=0                                              
           output=D:\SRE-Zero\notes\runs\managed\blog-qwen-                    
           easy-agent-styles-2026-06-13\outputs\open_source                    
           _qwen_qwen3.6-35b-a3b_episodes1.json                                
full sweep                                                                   � 
1/1 open_source | qwen/qwen3.6-35b-a3b | load_balancer_tls_cert_expired 11/� � 
                            SRE-Zero Baseline Marks                            
+-----------------------------------------------------------------------------+
| Basel� | Model  | Marks | Succe� | Reward | Evide� | Inval� | Steps | Erro� |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| open_� | qwen/� |  31.4 |   0.18 |  0.217 |   0.67 |   0.00 |  6.55 |     0 |
+-----------------------------------------------------------------------------+
Wrote records and marks to 
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\target_su
mmaries\open_source_qwen_qwen3.6-35b-a3b.summary.json
                         SRE-Zero Marks by Difficulty                          
+-----------------------------------------------------------------------------+
|         |         |         |       |         |         |    Root | Correct |
| Diffic� | Baseli� | Model   | Marks | Success | Eviden� |   Cause |     Fix |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy    | open_s� | qwen/q� |  31.4 |    0.18 |    0.67 |    0.55 |    0.64 |
+-----------------------------------------------------------------------------+
Wrote run log to 
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\logs\open
_source_qwen_qwen3.6-35b-a3b.run.log