SRE-Zero Full Eval Plan +-----------------------------------------------------------------------------+ | Kind | Baseline | Model | Episodes | Output | |------+-------------+----------------------+----------+----------------------| | llm | open_source | qwen/qwen3.6-35b-a3b | 1 | open_source_qwen_qw… | +-----------------------------------------------------------------------------+ [22:35:39] START run=1/1 baseline=open_source run_all_eval.py:225 model=qwen/qwen3.6-35b-a3b episodes=1 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=2/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=3/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=3/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6 SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0 SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6 [23:07:37] END run=1/1 baseline=open_source run_all_eval.py:278 model=qwen/qwen3.6-35b-a3b score=31.419 success=0.182 errors=0 output=D:\SRE-Zero\notes\runs\managed\blog-qwen- easy-agent-styles-2026-06-13\outputs\open_source _qwen_qwen3.6-35b-a3b_episodes1.json full sweep … 1/1 open_source | qwen/qwen3.6-35b-a3b | load_balancer_tls_cert_expired 11/… … SRE-Zero Baseline Marks +-----------------------------------------------------------------------------+ | Basel… | Model | Marks | Succe… | Reward | Evide… | Inval… | Steps | Erro… | |--------+--------+-------+--------+--------+--------+--------+-------+-------| | open_… | qwen/… | 31.4 | 0.18 | 0.217 | 0.67 | 0.00 | 6.55 | 0 | +-----------------------------------------------------------------------------+ Wrote records and marks to D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\target_su mmaries\open_source_qwen_qwen3.6-35b-a3b.summary.json SRE-Zero Marks by Difficulty +-----------------------------------------------------------------------------+ | | | | | | | Root | Correct | | Diffic… | Baseli… | Model | Marks | Success | Eviden… | Cause | Fix | |---------+---------+---------+-------+---------+---------+---------+---------| | easy | open_s… | qwen/q… | 31.4 | 0.18 | 0.67 | 0.55 | 0.64 | +-----------------------------------------------------------------------------+ Wrote run log to D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\logs\open _source_qwen_qwen3.6-35b-a3b.run.log