Blog 7 log
open_source_react_qwen_qwen3.6-35b-a3b.console.log
open_source_react_qwen_qwen3.6-35b-a3b.console.log / 16.9 KB / 251 lines
SRE-Zero Full Eval Plan
+-----------------------------------------------------------------------------+
| Kind | Baseline | Model | Episodes | Output |
|------+-------------------+-------------------+----------+-------------------|
| llm | open_source_react | qwen/qwen3.6-35b� | 1 | open_source_reac� |
+-----------------------------------------------------------------------------+
[23:13:38] START run=1/1 baseline=open_source_react run_all_eval.py:225
model=qwen/qwen3.6-35b-a3b episodes=1
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=Unexpected message content type: NoneType; finish_reason='length'; native_finish_reason='length'; message_keys=['content', 'refusal', 'role']; has_reasoning=False
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=1/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=2/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request failed model=qwen/qwen3.6-35b-a3b attempt=2/6 error=LLM provider returned HTTP 504: error code: 504
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=3/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=3/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM request start model=qwen/qwen3.6-35b-a3b attempt=1/6
SREZERO_LLM throttle sleep model=qwen/qwen3.6-35b-a3b seconds=15.0
SREZERO_LLM request success model=qwen/qwen3.6-35b-a3b attempt=1/6
[23:38:06] END run=1/1 baseline=open_source_react run_all_eval.py:278
model=qwen/qwen3.6-35b-a3b score=52.104
success=0.455 errors=0
output=D:\SRE-Zero\notes\runs\managed\blog-qwen-
easy-agent-styles-2026-06-13\outputs\open_source
_react_qwen_qwen3.6-35b-a3b_episodes1.json
full sweep �
1/1 open_source_react | qwen/qwen3.6-35b-a3b | load_balancer_tls_cert_expir� �
SRE-Zero Baseline Marks
+-----------------------------------------------------------------------------+
| Basel� | Model | Marks | Succe� | Reward | Evide� | Inval� | Steps | Erro� |
|--------+--------+-------+--------+--------+--------+--------+-------+-------|
| open_� | qwen/� | 52.1 | 0.45 | 0.511 | 0.74 | 0.00 | 6.00 | 0 |
+-----------------------------------------------------------------------------+
Wrote records and marks to
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\target_su
mmaries\open_source_react_qwen_qwen3.6-35b-a3b.summary.json
SRE-Zero Marks by Difficulty
+-----------------------------------------------------------------------------+
| | | | | | | Root | Correct |
| Diffic� | Baseli� | Model | Marks | Success | Eviden� | Cause | Fix |
|---------+---------+---------+-------+---------+---------+---------+---------|
| easy | open_s� | qwen/q� | 52.1 | 0.45 | 0.74 | 0.91 | 0.82 |
+-----------------------------------------------------------------------------+
Wrote run log to
D:\SRE-Zero\notes\runs\managed\blog-qwen-easy-agent-styles-2026-06-13\logs\open
_source_react_qwen_qwen3.6-35b-a3b.run.log