[Training a Qwen 3.5 4B/9B agent for multi-tool use: SFT first or go directly to RL?](https://old.reddit.com/r/LocalLLaMA/comments/1ud8opg/training_a_qwen_35_4b9b_agent_for_multitool_use/) (A: 7/10)

 📊 Budget-Agenten: 4 relevante Diskussionen ------------------------------------------------ Die Community diskutiert akt







