A user explores using Qwen 27B for long-horizon task planning and Qwen 35B-A3B for rapid execution, noting the 27B runs at 7-10 tokens per second and the 35B-A3B at ~18 tokens per second. The user considers switching between models to leverage their different strengths, though currently uses the 35B-A3B exclusively and questions whether the intelligence gap between models is significant.