Lab · Alibaba (Qwen)
arxiv arXiv cs.CL · 9d ago

Language Models Encode Value of Their Current Trajectory

Qwen3-8B internally tracks the value of its current trajectory, defined as the likelihood of achieving its goals. This 'value' axis distinguishes confidence levels, backtracking behavior, and code correctness, and shows that preference optimization boosts confidence in rewarded behaviors. The model assigns low value to politically sensitive queries post-training, and fine-tuning increases confidence within specific domains.

media r/LocalLLaMA · 9d ago

Cheapest hardware for Qwen 3.6: 27B and 35B-A3B models

A Reddit post discusses the cost-effective hardware setup for running Qwen 3.6 models, both 27B and 35B-A3B, noting that RTX 3090 24GB offers better long-term value over Tesla V100 due to discontinuation and upcoming Chinese alternatives. The proposed build totals $1,995.65, including a Ryzen 5 5600X, RTX 3090 24GB, and essential components, with the total price being a key concern for users seeking affordability.