Does Verbose Chain-of-Thought Really Help? In-Distribution Evidence that Content, Not Length, Matters
This study investigates whether verbose chain-of-thought prompting improves large language model reasoning through increased computation or by providing useful semantic content. The authors present evidence from in-distribution sampling and controlled interventions to determine the specific factors driving performance gains.