Einstein World Models: Visualizing Counterfactuals for LLM Reasoning
The article introduces Einstein World Models (EWMs), a framework designed to enhance large language model reasoning by integrating visual-temporal rollouts into the reasoning trace. This approach allows models to utilize visual thought experiments as inspectable hypotheses to complement text-based processing.