Understanding Evaluation Illusion in Diffusion Large Language Models
A study reveals that evaluating diffusion large language models (dLLMs) is highly sensitive to prompt templates, creating an illusion that parallel decoding improves efficiency without performance loss.