Epi2Diff: Using LLM Reasoning Traces to Predict Human Item Difficulty
Researchers introduce Epi2Diff, a framework that maps Large Reasoning Model (LRM) traces into cognitively grounded episode sequences to predict human item difficulty in educational assessment. By modeling difficulty through reasoning scale, effort allocation, and state transitions, the method provides an interpretable alternative to costly human calibration.