arxiv arXiv cs.LG · 6d ago · research

LLM Alignment Using Implicit User Feedback

from English

A new dataset, IFLLM, collects mouse trajectories and eye gazing data from users interacting with LLMs. It shows that implicit feedback significantly improves LLM alignment, boosting text-based reward model accuracy from 55% to 64% and nearly tripling response quality improvements after DPO training on eight LLMs.

Importance 3/3 New harness with differentiators arXiv cs.LG OpenAI Google DeepMind Meta AI Evaluation & benchmarks Research paper Safety & alignment

Read original