arxiv arXiv cs.CL · 2d ago · src: 6d ago · research

LLM-Based Multi-Reference Evaluation for Phrase Break Annotations

from English

LMRE addresses limitations of single-reference evaluation by modeling multiple valid phrasings of speech. It outperforms traditional methods in aligning with human judgment on acceptance and scoring, demonstrating scalability and robustness for Korean speech annotations.

Importance 2/3 arXiv cs.CL Evaluation & benchmarks Reasoning models

Read original