SCOPE introduces a framework that uses a readable hidden layer and conformal calibration to detect out-of-distribution inputs. It employs a supermartingale e-process to provide theoretical guarantees for service-boundary detection, outperforming standard final-layer detectors in multiple LLM backbones.
SCOPE: Sequential Conformal Probing for OOD Rejection in LLMs
from English