SIFT introduces claim-conditioned re-scoring of evidence spans to better align with full claims, recovering up to 27.6 points in accuracy on FEVER, SciFact, 5PILS, and DP. WSP, an automatic NLI check, achieves AUC 0.92 and precision 0.98 when calibrating against human gold evidence.
SIFT and WSP Improve Fact-Checking Accuracy
from English