Topic · Retrieval & RAG
arxiv arXiv cs.CL · 8d ago

ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents

ProvenanceGuard introduces a source-aware verifier for MCP-based LLM agents that detects cross-source conflation by routing claims to specific evidence sources and comparing stated attribution with actual source ownership. It achieves block F1 of 0.802 and source accuracy of 0.858 on 260 source-eligible claims, outperforming source-blind baselines, and detects all injected attribution swaps in 50 clinical probes.

arxiv arXiv cs.AI · 8d ago

ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents

ProvenanceGuard introduces a source-aware verifier for MCP-based LLM agents that detects cross-source conflation by routing claims to specific evidence sources and comparing stated attribution with actual source ownership. It achieves block F1 of 0.802 and source accuracy of 0.858 on 260 source-eligible claims, outperforming source-blind baselines, and detects all injected attribution swaps in 50 clinical probes.

arxiv arXiv cs.CL · 8d ago

HistoRAG: Integrating Historical Methodology into RAG

HistoRAG introduces architectural changes to Retrieval-Augmented Generation based on historiographical principles. It separates retrieval and generation, implements temporal windowing for balanced source representation, and uses LLM-as-judge evaluation for transparent relevance judgments. Evaluated on 102,189 Der Spiegel articles (1950-1979), the framework addresses deficiencies in standard RAG, including temporal skew and weak retrieval correlation, and proposes Zwischentexte as a responsible integration method for LLM-generated content in scholarly work.

arxiv arXiv cs.CL · 8d ago

MODE-RAG: Evaluating and Reducing Hallucinations in M-RAG

MODE-RAG proposes a multi-agent system using Variational Free Energy to dynamically gate interventions and reduce cross-modal hallucinations in retrieval-augmented generation. It integrates Monte Carlo Tree Search and logit perturbations to address causal fabrications and sycophancy, with dedicated agents ensuring factual verification and formatting stability. Evaluated via ModeVent, a subset of MultiVent, the system significantly improves robustness against logical fabrications.

media r/LocalLLaMA · 6d ago

LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M Released

LFM2.5-Embedding-350M is a dense bi-encoder that provides fast multilingual retrieval with one vector per document, achieving best-in-class accuracy for its size and inference speed comparable to smaller models. LFM2.5-ColBERT-350M is a late interaction retriever with best-in-class multilingual accuracy, enabling cross-lingual retrieval by storing one vector per token and supporting retrieval in multiple languages with high precision. Both models are designed as drop-in replacements for existing RAG pipelines.