All articles
arxiv arXiv cs.CL · 4h ago

MemDelta: Controlled Baselines and Hidden Confounds in Agent Memory Evaluation

The article introduces MemDelta, a controlled evaluation protocol for agent memory systems that isolates individual components to prevent confounding variables from skewing results. Using the LongMemEval-S dataset with 500 questions across three model families, the study reveals that reported gains often mix changes in memory methods with variations in language models or retrieval pipelines.

arxiv arXiv cs.CL · 5h ago

Node-to-Neighborhood Semantic Consistency: Text-Topology Alignment for TAGs Anomaly Detection

This paper addresses graph anomaly detection on text-attributed graphs by formalizing it as a node-to-neighborhood semantic consistency problem, where anomalies stem from mismatches between textual semantics and topological relationships. The authors propose N2NSC, a framework that uses two complementary fusion paths to align graph topology with textual semantics, enabling large language models to leverage both structural and textual neighborhood information.