AI agents
arxiv arXiv cs.CL · 7d ago

Generative Engine Optimization: Measuring AI Search Visibility

A large-scale study of 100K+ AI prompt responses across 100+ brands reveals a three-tier brand visibility ladder: global brands appear in 73% of answers, mid-market in 44%, and niche brands in just 11%. AI engines primarily cite corporate websites, with YouTube leading non-corporate sources, and best-of listicles accounting for 21% of citations. Sentiment in brand mentions is unstable, flipping six times more often than mere mention.

arxiv arXiv cs.LG · 7d ago

Act2Answer Evaluates Knowledge Retention in Vision-Language-Action Models

Act2Answer introduces a lightweight protocol to assess commonsense and world knowledge retention in VLA models by requiring agents to answer questions through object placement actions. A large-scale study of 7 VLA models and 9 VLM baselines reveals that VLAs perform well on simple concepts but show larger gaps on rich semantic categories compared to their source VLMs, with VQA co-training improving knowledge retention and peak answer-relevant signals observed in middle VLA layers.

arxiv arXiv cs.AI · 7d ago

User as Engram: Local Parametric Edits for Personal Memory

User as Engram proposes storing per-user facts as surgical, hash-keyed edits to a memory table, leaving reasoning in a shared adapter. This design achieves 5.6x higher indirect-reasoning accuracy and maintains base-level reasoning performance, with a memory footprint 33,000x smaller than per-user LoRA. The approach enables disjoint user edits that compose losslessly, outperforming retrieval pipelines beyond 100 facts.