AI agents
arxiv arXiv cs.CL · 7d ago

Generative Engine Optimization: Measuring AI Search Visibility

A large-scale study of 100K+ AI prompt responses across 100+ brands reveals a three-tier brand visibility ladder: global brands appear in 73% of answers, mid-market in 44%, and niche brands in just 11%. AI engines primarily cite corporate websites, with YouTube leading non-corporate sources, and best-of listicles accounting for 21% of citations. Sentiment in brand mentions is unstable, flipping six times more often than mere mention.

arxiv arXiv cs.LG · 7d ago

Act2Answer Evaluates Knowledge Retention in Vision-Language-Action Models

Act2Answer introduces a lightweight protocol to assess commonsense and world knowledge retention in VLA models by requiring agents to answer questions through object placement actions. A large-scale study of 7 VLA models and 9 VLM baselines reveals that VLAs perform well on simple concepts but show larger gaps on rich semantic categories compared to their source VLMs, with VQA co-training improving knowledge retention and peak answer-relevant signals observed in middle VLA layers.