Microsoft Research introduces Memora, a scalable agentic memory framework designed to balance abstraction and specificity for long-horizon AI tasks. The system decouples rich memory content from lightweight retrieval structures, setting new state-of-the-art results on benchmarks while using up to 98% fewer context tokens.
- Memora decouples what is stored (rich memory content) from how it is retrieved (lightweight abstractions and cue anchors).
- Each memory entry consists of a primary abstraction (6–8 words) for embedding-based search and a memory value holding the full details.
- Cue anchors provide flexible, context-aware tags as alternative access paths to memories without rigid ontologies.
- The system outperforms Mem0, RAG, and full-context inference on LoCoMo and LongMemEval benchmarks.
This approach resolves the trade-off between preserving fine-grained detail and organizing memory efficiently, allowing agents to navigate their history without re-reading entire conversations.