All articles
media MarkTechPost · 10d ago

The 7 Types of Agent Memory: A Technical Guide

Large language models are stateless by default, requiring memory mechanisms to retain context across interactions. The seven types of agent memory—working, semantic, episodic, procedural, retrieval, parametric, and prospective—categorize memory by form and duration, enabling agents to plan, learn, and act over time. Each type serves distinct use cases, from storing user preferences to scheduling future goals, and together they form a comprehensive system for long-horizon, context-aware AI agents.

media Hugging Face Forums · 10d ago

Capability Is Not in the Weights: Empirical Negative Result on MLP Weight Projection

An empirical study found that projecting MLP weights from one transformer model into another fails to transfer semantic capability. Every tested variant performed worse than the unmodified host model, indicating a structural limitation in weight projection. The results challenge public claims about model capabilities based on benchmarks, showing such claims do not reflect actual internal weight geometry.

media Hugging Face Forums · 10d ago

The Clockwork Dark: A Local-First AI Narrative-RPG Engine

The Clockwork Dark is a local-first, AI-driven narrative-RPG engine that uses a deterministic state machine to resolve all game mechanics. It features two autonomous LLMs that narrate the story, with one acting as a patient world voice and the other as an unreliable, godlike assistant. The game offers players a choice: fight the encroaching supernatural corruption or embrace a quiet life in a bakery, with both paths considered valid endings.

media Hugging Face Forums · 10d ago

Small-scale debug comparison of OLMo-core with Engram graft

A 200-step training comparison between a base OLMo3 600M model and a version with a DeepSeek-style Engram graft shows lower training and evaluation loss, faster grad-norm stabilization, and improved early learning behavior. The Engram graft, injected into layers 1 and 5, increases trainable parameters to ~1.7B but maintains only a 40k increase in active parameters per token, indicating efficient memory usage.