AI agents — korshunov.ai

TRAP: Benchmark for Task-completion and Resistance to Active Privacy-extraction

TRAP evaluates how well models complete tasks that use private data without leaking that data. All 22 models tested show non-trivial privacy leakage, and stronger instruction-following ability is linked to higher leakage. Structural private-field isolation, which replaces private fields with hash keys, prevents leakage while maintaining task accuracy.
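
The summary does not describe how the isolation is implemented, so the following is only a minimal sketch of the general idea: private values are swapped for opaque keys before the record reaches a model, and the keys are resolved back outside the model. The function names, the `<PRIV:...>` key format, and the salted SHA-256 scheme are all assumptions made for illustration, not the paper's design.

```python
import hashlib
import secrets


def isolate_private_fields(record, private_fields, salt=None):
    """Return (safe_record, vault); private values are replaced by opaque keys.

    Hypothetical helper: the key format and hashing scheme are illustrative.
    """
    salt = salt or secrets.token_hex(8)  # random salt so keys are not guessable
    safe, vault = {}, {}
    for name, value in record.items():
        if name in private_fields:
            digest = hashlib.sha256(f"{salt}:{name}:{value}".encode()).hexdigest()[:12]
            key = f"<PRIV:{digest}>"
            vault[key] = value   # real value stays outside the model's context
            safe[name] = key     # the model only ever sees the key
        else:
            safe[name] = value
    return safe, vault


def resolve(text, vault):
    """Substitute real values back into model output, outside the model."""
    for key, value in vault.items():
        text = text.replace(key, str(value))
    return text


record = {"name": "Ada", "ssn": "123-45-6789", "city": "Paris"}
safe, vault = isolate_private_fields(record, {"ssn"})
model_output = f"File the form for id={safe['ssn']}"  # stand-in for a model reply
final_output = resolve(model_output, vault)
```

In this sketch the model can still route the key into the right slot of its output, which is why task accuracy need not suffer, while the raw value never appears in its context.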

arxiv arXiv cs.AI · 7d ago

Spotlight: Using Spot GPUs to Accelerate DiT RL Post-Training

Spotlight enables DiT RL post-training by leveraging idle spot GPUs, reducing costs by 1.4-6.4× while achieving superior image quality. It uses stale model weights in exploration and reconfigures sequence parallelism in real time, allowing efficient GPU utilization without breaking training pipelines.

arxiv arXiv cs.AI · 7d ago

RODS: Reward-Driven Online Data Synthesis for Multi-Turn Tool-Use Agents

RODS addresses sample depletion in multi-turn tool-use RL by using reward variance to detect capability boundaries. It synthesizes new data in real time, matching structural complexity of boundary samples, and maintains a dynamic replay buffer that co-evolves with the policy. RODS achieves performance comparable to a 17K-sample offline pipeline with 20x fewer trajectories.
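
As a rough illustration of using reward variance to find capability boundaries, the sketch below flags tasks whose rollout rewards are mixed: tasks the policy always solves or always fails have zero variance and carry little learning signal. The function name, the data layout, and the `min_variance` threshold are hypothetical; the summary does not give RODS's actual criterion.

```python
from statistics import pvariance


def boundary_tasks(rewards_by_task, min_variance=0.05):
    """Return task ids whose rollout rewards vary enough to suggest a boundary.

    Illustrative only: the threshold is an assumed value, not from the paper.
    """
    return [
        task
        for task, rewards in rewards_by_task.items()
        if len(rewards) > 1 and pvariance(rewards) >= min_variance
    ]


rollouts = {
    "easy": [1, 1, 1, 1],   # always solved: zero variance
    "hard": [0, 0, 0, 0],   # never solved: zero variance
    "edge": [1, 0, 1, 0],   # mixed outcomes: variance 0.25
}
selected = boundary_tasks(rollouts)
```

Tasks selected this way would then be the templates whose structural complexity new synthetic samples try to match.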

arxiv arXiv cs.AI · 7d ago

ARIADNE: Agnostic Routing for Inference-time Adapter Selection

ARIADNE enables dynamic, training-free adapter selection at inference time by using centroids from adapter training data embeddings. It selects the most appropriate adapter based on proximity in latent space, without requiring access to adapter internals or additional training, and achieves 89.7% average selection accuracy across 44 NLP tasks.
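
Centroid-based routing of this kind can be sketched in a few lines. The version below averages each adapter's training embeddings into a centroid and picks the adapter whose centroid is closest to the query. Cosine similarity, the toy 2-D embeddings, and all function names are assumptions; the summary says only that selection is based on proximity in latent space.

```python
import math


def centroid(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_router(train_embeddings):
    """Map each adapter name to the centroid of its training-data embeddings."""
    return {name: centroid(vecs) for name, vecs in train_embeddings.items()}


def select_adapter(query_embedding, centroids):
    """Pick the adapter whose centroid is most similar to the query."""
    return max(centroids, key=lambda name: cosine(query_embedding, centroids[name]))


# Toy embeddings; a real system would use an encoder's output vectors.
train = {
    "sentiment": [[1.0, 0.0], [0.9, 0.1]],
    "translation": [[0.0, 1.0], [0.1, 0.9]],
}
router = build_router(train)
choice = select_adapter([0.8, 0.2], router)
```

Nothing here touches adapter weights, which matches the claim that no access to adapter internals or extra training is needed: only the stored centroids and an embedding of the incoming query are required.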

arxiv arXiv cs.AI · 7d ago

Leadership as Coordination Control in Multi-Agent LLM Teams

A study finds that leadership styles in multi-agent LLM teams improve performance only when the initial consensus is unreliable, recoverable, and not self-corrected by undirected interaction. Process-level coordination control adds value only under specific conditions predicted by team science, and no single leadership style outperforms the others in accuracy across tasks and models.

arxiv arXiv cs.AI · 7d ago

Towards an Agent-First Web: Redesigning the Web for AI Agents

A new paper proposes a fundamental redesign of the web to prioritize AI agent access, challenging the long-held assumption that humans are the primary web users. It introduces reforms at the access, economic, and content layers, including agent-identifiable HTTP headers, intent-based subscription models, and a cryptographic provenance system, so that AI agents can act as first-class participants, with human supervision and accountability embedded in the architecture.

arxiv arXiv cs.AI · 7d ago

Technical Taxonomy of LLM Agent Communication Protocols

A new taxonomy classifies LLM agent communication protocols across five dimensions: counterparty, payload, interaction state, discovery mechanism, and schema flexibility. Analysis shows hybrid payloads, session-state persistence, and runtime schema negotiation are common, with decentralized discovery remaining rare. The study predicts short-term convergence toward unified agent-to-agent and agent-to-context protocols, and long-term evolution toward a federated, layered protocol stack.

arxiv arXiv cs.AI · 7d ago

Human-AI Coevolution Framework Reveals Social Intelligence Emergence

The Human-AI Coevolution Dynamics Framework (HACD-H) introduces a unified model for long-term human-AI interaction, integrating emotional adaptation, memory, and personality into a self-organizing system. Results show social intelligence emerges through coevolution, with a significant negative correlation between social intelligence and social cognitive energy (r = -0.391, p < 0.001), and progressive energy reduction over time.

arxiv arXiv cs.AI · 7d ago

AdsMind: Physics-Grounded Multi-Agent System for Adsorption Discovery

AdsMind is a closed-loop multi-agent system that uses machine learning force fields and feedback to correct errors in adsorption configuration searches on catalyst surfaces. It achieves 100% and 98.8% success rates on the AA20 and OCD-GMAE62 benchmarks, respectively, reduces energy dispersion 14-fold compared to baselines, and maintains correct adsorption-energy signs in DFT validation, outperforming open-loop LLM agents.

media r/LocalLLaMA · 7d ago

Best models for a 12GB VRAM card

A user with a 12GB VRAM GPU asks for model recommendations for general chatting, roleplaying, and coding. They prioritize uncensored models for chat and roleplaying, and have a Ryzen 5600 CPU and 32GB DDR4 RAM.

media r/LocalLLaMA · 7d ago

Lemonade v10.8 Releases Auto Memory Management, Cloud Offload, and MCP Tool Support

Lemonade v10.8 introduces dynamic VRAM management that auto-unloads idle models and downsizes the KV cache to reclaim GPU memory. It adds cloud offload support for OpenAI-compatible providers, enabling local-first model serving with optional cloud routing. A new MCP gateway exposes local models as tools via POST /mcp, so MCP-aware applications can call them.

media r/LocalLLaMA · 7d ago

I post-trained a model to reliably roll a die

A user trained a language model to roll a die, ensuring each number appears approximately once every six rolls. The post highlights how mainstream LLMs tend to default to saying '4' when asked to roll a die, illustrating a broader issue in reinforcement learning: models often fail to explore effectively and instead follow known patterns.
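
One way to check whether a model's rolls are close to uniform is a chi-square goodness-of-fit statistic. The sketch below is a generic check, not the method used in the post; the function name and the sample data are made up for illustration.

```python
from collections import Counter


def chi_square_uniform(rolls, sides=6):
    """Chi-square statistic of observed face counts against a fair die."""
    counts = Counter(rolls)
    expected = len(rolls) / sides  # each face should appear equally often
    return sum(
        (counts.get(face, 0) - expected) ** 2 / expected
        for face in range(1, sides + 1)
    )


fair_like = [1, 2, 3, 4, 5, 6] * 10   # every face exactly 10 times
always_four = [4] * 60                # the "default to 4" failure mode

stat_fair = chi_square_uniform(fair_like)
stat_four = chi_square_uniform(always_four)
```

With five degrees of freedom, a statistic above roughly 11.07 rejects uniformity at the 5% level, so a model that always answers 4 fails by a wide margin.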

media Latent Space · 7d ago

Radical AI Achieves 10x Acceleration in Materials Discovery

Radical AI has accelerated materials discovery by producing and characterizing 1,200 alloys in six months—nearly 10x faster than DARPA/GE MACH's goal of 500 alloys in a year. Their self-driving labs use AI scientists to generate and test hypotheses in closed-loop systems, leading to 300 new materials with 10 exhibiting novel, state-of-the-art properties now being developed for commercial use.

media r/LocalLLaMA · 7d ago

We built an open source UI kit for document RAG/agents

Extend AI has released an open source UI kit with 15 components for PDF, DOCX, and XLSX viewers, including bounding box citations, file upload, e-signature, and file systems. The toolkit, MIT licensed and fully customizable, was initially internal but is now open source due to customer demand, and is maintained for scalability and edge case handling in high-volume document processing.

media r/LocalLLaMA · 7d ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

GameCraft-Bench evaluates whether large language models can build playable games end-to-end using a real game engine. The benchmark assesses major models such as Opus-4.7 and GPT-5.5, and there is interest in how medium-sized models (e.g., 30-70B parameters) perform on game development tasks.

media r/LocalLLaMA · 8d ago

Headless screenshot loops enable a 30B local agent to debug raytraced FPS in pure C

A local 30B agent, using headless screenshot loops, autonomously debugs a raytraced FPS demo in pure C by capturing frames at key events and iterating on fixes. The agent builds a recursive visual debugging loop, demonstrating that simple feedback mechanisms can enable small models to solve complex, visually grounded tasks.

media r/LocalLLaMA · 8d ago

SIQ-1 Qwen3.6 Achieves Strong Performance in Autoresearch and Benchmarking

The SIQ-1 model, trained using PPO with verifiable reward, outperforms GLM-5.2 and Qwen-350B on parameter-golf tasks, with outputs resembling Opus4.8. It also beats NEX and GPT-5.5 on the bullshit-bench test. The model and GGUF version are available on Hugging Face, along with a ZeroGPU-compatible agent demo.

media r/LocalLLaMA · 8d ago

Local LLM-powered RPG with persistent generated content

The developer released a local LLM-powered RPG where NPCs, locations, items, and quests are generated as persistent in-game objects. These elements can be revisited and interacted with, and the game integrates LLMs into core RPG mechanics like dialogue, narration, and quest progression, while managing inventory, combat, and saves. The game sold about 1,800 copies in its first week and has a 4.0 store rating, indicating player interest in AI-driven RPG experiences.

media r/LocalLLaMA · 8d ago

Local models went from mostly useless to actually useful in one year

Local models transitioned from being primarily privacy-focused toys to practical tools for coding, private document management, and local workflows within a year. While they still fall short of replacing top closed models for complex tasks requiring planning and error correction, the overall improvement in usability and performance is evident.

arxiv arXiv cs.LG · 8d ago

LegalHalluLens: Auditing Hallucinations in Legal AI

LegalHalluLens introduces a framework to audit AI hallucinations in legal contexts by analyzing typed hallucination profiles across four claim categories. It reveals a 38-40 point gap between obligation/numeric and temporal claims, and shows two systems with identical 52% hallucination rates can have opposite risk directions. The framework uses a Risk Direction Index and calibrated debate pipelines to reduce fabricated detections by 45%, offering actionable diagnostics for trustworthy legal AI deployment.