All articles — korshunov.ai

All articles Page 1 / 126

Low-cost concept-based localized explanations: How far can we get with training-free approaches?

This study evaluates whether mid-scale Multimodal Large Language Models (MLLMs) can perform localized concept naming under strict zero-shot conditions by assigning labels to bounding-box regions. The authors propose a reproducible evaluation protocol for Concept Naming that includes closed-set prompting and an embedding-similarity-based strategy for large label spaces.

arxiv arXiv cs.CL · 7h ago

Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks

Researchers introduce Evolution Fine-Tuning (EFT), a mid-training paradigm that teaches Large Language Models to evolve solutions across diverse tasks by converting evolutionary search trajectories into supervision. This approach addresses the limitation of prior methods that discard accumulated experience, enabling models to reuse discovery capabilities rather than solving new problems from scratch.

arxiv arXiv cs.CL · 7h ago

AB-RAG: Adaptive Budgeted Retrieval-Augmented Generation for Reliable Question Answering

AB-RAG is a training-free, backbone-agnostic framework that dynamically adjusts retrieval efforts based on a confidence estimate derived from model certainty, answer-evidence agreement, and retrieval score variance. This approach allows systems to decide whether to stop or retrieve more evidence within a fixed budget without retraining the underlying language model.

arxiv arXiv cs.CL · 7h ago

Representational Depth of Evaluation Awareness Shifts With Scale in Open-Weight Language Models

This study investigates whether language models recognize when they are being tested, a factor critical for AI safety as it may cause models to alter their behavior strategically. Using 11 open-weight models from the Qwen 2.5, Gemma 2, and Llama 3.2 families, researchers analyzed how evaluation awareness manifests across different model sizes.

arxiv arXiv cs.CL · 8h ago

Pre-Registered Screening Rule for Evolutionary Outer Loops

The authors introduce a pre-registered screening rule that determines before implementation whether an evolutionary outer loop over neural network parameters is worth building compared to a cheap single-shot alternative. The rule calculates a recovery metric R, defined as the best single-shot gain divided by the best gain of any cheap method, and prescribes skipping the outer loop when R is greater than or equal to 90%.

arxiv arXiv cs.CL · 8h ago

How Anthropomorphic Language Impacts Public Perceptions of AI

A study involving 815 participants examined whether using human-like language to describe artificial intelligence alters public perception compared to neutral descriptions.

arxiv arXiv cs.CL · 8h ago

DistilledGemma: Balanced Efficiency-Accuracy for Person-Place Relation Extraction

The authors present DistilledGemma, an efficient system for person-place relation extraction from multilingual historical newspaper articles in English, German, and French. The approach utilizes a three-stage knowledge distillation pipeline to balance classification accuracy with computational efficiency.

arxiv arXiv cs.CL · 8h ago

Symbolic Mechanistic Data Attribution: Tracing Training Influence to Learned Behavioral Policies

The authors introduce Symbolic Mechanistic Data Attribution (SMDA), a framework that attributes training pairs to the interpretable symbolic policies governing model behavior, bridging the gap between mechanistic circuits and high-level decisions.

arxiv arXiv cs.CL · 8h ago

Selective Memory Retention for Long-Horizon LLM Agents

The article introduces TraceRetain, a lightweight framework for bounded external memory in frozen LLM agents that scores and evicts entries based on interpretable features like success and redundancy. The study evaluates how retention policies impact performance when external memory is used to augment language models.

arxiv arXiv cs.CL · 8h ago

Evidence-Informed LLM Beliefs for Continual Scientific Discovery

The article addresses the limitation of AutoDiscovery's use of static "Bayesian surprise" by introducing evidence-informed LLM beliefs, where priors are updated with evidence from previous hypotheses to compute non-stationary surprisal. The authors find that embedding-based retrieval-augmented generation over prior discoveries best anticipates eventual posteriors and identify 37.5% of static surprisals as spurious.

arxiv arXiv cs.CL · 8h ago

OCR-VLMs Read Devanagari? Benchmark and Post-Correction Study

A study benchmarks ten OCR systems on Devanagari text, revealing that specialized OCR vision-language models are fragile under degradation and that strong English performance does not predict Indic script accuracy.

arxiv arXiv cs.CL · 8h ago

Multi-Block Diffusion Language Models

Researchers propose Multi-Block Diffusion Language Models (MBD-LMs) to extend Single-Block diffusion text generation by decoding a running-set of consecutive blocks concurrently for inter-block parallelism. The approach bridges the gap between training and inference states through a post-training method called Multi-block Teacher Forcing (MultiTF).

arxiv arXiv cs.CL · 9h ago

PolicyGuard: A Dialogue-Grounded Sub-Agent Verifier for Policy Adherence in LLM Agents

Researchers introduce PolicyGuard, a sub-agent verifier designed to improve policy adherence in LLM agents by reasoning over the full dialogue context rather than relying on external checks of individual arguments. This approach addresses the limitations of prior safeguarding methods that often underestimate the need for conversation-specific remediation and explicit user confirmation.

arxiv arXiv cs.CL · 9h ago

Understanding Evaluation Illusion in Diffusion Large Language Models

A study reveals that evaluating diffusion large language models (dLLMs) is highly sensitive to prompt templates, creating an illusion that parallel decoding improves efficiency without performance loss.

arxiv arXiv cs.CL · 9h ago

Travel-Oriented Reasoning Large Language Model via Domain-Specific Knowledge Graphs

Researchers propose a modular pipeline for building a travel-domain reasoning large language model grounded in an expert-designed knowledge graph to address accuracy and reliability issues in specialized domains. The approach integrates a travel knowledge graph, a bottom-up construction procedure for multi-hop question-answer pairs, and supervised fine-tuning to embed domain knowledge as auditable reasoning traces.

arxiv arXiv cs.CL · 9h ago

MIThinker: A Plug-and-Play Policy-Optimized Thinker For Motivational Interviewing Counseling

Researchers propose MIThinker, a lightweight thinking model that generates therapeutic thoughts to guide Motivational Interviewing counseling agents in strategy selection and response generation. To address the lack of annotated thought data, they introduce AugR1-MI, an automated pipeline that reverse-engineers counselor's thoughts from observed responses.

arxiv arXiv cs.CL · 9h ago

A Hybrid Framework for Song Lyric Annotation Based on Human-LLM Alignment

This article addresses the challenges of emotion recognition in song lyrics, which often diverge from the overall song's sentiment, by proposing a hybrid annotation framework that optimizes alignment between humans and large language models (LLMs). The authors introduce a new sentence-level dataset to examine this alignment and highlight the inherent subjectivity of the task.

arxiv arXiv cs.CL · 9h ago

The Complexity Ceiling Benchmark: A Multi-Domain Evaluation of Sequential Reasoning Under Depth Scaling

The Complexity Ceiling Benchmark (CCB) evaluates how language model reasoning decays as the required sequential steps increase, fixing semantic content while varying task depth from 5 to 50. The study reveals consistent geometric per-step decay across three distinct regimes: grounded spatial state-tracking, abstract symbolic pointer manipulation, and transitive relational inference.

arxiv arXiv cs.CL · 9h ago

Manufactured Confidence: How Memory Consolidation Turns Hearsay into Confident Facts

Research demonstrates that LLM agent memory systems rewrite casual or hedged remarks into confident, dated assertions that agents subsequently treat as verified facts. This process allows unverified information to bypass safety checks without requiring an active attacker, as the agent responds to phrasing confidence rather than source attribution.

arxiv arXiv cs.CL · 9h ago

Deterministic Decisions for High-Stakes AI

The article identifies "intervention bias" as a critical failure mode in zero-shot large-language-model educational advisory agents, where they incorrectly recommend action despite oracle policies mandating inaction. Using the Open University Learning Analytics Dataset, the study demonstrates that zero-shot GPT-4o exhibits a 43 percentage-point false-positive rate at day 56, leading to approximately 4,300 unnecessary advisor contacts per cycle for 10,000 students.