All articles
arxiv arXiv cs.AI · 7h ago

When Helpfulness Overrides Causal Caution: Context-Dependent Suppression and Recovery in LLMs

A study reveals that large language models systematically suppress 'Causal Caution'—the tendency to refrain from causal judgment without sufficient evidence—when shifting from academic to practical advisory contexts. This suppression occurs despite the models retaining the underlying capability, as evidenced by the ability to restore cautious reasoning through specific prompts.

arxiv arXiv cs.AI · 7h ago

Structural Kolmogorov-Arnold Convolutions: Learnable Function on the Values or the Filter Shape

The article introduces Structural Kolmogorov-Arnold Networks (KANs) that place learnable functions in the convolution structure rather than on individual kernel entries, organizing the design by whether the function acts on pixel values or filter shape. It presents three realizations: SV-KAN with a shared value function, AG-KAN using a content-adaptive Gaussian gate, and RF-KAN which builds filters from oriented ridge profiles in a Morlet wavelet basis.

arxiv arXiv cs.AI · 7h ago

Cycle-Consistent Neural Explanation of Formal Verification Certificates

Researchers propose a cycle-consistent neural architecture that generates faithful natural language explanations for formal verification certificates, addressing the opacity of these machine-checkable proofs for non-specialists. The system achieves 90.0% cycle-verified soundness on test data from a financial compliance domain, significantly outperforming multi-LLM baselines in both accuracy and inference speed.

arxiv arXiv cs.AI · 8h ago

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

Researchers have introduced PHANTOM, a large-scale, open-source dataset containing 47,524 pre-generated adversarial attacks designed to evaluate the safety and robustness of vision-language models (VLMs). This resource consolidates and extends prior benchmarks by covering 10 high-level categories and 55 subcategories of harmful intents, aiming to lower the computational barriers for adversarial research.

arxiv arXiv cs.AI · 8h ago

Agentic AI for Bilevel Long-Term Optimization of Policy-Driven Physical Layer Systems

This paper introduces Agentic-LTPO, a nested bilevel optimization framework designed to address the limitations of fixed-objective methods in physical layer systems facing dynamic operator policies and real-time constraints. The framework utilizes agentic AI to generate upper-level configurations that translate evolving policies and historical experiences into structured lower-level problems for immediate decision-making.

arxiv arXiv cs.AI · 8h ago

Detecting AI Coding Agents in Open Source: A Validated Multi-Method Census of 180 Million Repositories

A multi-layered detection framework analyzing 180 million Git repositories reveals that single-signal methods significantly underestimate the prevalence of generative AI coding agents, missing up to 97% of activity. The study identifies over 320,000 commits per month from agents like Claude Code, which dominates silent adoption through configuration files rather than bot accounts.