Reasoning models — korshunov.ai

Reasoning models Page 1 / 35

Universal Encoders for Modular Relational Deep Learning

The paper proposes a modular relational deep learning approach that decouples row encoding from graph message-passing. It introduces a transformer-based Universal Row Encoder that uses schema metadata to generate invariant row embeddings, enabling better generalization across databases and improving convergence on RelBench benchmarks.

arxiv arXiv cs.LG · 23h ago

BIPC Framework Accelerates Mixed-Integer Optimization with Machine Learning

The BIPC framework reduces solution time for large-scale mixed-integer programs by identifying a backdoor subset of variables that drive computational complexity. Using supervised learning, it predicts backdoor variable values and intervals, then solves a reduced problem with these predictions, achieving significant speedups with minimal quality loss. This enables rapid, high-quality solutions under parameter perturbations in real-world systems like power and supply chains.

arxiv arXiv cs.LG · 23h ago

Post-Training Speech Enhancement with Perceptual Rewards

A new post-training method uses multi-metric perceptual rewards to optimize speech enhancement models. It directly applies non-differentiable metrics like DNSMOS, WER, and UTMOS as rewards via Group Sequence Policy Optimization, achieving state-of-the-art results on DNS2020. Human evaluation confirms that combining multiple metrics outperforms single-metric approaches, reducing reward hacking.

arxiv arXiv cs.LG · 1d ago

Decoupling Declarative and Procedural Knowledge in Vision-Language-Action Models

w$^{2}$VLA introduces a modular approach that decouples declarative and procedural knowledge in Vision-Language-Action models. By restructuring information flow, it enables robust behavior cloning and unprecedented zero-shot skill transfer across unseen, dissimilar objects.

media r/LocalLLaMA · 1d ago

llama-server crashes with 'pasted as file' prompt for image data extraction

llama-server crashes when a user pastes a long prompt as a text file alongside an image, treating it as a file attachment. The server works fine when the prompt is sent in smaller batches but fails when the full prompt is combined into one text block and submitted with the image.

Universal Encoders for Modular Relational Deep Learning

BIPC Framework Accelerates Mixed-Integer Optimization with Machine Learning

Post-Training Speech Enhancement with Perceptual Rewards

Decoupling Declarative and Procedural Knowledge in Vision-Language-Action Models

llama-server crashes with 'pasted as file' prompt for image data extraction

Studying User Mental Models in Speech Translation

Task Decomposition for Efficient Annotation

Posterior Refinement: Fast Language Generation via Any-Order Flow Maps

Match Task to Objective Framework for Encoder-Decoder Models

MedBench v5: Dynamic Benchmark for Clinical AI

Aspect-Based Sentiment Evolution in Multi-Round Peer Reviews

CALIBER: Calibrating Confidence Before and After Reasoning in Language Models

Bad Prompts Cause Model Collapse and Mistakes

Transformer Models: Architectures, Applications, and Critical Assessment

Age of LLM: Benchmark for LLM Reasoning and Diplomacy

ExtractConf: Multi-Signal Confidence Engine for LLM Document Extraction

Bayesian Control for Coding Agents

RaDaR: AI Model Improves Rare Disease Diagnosis

Cross-Lingual Exploration for Parametric Knowledge

Qwen-AgentWorld: Language World Models for General Agents