Research paper — korshunov.ai

Research paper Page 1 / 19

Attention Sinks and Collapse Are Universal Consequences of Content-Based Routing

The study demonstrates that attention sinks, representation collapse, and norm stratification are not unique to transformer architectures but are inherent consequences of content-based routing under a fixed similarity metric. It establishes an identity showing softmax attention functions as Boltzmann-weighted aggregation over Euclidean distances with constant key norms, rendering it blind to key magnitude due to the omission of a specific norm term. This framework predicts that any router utilizing a metric ill-matched to its representations will compensate by concentrating routing and collapsing the routed representations. The authors validate this hypothesis across diverse models including nine pretrained transformers, graph attention networks, selective state-space models, recurrent mixers, and learned residual layers. Experimental results confirm that all tested architectures exhibit this identical signature of collapse regardless of their specific domain or structure. Furthermore, within-model ablations isolate the routing mechanism as the primary cause rather than incidental training dynamics. The onset of this phenomenon is shown to be contingent on the strength of the positional brake accompanying the content score, which can shift the effect across its range. However, the underlying mechanism remains invariant and does not require norm stratification, as routers with norm-normalized keys exhibit the same concentration behavior.

arxiv arXiv cs.LG · 6h ago

No Reference-Free Generalization in Quantum Machine Learning

This study addresses the identifiability problem in quantum machine learning where training data lacks a preferred basis or reference frame. The authors formulate supervised learning without an external quantum reference frame, requiring classifiers to preserve unitary symmetries unbroken by the training data. They prove that if training states do not span the full Hilbert space, all pure states orthogonal to this span receive identical predictions. This limitation arises from missing reference information rather than state discrimination or computational constraints. The research establishes a robust version under weak symmetry breaking and shows that learning generic concepts requires exponentially many oriented training directions. Numerical illustrations visualize the resulting prediction collapse and its controlled relaxation. The results identify feature maps, measurement bases, and diverse training states as essential operational resources for generalization.

arxiv arXiv cs.LG · 6h ago

Null-Calibrated Conformal Selection via Target-Membership Scores

The article introduces Null-Calibrated Conformal Selection (NCCS), a method that utilizes target-membership probability scores to identify test candidates within a target region while controlling the false discovery rate. The authors argue that these membership scores provide a more natural ranking for selection tasks than conventional prediction-oriented nonconformity scores, particularly for complex targets. This distinction is critical for interval-valued, variance-driven, multimodal, or multi-condition targets where traditional scores may be misaligned with selection power. NCCS ranks test scores against confirmed non-target calibration examples to yield finite-sample valid null p-values under null exchangeability. These p-values can be combined with the Benjamini-Yekutieli procedure under arbitrary dependence or the Benjamini-Hochberg procedure under standard positive-dependence conditions. Experiments demonstrate that membership scores match conventional scores on mean-monotone targets but substantially improve performance on variance-driven targets. In rare-target regimes, NCCS trades power for finite-sample null validity, addressing issues where direct empirical-FDP thresholding can be anti-conservative.

arxiv arXiv cs.LG · 7h ago

Flow Annealing Posterior Sampling for Function-Space Regression and Inverse Problems

The authors introduce Flow Annealing Posterior Sampling (FAPS), a novel framework that unifies stochastic-process regression with PDE inverse problems in function space. Built upon pretrained function-space flow-matching priors, FAPS facilitates likelihood-guided posterior inference using sparse and noisy observations. The method supports variable query discretizations and avoids the need for explicit prior-density evaluation during sampling. It employs a Langevin correction mechanism that utilizes a low-rank covariance preconditioner to exploit dominant function-space correlations across different discretizations. Benchmarks on both Gaussian and non-Gaussian stochastic processes demonstrate that FAPS produces coherent posterior samples with accurate uncertainty quantification. The approach significantly outperforms existing functional regression baselines in these standard tasks. Furthermore, it achieves competitive or superior performance in noisy PDE inverse problems compared to diffusion-based samplers while reducing test-time sampling costs.

media r/LocalLLaMA · 7h ago

Backtrack Sampler and Verifier Drastically Improve Tiny Model Coding Performance

A new backtrack sampler combined with a verifier model significantly enhances the coding performance of tiny 0.5B parameter models, potentially making them competitive with larger 2-4B class models without weight changes. The approach theoretically addresses hallucination issues in large models by correcting errors during generation through re-sampling. However, this method incurs a 5-30% decode speed penalty due to the need for backward passes and requires training a verifier model of similar size to the original. This requirement doubles VRAM usage and increases compute demands by 1.5 to 3 times compared to standard inference. Despite these costs, the verifier generalizes across models of equal or lower weight classes if trained on diverse data distributions. Training the verifier is highly efficient, requiring only approximately 0.01% of the token size used for full pre-training.

arxiv arXiv cs.CL · 8h ago

Weave of Formal Thought: Uniting Rigorous Syntactic Validation with Learned Structural Representations

The authors introduce Weave of Formal Thought (WoFT), a paradigm combining rigorous syntactic validation with learned structural representations for code generation. The approach utilizes a formal engine and constrained decoder that is sound and complete regarding the full Tree-sitter specification. By augmenting generalized LR parsing with speculative lexing, the system maintains concurrent lexer-state hypotheses to admit valid program prefixes while rejecting invalid ones. Additionally, WoFT employs latent-variable fine-tuning to train models to interleave non-terminal grammar symbols directly into the generation process. This method uses the reweighted wake-sleep algorithm to optimize the importance-weighted evidence lower bound of the surface text. The model learns to selectively retain formal derivations as an adaptive structural scratchpad during inference. Experiments on Python show that fine-tuning StarCoder2-3B with this objective reduces per-token cross-entropy by 14.3% compared to a text-only baseline.

lab OpenAI News · 9h ago

OpenAI Research Shows AI Agents Transforming Work

A new research paper from OpenAI demonstrates how artificial intelligence agents are fundamentally changing the nature of work. The study highlights the capability of these agents to execute longer and more complex tasks than previously possible. This technological advancement is credited with expanding productivity across a wide variety of professional roles. The findings suggest a significant shift in how labor is organized and performed through automation. By handling intricate workflows, AI agents are enabling users to achieve greater efficiency. The paper serves as evidence of the growing impact of autonomous systems on modern employment.

arxiv arXiv cs.CL · 9h ago

Tatoxa: A Novel Text Detoxification System for Low-Resource Tatar

The paper introduces Tatoxa, a state-of-the-art system designed for automated text detoxification in the low-resource language of Tatar. This work addresses the lack of research attention given to abusive content mitigation in languages with limited digital resources. The authors present a new dataset specifically created for fine-tuning and evaluating detoxification models within these constrained settings. Comparative experiments demonstrate that Tatoxa outperforms both existing open-source and proprietary commercial large language models on key quality metrics. Furthermore, the study investigates cross-lingual transfer capabilities to assess the viability of using data from other languages. Results indicate that training on native Tatar data is significantly more effective than transferring knowledge from culturally close languages like Russian. Even when a large Russian corpus is available, cross-lingual approaches perform worse than models trained exclusively on native Tatar text.

arxiv arXiv cs.CL · 9h ago

Detect, Unlearn, Restore: Defending Text Summarization Models Against Data Poisoning

The study addresses the threat of training-time data poisoning during fine-tuning for abstractive text summarization models. Adversaries manipulate small task-specific datasets to induce persistent summarization failures while maintaining standard evaluation metrics. A unified post-hoc defense framework is proposed to detect and remediate poisoning across the machine learning supply chain. In white-box settings, detection relies on influence-function analysis identifying abnormally high training influence in poisoned pairs. Black-box defenses utilize behavioral auditing based on increased sensitivity to semantics-preserving perturbations. The authors introduce novel attacks targeting factual distortion and representational bias that evade conventional alarms. Experiments across nine architectures and six benchmarks show 85-92% detection precision for the proposed defenses. Gradient-ascent unlearning restores up to 96% of original behavior with less than 0.6% ROUGE degradation.

arxiv arXiv cs.CL · 10h ago

Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining

A study identifies 'natural ungrokking,' a phenomenon where small language models lose learned grammatical rules midway through pretraining despite the evidence remaining in the data. Researchers observed that a model learning pronoun-gender agreement with Sue collapsed from 0.94 accuracy to near zero by step 3,500 without any corresponding spike in the loss curve. The survival of these rules is determined by support frequency within the training stream, while the data-to-parameter ratio only modulates the depth of the collapse. This emergence-then-collapse dynamic was replicated across multiple corpora, budgets, and seeds, and confirmed in public Pythia checkpoints where collapse depth correlated with model scale. The forgetting process acts as a displacement mechanism where a competing surface pattern out-competes the rule, causing the log-probability margin to cross zero within 100 steps of behavioral failure. Control over this fate is asymmetric; while injecting counter-evidence can destroy rules via a monotone dose-response, restoring support even at 450 times the sustaining level fails to recover them.

arxiv arXiv cs.CL · 10h ago

Keyword Lexicon Blindness Distorts Rhetorical Stance Measurement

A study analyzing 85 interviews with four public intellectuals reveals that keyword-based scoring can produce statistical artifacts regarding rhetorical stance. Initial analysis showed a robust negative-affect and emphatic-certainty co-occurrence pattern with high correlation coefficients ranging from r = 0.72 to 0.93. However, replacing this method with LLM-based zero-shot semantic classification on the full diarized corpus of 32,625 sentences significantly reduced these correlations. For instance, Dalio's correlation dropped from 0.851 to 0.206, while other speakers exhibited negative or null relationships between negativity and certainty. In contrast, the LLM analysis revealed a strong coupling between negative sentiment and hedging language, aligning with conventional expectations of pessimistic discourse. The discrepancy stems from three structural failures in keyword lexicons: syntactic blindness, polysemy blindness, and categorical absence. These flaws can invert semantic meaning, such as scoring 'never absolutely totally confident' as high certainty. The authors argue that keyword counts measure lexical co-occurrence tendencies rather than epistemic certainty, constituting a category error.

arxiv arXiv cs.CL · 10h ago

Structuring an Arabic-English Machine-Readable Dictionary Using Parsing Expression Grammars

This paper presents a method for structuring a machine-readable version of the Arabic-English Al-Mawrid dictionary, addressing the lack of standardization in printed formats. The approach converts unstructured streams of words and punctuation into explicit hierarchical structures that define entry components such as subentries, domain labels, and translation equivalences. Parsing serves as the central step within a cascaded design, implemented using the parsing expression grammars formalism. This technique allows for the automatic or semi-automatic organization of dictionary entries despite the absence of microstructure standardization in Arabic dictionaries. The study demonstrates that inducing microstructure enables plausible accuracy in structuring these complex lexical resources. By transforming raw text into defined formats, the work supports downstream natural language processing applications requiring machine-readable lexical data.

arxiv arXiv cs.CL · 11h ago

Automatic Generation of Highlights for Academic Paper Via Prompt-based Learning

This study investigates prompt-based learning for the automatic generation of academic paper highlights to address the lack of labeled training data in existing supervised methods. The researchers designed task-specific prompt templates combined with paper abstracts as inputs for several language models, including locally deployed GPT-2 and T5, as well as ChatGPT accessed via API. Experiments conducted on three datasets demonstrated that ChatGPT with prompt templates achieved performance comparable to previous supervised methods without requiring task-specific training samples. When a small number of examples were added to the prompts, the model significantly outperformed state-of-the-art methods on two of the datasets. The analysis revealed that while ChatGPT possesses strong language modeling capabilities, its performance is highly sensitive to the specific information provided within the prompt. Case studies indicated that the generated highlights are generally coherent, informative, and closely resemble those written by authors. This approach does not rely on domain-specific training corpora, supporting downstream text mining and bibliometric research for papers lacking existing highlights.

arxiv arXiv cs.CL · 11h ago

Measuring Research Difficulty in NLP: An Inverted U-Shaped Relationship with Academic Impact

This study proposes a comprehensive evaluation system for measuring the difficulty of academic research, focusing on Natural Language Processing as a case study. The authors extract internal and external features from papers, including collaboration, content, and references, to compute multiple difficulty indicators. These indicators are weighted using the entropy weight method and summed to generate a final research difficulty score. Academic impact is quantified by citation frequency, while expert assessments validate the reliability of the measurement approach. Empirical results indicate that page count, reference count, and high-level institutional participation significantly correlate with academic impact. Crucially, the analysis reveals an inverted U-shaped relationship between research difficulty and impact. This suggests that moderately difficult research tends to achieve the highest level of academic influence.

arxiv arXiv cs.CL · 11h ago

Data-Driven Evolution of Library and Information Science Research Methods (1990-2022)

This study analyzes the influence of data-centric research on Library and Information Science by examining methodological evolution from 1990 to 2022. Researchers automatically extracted four key categories of data-driven entities from academic papers: algorithms and models, data resources, software and tools, and metrics. The analysis evaluates trends across three dimensions, including temporal characteristics, topic-specific evolution, and cross-method features. Findings identify data resources as the primary driver of methodological changes within the discipline. The research reveals a cyclical pattern characterized by emergence followed by stability or practical application in LIS methods. This perspective highlights how big data advancements have reshaped the field's technical landscape over three decades.

arxiv arXiv cs.CL · 11h ago

iLLaDA: An 8B Masked Diffusion Language Model with Fully Bidirectional Attention

The authors introduce iLLaDA, an 8B parameter masked diffusion language model trained from scratch using fully bidirectional attention. This approach contrasts with the predominant autoregressive factorization and causal attention used in modern large language models. The model's pre-training scaled to 12 trillion tokens, followed by supervised fine-tuning on a 25 billion-token instruction corpus for 12 epochs. iLLaDA maintains the masked diffusion objective throughout both training phases and employs variable-length generation for efficiency. It also introduces confidence-based scoring to enhance performance on multiple-choice evaluation tasks. Benchmark results show significant improvements over its predecessor, LLaDA, including gains of 21.6 points on BBH and 14.9 points on ARC-Challenge for the base model. The instruction-tuned variant achieved increases of 14.5 points on MATH and 16.5 points on HumanEval. Despite its non-autoregressive nature, iLLaDA remains competitive with Qwen2.5 7B across several metrics.

arxiv arXiv cs.CL · 11h ago

Local Branch Routing: Efficient Trainable Test-Time Scaling for Language Models

The authors introduce Local Branch Routing (LBR), a token-level framework designed to improve language model reasoning through efficient test-time scaling. LBR expands a small local lookahead tree and forwards all sampled branches through the model, using a lightweight router to select the depth-1 subtree for commitment. This approach allows each token decision to utilize evidence from candidate local futures without incurring the computational costs of full solution-level search. The method employs a prune-shift-grow decoding process that preserves discrete branch identities and defines a tractable tree-trajectory likelihood. Consequently, LBR enables end-to-end reinforcement learning with verifiable rewards, jointly optimizing the base model and router under the same likelihood-ratio principle as discrete-token RLVR. Experimental results on synthetic hierarchical-planning tasks demonstrate that post-candidate hidden states provide useful routing evidence. Furthermore, benchmarks in mathematical reasoning show that LBR improves both Pass@1 and Pass@32 metrics compared to discrete chain-of-thought and other baselines.

arxiv arXiv cs.CL · 11h ago

Neural Machine Translation for Low-Resource Tangkhul-English

This study addresses low-resource machine translation for the Tangkhul-English language pair, focusing on a severely under-resourced Tibeto-Burman language with minimal prior NLP infrastructure. The authors present two systems: a primary model based on ByT5-large and a contrastive system using mT5-small, both fine-tuned on 38,336 parallel sentence pairs. Evaluation on a held-out test set of 3,856 sentences shows the ByT5-large system achieving a corpus BLEU score of 39.97 and a chrF++ score of 58.07. Additional metrics include a BERTScore F1 of 0.8104 and a COMET score of 0.7302 using the wmt22-comet-da model. The research highlights orthographic challenges related to Tangkhul's Latin-script diacritics as a specific technical hurdle. Furthermore, the training corpus exhibits domain bias, consisting primarily of biblical texts, stories, and conversational data. Future work aims to improve performance through data diversification and domain adaptation strategies.

arxiv arXiv cs.CL · 12h ago

Computational Stylometry of English Pali Canon Translations Across Pitakas

This study presents a computational stylometric analysis of the Tipitaka across all three Pitakas in English translation, extending previous work on the Sutta Pitaka. The corpus comprises 134,831 segments from Bhikkhu Sujato's Sutta Pitaka, Bhikkhu Brahmali's Vinaya Pitaka, I.B. Horner's 1938 Vinaya translation, three English translations of the Abhidhammattha Sangaha, and cross-tradition Vinaya texts. The authors compute Zipf rank-frequency distributions, MATTR-500 lexical diversity, numeral-word density, and vocabulary overlap metrics. Main findings indicate that all corpora show Zipf-consistent distributions with R-squared values above 0.989. The Sutta and Theravada Vinaya exhibit nearly identical lexical diversity scores of 0.399 and 0.400, while the Sangaha corpus is more diverse at 0.560. The Sangaha corpus also displays the highest numeral-word density at 3.26%, reflecting its systematic enumeration of categories. Additionally, the Mulasarvastivada Vinaya shares significant vocabulary overlap with the Theravada Vinaya, whereas two English translations of the same source share only 24.2% of their vocabulary.

arxiv arXiv cs.CL · 12h ago

Story Operators: Decomposing the Original to Sequel Transformation in Embedding Space

This study models literary transformations as geometric operations within a sentence-embedding space using all-mpnet-base-v2 vectors from the PG19 corpus. By calculating displacement vectors between original novels and their sequels, the author decomposes these changes along a content basis derived via PCA. Analysis of thirteen verified author pairs reveals a taxonomy of sequel types: formulaic, concentrated, and compositional. Formulaic transformations involve minimal rank changes, such as Doyle's Holmes collections with a norm of 0.12. Concentrated shifts are dominated by a single axis, exemplified by Alcott's Little Women to Little Men where 75% of the change occurs on one move. Compositional transformations involve many small axes, seen in works by Twain, Burroughs, and Nesbit. For Tom Sawyer to Huckleberry Finn, the dominant axis is structural, reflecting a shift from domesticity to picaresque adventure rather than surface themes like vernacular voice. The geometric findings are corroborated against Mark Twain's documented authorial intent in letters to Howells.