Abstractions of Queries in Ontology-Based Data Access
This article addresses query abstraction in ontology-based data access (OBDA) by translating data queries to the ontology layer using existential rules and certain answer semantics.
This article addresses query abstraction in ontology-based data access (OBDA) by translating data queries to the ontology layer using existential rules and certain answer semantics.
This paper investigates the challenges of Competency Question (CQ) verification, a process where ontologies are evaluated against natural language questions to ensure proper modeling. The authors analyze why CQs become difficult and how an LLM assistant can support users during this evaluation.
This paper introduces a categorical account of infinitesimal causality in Frobenius Markov categories equipped with tangent-bundle semantics. It defines causal sufficiency through the compatibility of two distinct Frobenius structures: one encoding classical variable operations and another representing geometric integrability.
The authors introduce Themis, an XAI-enabled testing and evaluation framework that combines transparency through explainability with alignment via human feedback for safe Reinforcement Learning systems.
The authors propose a multi-agent framework that sanitizes retrieved content in Retrieval-Augmented Generation (RAG) systems through semantic rewriting to prevent privacy leakage from malicious prompts. By employing three specialized agents for privacy extraction, semantic analysis, and reconstruction, the approach removes sensitive identifiers while preserving the core meaning of the text.
The article introduces SAFARI, a framework designed to diagnose failures in autonomous agents by replacing linear context loading with a tool-augmented diagnostic loop. This approach decouples diagnostic accuracy from architectural context limits by using specialized tools and short-term memory to analyze trajectory segments.
This article examines how intentional, pluralistic design choices in AI-enabled digital platforms can produce visualizations that emphasize nuance and intergroup commonalities, thereby reducing political polarization. It highlights a specific deliberative technology initiative that maps high-dimensional opinion spaces to reveal areas of both consensus and dissensus among diverse populations.
JetBrains has open-sourced the Mellum2 models, a series of 12B-2.5A LLMs trained from scratch to target fast inference on H100/H200 hardware as well as local deployments.
Researchers propose CineCap, a framework that combines structured reasoning with spatio-temporal anchors and reinforcement learning to improve cinematographic video captioning. The method grounds professional film-language descriptions in explicit visual evidence while balancing descriptive completeness and factual correctness.
Anthropic has launched Claude Tag, a new workflow feature that allows teams to delegate work to Claude asynchronously within Slack. Positioned as a shift from one-user chat to teamwide collaboration, the tool enables Claude to join as a team member with access to selected channels, tools, and codebases.
Power consumption represents 40% of the operating expenses for running an AI factory, with performance per watt becoming a critical efficiency metric that directly impacts token costs.
A developer shares their experience of creating a centralized web access layer to manage interactions between local AI models and external services. This approach addresses the maintenance burden of building individual integrations for every new agent project.
Red Hat and NASA researchers are developing the Crew Medical Officer Digital Assistant (CMO-DA), a medical AI system that runs large language models on local hardware with zero cloud dependency. This initiative addresses the impracticality of Earth-based telehealth for astronauts on Moon or Mars missions due to light delay and communication blackouts.
A user successfully configured an NVIDIA H200 NVL GPU on a workstation built with ASUS WRX90E-SAGE SE motherboard and a 64-core Threadripper processor, demonstrating that high-end AI accelerators can run on non-server hardware.
A user tested the 4-bit version of GLM-5.2 (GLM-5.2-UD-Q4_K_XL) on a server equipped with an Epyc Rome 7452 processor and 512GB of RAM. The model was evaluated using a complex coding prompt requiring the creation of a self-contained 3D arena game in HTML, CSS, and JavaScript.
A developer with over 25 years of experience in web technologies is transitioning into AI engineering to move beyond using tools and understand how to build with them.
A user reports that their private Hugging Face Space, specifically 'Ark-kun/tangent', stopped working abruptly and cannot be restarted. Attempts to restart or perform a factory rebuild both fail with a "503. Something went wrong when restarting this Space" error.
NVIDIA introduces DFlash speculative decoding to significantly boost inference performance on its Blackwell architecture, addressing the latency challenges inherent in autoregressive LLMs.
NVIDIA introduces the BioNeMo Agent Toolkit to facilitate the creation of AI scientists capable of reading papers, writing code, and generating hypotheses for life science discovery.
Telecom operators are adopting AI across network operations, customer care, and back-office workflows, but most remain early in their journey toward full autonomy. Current automation efforts typically operate at Level 2–3 of TM Forum’s taxonomy, focusing on streamlining predefined solutions within selective domains.