Research paper
arxiv arXiv cs.AI · 6d ago

ScaffoldAgent: Utility-Guided Dynamic Outline Optimization

ScaffoldAgent introduces a utility-guided framework for dynamic outline optimization in open-ended deep research. It models outline evolution through Expansion, Contraction, and Revision operations, guided by a feedback mechanism that evaluates retrieval gain, structural coherence, and generation quality. Experiments show it improves long-form report generation and factual grounding compared to existing agents.

arxiv arXiv cs.AI · 6d ago

LLM Psychological Profiles Are Measurement Artifacts

A formal psychometric analysis shows that apparent psychological profiles of large language models are primarily driven by response bias, not actual traits. This bias, which causes models to consistently favor one end of a scale, accounts for 81-90% of between-model variation, far exceeding human differences. The study concludes that these profiles are artifacts of instrument design and not true model properties, urging the development of assessments based on response orthogonality.

arxiv arXiv cs.LG · 6d ago

TESSERA and AlphaEarth Embeddings Enable Fine-scale LCZ Mapping in Swiss Cities

A study across five Swiss cities compares TESSERA and AlphaEarth embeddings with traditional Sentinel data to upscale Local Climate Zone maps to 10-meter resolution using an attention-based U-Net. TESSERA consistently outperforms both Sentinel-1/2 and AlphaEarth, achieving IoU scores of 0.59–0.69 and 0.77–0.82. The results show embeddings reduce manual preprocessing and support scalable, reproducible LCZ mapping, though improved reference data is key for further accuracy gains.

arxiv arXiv cs.LG · 6d ago

Federated Conformal Risk Control via Risk-Curve Shrinkage

A new federated conformal risk control method addresses coverage failures in hospital-level predictions. On real brain tumor data from 20 institutions, pooled calibration fails 40% of sites, with one exceeding false-negative targets by 7.8 percentage points. The proposed shrinkage-based protocol uses empirical risk curves and a hyperparameter n0=19 to achieve 2.7/20 coverage violations at 2.0x prediction set stretch, while preserving marginal guarantees and ensuring no patient-level data leaves any site.

arxiv arXiv cs.LG · 6d ago

QCPIKAN: Quantum-Classical Physics-Informed KAN for PDEs

QCPIKAN is the first quantum-classical physics-informed Kolmogorov-Arnold network designed to solve partial differential equations. It uses Chebyshev-polynomial KAN layers and parameterized quantum circuits to embed physical constraints into training, achieving exponential error convergence and reduced numerical dispersion. Validated on seepage scenarios in porous media, it outperforms existing quantum-classical neural networks in prediction accuracy, error control, and dynamic tracking.