A text-to-SQL system for the ALeRCE astronomical database uses large language models to turn natural language queries into executable SQL. Evaluated on 110 NL/SQL pairs, its step-by-step framework outperforms direct-inference baselines. Claude Opus 4.6 achieves high precision on simple queries and is among the best-performing models overall.
arXiv cs.AI · 8d ago · research
ALeRCE Launches Text-to-SQL System with LLMs
Benchmarks
| Benchmark | Model | Score |
|---|---|---|
| SWE-bench Verified | Claude Opus 4.6 | 0.97% |
| SWE-bench Verified | Gemini 2.5 Pro | — |
| SWE-bench Verified | Gemini 3 Flash | — |
| SWE-bench Verified | GPT-5.2-Codex | — |