blog Simon Willison · 9d ago

sqlite-utils 4.0rc1 Adds Migrations and Nested Transactions

sqlite-utils 4.0rc1 introduces database migrations and db.atomic() for nested transactions. Migrations support script-based schema changes using a simplified API, while db.atomic() enables nested transactions via savepoints, improving error handling and data integrity. The release includes backwards-incompatible changes, such as updated upsert behavior and dropped Python 3.8 support, with options to maintain older behaviors.
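To illustrate how savepoints make nested transactions possible, here is a minimal sketch using only Python's standard `sqlite3` module. It is not the sqlite-utils implementation: the `atomic()` helper and the `items` table are hypothetical stand-ins, and the real `db.atomic()` may differ in its signature and behavior.

```python
import sqlite3
from contextlib import contextmanager
from itertools import count

_savepoint_ids = count()


@contextmanager
def atomic(conn):
    """Hypothetical helper: wrap a block in a uniquely named SAVEPOINT."""
    name = f"sp_{next(_savepoint_ids)}"
    conn.execute(f"SAVEPOINT {name}")
    try:
        yield
    except Exception:
        # Undo only the work done since this savepoint, then discard it.
        conn.execute(f"ROLLBACK TO {name}")
        conn.execute(f"RELEASE {name}")
        raise
    else:
        # Releasing the outermost savepoint commits the transaction.
        conn.execute(f"RELEASE {name}")


# isolation_level=None stops sqlite3 from opening transactions implicitly,
# so the SAVEPOINT statements above fully control transaction boundaries.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE items (name TEXT)")

with atomic(conn):
    conn.execute("INSERT INTO items VALUES ('outer')")
    try:
        with atomic(conn):
            conn.execute("INSERT INTO items VALUES ('inner')")
            raise ValueError("simulated failure")
    except ValueError:
        pass  # the inner block was rolled back; the outer block continues

names = [row[0] for row in conn.execute("SELECT name FROM items")]
print(names)  # ['outer']
```

The inner failure discards only the inner insert, while the outer block's row is still committed. That is the error-handling benefit the summary attributes to nesting via savepoints.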

media r/LocalLLaMA · 9d ago

Updated Vision Model Benchmark Results and Recommendations

A revised benchmark of local vision language models evaluates 23 models on 30 images with 3 tests per image, for a total of 2,070 tests and 60 to 70 hours of inference. The top-performing model is Qwen3.6 27B (nothink) at Q4 with a score of 79.6, followed by Qwen3.5 4B (nothink) at Q4 and Qwen3-VL 8B at Q8. Key findings: thinking mode degrades vision performance, MoE models underperform dense models, and Q8 quantization does not universally improve results.
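The test count follows directly from the figures in the summary, and the arithmetic can be checked in a few lines. The per-test timing below is only a rough estimate: it assumes the 60 to 70 hours were spent evenly across all tests, which the summary does not state.

```python
# Figures taken from the summary above.
models = 23
images = 30
tests_per_image = 3

total_tests = models * images * tests_per_image
print(total_tests)  # 2070

# Assumption: inference time is spread evenly over every test.
seconds_per_test = [round(hours * 3600 / total_tests) for hours in (60, 70)]
print(seconds_per_test)  # [104, 122]
```

Under that assumption, each test averages a little under two minutes of inference.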

media AI News (smol.ai) · 10d ago

GLM-5.2 Breakout and Open-Model Progress Highlighted

Zhipu's GLM-5.2 emerged as the top open-weight model, praised for its frontier-adjacent performance in daily use, with improvements in coding tasks and reduced 1M-token inference cost via IndexShare. It outperformed other open models in agentic knowledge work benchmarks, reaching 1266 Elo in Artificial Analysis' AA-Briefcase test, though only 3% of tasks were fully satisfied by top models, indicating persistent challenges in real-world long-horizon agent performance.