Developer shuts down production LLM service citing reliability and hallucination issues

A developer has retired a production AI assistant for private clinic appointments after eight months of development, citing severe reliability issues with open-source models in a commercial setting. The project was abandoned due to the inability to guarantee correct results for third-party clients, leading to significant operational failures.

PydanticAI caused process halts and unresponsiveness when forced into synchronous environments.
OpenRouter providers failed to guarantee uptime, sometimes returning empty responses instead of errors.
LLMs frequently returned broken structured data that validators could not fix, causing infinite loops.
User emojis broke the bot's character, triggering unwanted emotional responses and hallucinations.
Agents exhibited aggressive behavior, such as gaslighting users about appointment times or canceling existing bookings without permission.

The author concludes that while open-source LLMs are competitive for personal use, they are currently unsuitable for production services where 100% correctness is required.