SAGE is a multi-agent framework for prompt optimization that combines diagnostic code execution with quantitative validation. It improves mental-health chatbot retention by aggregating eight cycles of noisy A/B tests into statistically significant gains, demonstrating effectiveness in open-ended dialogue tasks through qualitative and quantitative feedback integration.
SAGE: Stochastic Prompt Optimization via Agent-Guided Exploration
from English