AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing
Researchers propose AIGP, a framework using Large Language Models to address interpretability and long-term objective misalignment in e-commerce dynamic pricing. The system employs supervised fine-tuning and a Long-Term Value Estimator trained via offline reinforcement learning to align pricing decisions with business goals.