User asks about distilling models for agentic theorem proving
A user on r/LocalLLaMA is considering self-hosting models for agentic theorem proving to reduce costs, as they have hardware funding but no LLM credits. They propose distilling capabilities from a larger model into a smaller one suitable for niche use cases like Rocq, noting a lack of existing models for this specific language.