A Reddit user is soliciting real-world data on enterprise deployments of self-hosted artificial intelligence, distinguishing actual production use from hobbyist testing.
- One general model with RAG or context injection
- Multiple smaller specialist models
- Fine-tuned 70B-class models
- Larger 405B-class deployments
- One shared base model with multiple adapters
The inquiry seeks to understand the primary drivers behind these architectural choices, such as cost, privacy, latency, model quality, compliance, vendor risk, or operational simplicity.