What are companies actually using for self-hosted AI right now, and why?

A Reddit user is soliciting real-world data on enterprise deployments of self-hosted artificial intelligence, distinguishing actual production use from hobbyist testing.

One general model with RAG or context injection
Multiple smaller specialist models
Fine-tuned 70B-class models
Larger 405B-class deployments
One shared base model with multiple adapters

The inquiry seeks to understand the primary drivers behind these architectural choices, such as cost, privacy, latency, model quality, compliance, vendor risk, or operational simplicity.