LongCat-2.0 is introduced as a large-scale Mixture of Experts (MoE) language model featuring 1.6 trillion total parameters with approximately 48 billion activated per token.
- The model utilizes a Mixture of Experts architecture.
- It contains a total of 1.6 trillion parameters.
- Approximately 48 billion parameters are activated per token.
- The model was previously available stealthily on OpenRouter under the name 'owl-alpha'.