Introducing LongCat-2.0, a large-scale MoE language model

LongCat-2.0 is a large-scale Mixture of Experts (MoE) language model with 1.6 trillion total parameters, of which approximately 48 billion are activated per token.

The model utilizes a Mixture of Experts architecture.
It contains a total of 1.6 trillion parameters.
Approximately 48 billion parameters are activated per token.
The model was previously available on OpenRouter, without announcement, under the name 'owl-alpha'.
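
To put the two parameter counts in proportion, the short calculation below works out what share of the model is active for any single token. It is a minimal sketch: the constant and function names are illustrative, and the inputs are the approximate figures stated above, so the result is approximate as well.

```python
# Figures as stated in the text above; the 48 billion value is approximate.
TOTAL_PARAMS = 1_600_000_000_000   # 1.6 trillion total parameters
ACTIVE_PARAMS = 48_000_000_000     # ~48 billion activated per token


def active_fraction(total: int, active: int) -> float:
    """Return the share of parameters that are activated per token."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"Active share per token: {fraction:.1%}")  # prints "Active share per token: 3.0%"
```

Roughly 3% of the parameters take part in processing each token. This ratio describes parameter counts only; it does not by itself say anything about memory footprint or inference speed.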