LongCat-2.0 is introduced as a large-scale Mixture of Experts (MoE) language model featuring 1.6 trillion total parameters with approximately 48 billion activated per token.

  • The model utilizes a Mixture of Experts architecture.
  • It contains a total of 1.6 trillion parameters.
  • Approximately 48 billion parameters are activated per token.
  • The model was previously available stealthily on OpenRouter under the name 'owl-alpha'.