UCTF: A Universal Compressed Training Format for Multilingual AI
A new concept called UCTF (Universal Compressed Training Format) proposes a mediator layer to address semantic redundancy in multilingual LLM training by compressing diverse languages into a unified, language-agnostic token format.