A user asks for experience regarding the ablation of Mandarin, Russian, and Arabic from a model to create a primarily Latin-based version. The goal is to free up space for further training or safe pruning in contexts where English has no activation.
The author describes creating a Swadesh-esque noun/verb pair list across the four languages, ensuring each pair either token-matches every other pair or gets padded to match.