Closing the Quality Gap in Low-Resource Text-to-Speech: LoRA Fine-Tuning of VoxCPM2 for Khmer and Korean
Researchers address the quality gap in low-resource text-to-speech by fine-tuning the 2.4B-parameter VoxCPM2 model using Low-Rank Adaptation (LoRA) on a shared corpus of Khmer and Korean.