indic-parler-ttsModel45/100 via “fine-tuning-and-adaptation-for-custom-voices-and-languages”
text-to-speech model by undefined. 7,72,616 downloads.
Unique: Supports parameter-efficient fine-tuning through LoRA adapters on speaker encoder and language-specific components, reducing fine-tuning memory requirements by 50-70% compared to full fine-tuning. Fine-tuning pipeline includes language-specific data preprocessing (grapheme-to-phoneme conversion, text normalization) to ensure custom data is processed correctly.
vs others: Enables faster fine-tuning than training TTS from scratch through transfer learning, while maintaining quality comparable to models trained on large custom datasets. LoRA-based fine-tuning reduces computational barriers compared to full fine-tuning, making model adaptation accessible to resource-constrained teams.