Article
The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...
By
•
•
61Hi! Thanks for the discovery :) How hard would it be to train this model for French TTS ? Would it only require 100k hours of audio dataset? or some other (maybe complete) code? I'm wondering since Llama3.2 is multilingual how does this relate to this model?