Babe wake up, another TTS system just dropped for mrq to look at: https://ai.facebook.com/blog/voicebox-generative-ai-model-speech/
It's not auto-regressive and uses something called 'flow…
I tried that fork and I the voice replication is comparable to using the non-finetuned custom voice of Tortoise in that it kind of replicate the voice of characters but it doesn't do well with…
Sort of off-topic but Microsoft just published Natural Speech 2 which seems to be a significant improvement over VALLE architecture. A short skim through of the paper it seems to be a latent…
I'm biting the bullet and dumping in LibriTTS clean-100 (247 speakers, 30k unsegmented lines, don't have an idea about duration yet or final line count).
If you are going to use LibriTTS then…
Are there's still any advantages to tortoise after playing around with VALL-E?
I was going to say it's a bit of a shame that most of it is already Persona 4, but if it's Golden, then that's golden.
Yeah the one on the doc are the golden version! At least according to…
Have you checked out the Pony Preservation Project Datasets You can found them here:
https://mega.nz/folder/jkwimSTa#_xk0VnR30C8Ljsy4RCGSig/folder/OloAmDqZ
and here (These are non-MLP…