StyleTTS2 - New Free Voice Cloning TTS #454
Labels
No Label
bug
duplicate
enhancement
help wanted
insufficient info
invalid
news
not a bug
question
wontfix
No Milestone
No project
No Assignees
3 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: mrq/ai-voice-cloning#454
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
This project looks promising
https://github.com/yl4579/StyleTTS2
I wish there was a GUI or a tutorial for how to use it though.
(It was mainly built to read audio-books)
To be followed indeed...!
You can use the Inference_LibriTTS.ipynb file locally
Just install these
Then open the Inference_LibriTTS.ipynb file
@Bluebomber182 Hey, I've actually seen the videos you've made on YouTube,!
But, I don't think you can fine tune your own model using the Inference files, thankfully someone made a guide on how to finetune our models https://github.com/IIEleven11/StyleTTS2FineTune
So I'm kinda confused, how did you clone these voices on YouTube? did you fine tune your own model? did you just do the One-Shot voice clone? or did you just use the included voice in StyleTTS2 and used your model of RVC with it?
@FortermalGreek
I finetune the voices by following these guides
https://github.com/yl4579/StyleTTS2/discussions/65#discussioncomment-7737796
https://gist.github.com/Shiro836/1c1881435ee75dc81069c505a14a9423
https://github.com/yl4579/StyleTTS2/discussions/81
I then put the styletts2 output files through rvc.
I also made a styletts2 tutorial. This doesn't include finetuning.
https://www.youtube.com/watch?v=2dg_xRnMYT4
I found an improved styletts2 base model for those interested in fine tuning.
https://huggingface.co/ShoukanLabs/Vokan