StyleTTS2 - New Free Voice Cloning TTS #454

Open
opened 2023-11-26 16:54:48 +00:00 by FortermalGreek · 5 comments

This project looks promising

https://github.com/yl4579/StyleTTS2

I wish there was a GUI or a tutorial for how to use it though.
(It was mainly built to read audio-books)


To be followed indeed...!


You can run the Inference_LibriTTS.ipynb notebook locally.
Just install these:

pip install notebook jupyter

Then open the Inference_LibriTTS.ipynb file

jupyter notebook Inference_LibriTTS.ipynb
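
If the notebook refuses to open, a quick preflight check can help narrow down why. The following is only a rough sketch, not part of StyleTTS2: the helper names (`missing_tools`, `launch_command`, `preflight`) are made up for illustration, and it assumes you run it from the folder that contains Inference_LibriTTS.ipynb. It only prints what it finds and does not start Jupyter itself.

```python
import shutil
from pathlib import Path

# Notebook name taken from the comment above; adjust the path if your
# checkout keeps the file in a different folder (assumption: current folder).
NOTEBOOK = "Inference_LibriTTS.ipynb"


def missing_tools(tools=("jupyter",)):
    """Return the names of command-line tools that are not on PATH."""
    return [name for name in tools if shutil.which(name) is None]


def launch_command(notebook=NOTEBOOK):
    """Build the same command the comment runs by hand."""
    return ["jupyter", "notebook", str(Path(notebook))]


def preflight(notebook=NOTEBOOK):
    """Collect readable problems that would stop the notebook from opening."""
    problems = [
        f"'{name}' not found; try: pip install notebook jupyter"
        for name in missing_tools()
    ]
    if not Path(notebook).is_file():
        problems.append(f"{notebook} not found in {Path.cwd()}")
    return problems


if __name__ == "__main__":
    issues = preflight()
    if issues:
        print("\n".join(issues))
    else:
        print("Ready. Run:", " ".join(launch_command()))
```

If it reports no problems, the printed command is the same one shown above and can be pasted into the terminal.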

Author

@Bluebomber182 Hey, I've actually seen the videos you've made on YouTube!

But I don't think you can fine-tune your own model using the Inference files. Thankfully, someone made a guide on how to fine-tune our own models: https://github.com/IIEleven11/StyleTTS2FineTune

So I'm kinda confused: how did you clone these voices on YouTube? Did you fine-tune your own model? Did you just do the one-shot voice clone? Or did you just use the included voice in StyleTTS2 and run it through your RVC model?


@FortermalGreek
I fine-tune the voices by following these guides:
https://github.com/yl4579/StyleTTS2/discussions/65#discussioncomment-7737796
https://gist.github.com/Shiro836/1c1881435ee75dc81069c505a14a9423
https://github.com/yl4579/StyleTTS2/discussions/81
I then put the StyleTTS2 output files through RVC.

I also made a StyleTTS2 tutorial. It doesn't cover fine-tuning.
https://www.youtube.com/watch?v=2dg_xRnMYT4


I found an improved StyleTTS2 base model for those interested in fine-tuning.
https://huggingface.co/ShoukanLabs/Vokan

Reference: mrq/ai-voice-cloning#454