Implement Training #30
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Can you please include all the existing voices so the voices folder is not empty and we can test everything works during the first run?
Also might be a good idea to collaborate with https://github.com/152334H/tortoise-tts-fast as end goal seems to be similar.
I don't know if the intention is to leave the voices folder empty, butfor now you can grab them from an older commit:811539b20a/tortoise/voices
Edit: it probably was intentional, as the include in MANIFEST.in is still there
I restored the
random
voice option, as that got forgotten from adding in a web UI in commit37d25573ac
. It would've been nice when I was testing the colab notebook instead of importing a latent every setup, but oh well. On the note of the colab:Right. I did it for a number of (in post, stupid) reasons that I don't think anyone wants to be bothered hearing the laundry list of "despite it not really mattering in the end" or some other cope, but in the end it helps with the consistency of userdata being outside of the
./tortoise/
folder (honestly, voices have no reason to be in that folder, but I wouldn't know better).To restore them, run:
git checkout 811539b20adfe6d85d2bc3e6728d55fd2427aae0 ./tortoise/voices/
git checkout 811539b20adfe6d85d2bc3e6728d55fd2427aae0 .\tortoise\voices\
Do not forget to move them out from the
./tortoise/
folder and into./voices/
, because of the nightmare of trying to maintain compatibility between people using either the old or the new spot (which I might as well just make it grab voices from either locations).!TODO!: fill in later, as I'm already spending too much time into this section, so I want the rest of the comment submitted first.Actually, I'll keep the unabridged version to myself, much less, any of the abridged version, as they're rather blunt and aren't necessarily within the scope of this repo.Damn, I feel like a foolish fool, as I removed them (initially) solely because they kept getting copied during
setup.py install
.Understood. Btw, they've seemed to have figured out the training part, please take a look and see if this can be added to this repo, with everything accessible via gradio.
Christ, that was quick. Props to that mad lad for getting it working. Conveniently just in time too, as I felt I was starting to run out of ways to boost performance/quality to make up for the lack of being able to fine-tune/retrain.
I suppose I'll list out my thoughts around it (bulleted, because my autism likes bulleting things):
kv_cache
ingtrain.py
nicely to train ittts.tts_with_preset
from the original tortoiseAs for what's on my plate (my own sort of to-do, I suppose to put it in writing):
Training
tabHowever, I make no promises on my to-do.
Existing voices and Random voiceto Implement TrainingI should have everything working for training under the new, cleaned up repo. It took a lot of headaches from how many oddball fixes were needed (thinking about them right now is making my right eye twitch), but it easily handles everything from:
Defer TTS Load
setting and restarting the web UI)And all of this will reuse the existing tortoise-tts files, and the
dvae.pth
gets easily downloaded and stored alongside the other models. Literally zero configuration outside of providing your training material and the parameters.It's not exactly what I tried to sought for: