Looking through my install notes.... I also had to do this at one point:
pip uninstall whisper pip install openai-whisper
One more thing, I also have a few programs installed directly:
TorToiSe 2.4.5 D:\APPLICATIONS\ai-voice-cloning-2\ai-voice-cloning\modules\tortoise-tts dlas …
Here is my requirements.txt:
git+https://github.com/openai/whisper.git
more-itertools ffmpeg-python gradio<=3.23.0 music-tag voicefixer psutil phonemizer pydantic==1.10.11 websockets…
I suspect it is, yes. Is there a command that can see what versions of software are installed? I have a functional copy of the software on my computer (I haven't trained in awhile...). I could…
@Atoli Your .pth files should stay in the "models" folder.
For instance \ai-voice-cloning\training\white_female3a\finetune\models It is looking for .wav files in the voices folder. …
@DoctorPopi Re: are there background sounds or noises? Nope. I have found that running any samples through a musical vocal remover actually has helped clarify my audio samples. I already…
@DoctorPopi How does it sound though? I am not sure how VALL-E sounds - I have stuck with using Tortoise TTS engine. My datasets are about 30 to 40.
It seems that for as well as Tortoise…
@MrMustachio43 What I'm going to say may seem intuitive, because I don't know that it's explicitly written up in any of the documentation. You need to go to settings and load the finetuned…
@DoctorPopi I was using 20 short clips, and it seemed like one off clip ruined the whole voice. So back to the drawing board. I am wondering if I should have just pressed forward with more…
Reinstalled under a fresh environment. Not quite sure if the error was precisely related to not having even numbers? I'll never know, but had a clean install this time. Red flag should have…
I don't know German, but this sounds fantastic. I am wondering how whisper translation to German (not sure if that is one of the options) would turn out.
Question -- based on the Alice in…
This is probably another artifact of my unelegant install with base tortoise-tts environment. I am going to reinstall, now that I think I understand a bit more.
I set my voice chunks to 512 or 256 -- but I think the key is Temperature to 1 or very high -- as someone said about 0.75. I realized this when watching people use the tool via Youtube.
I…
Use a small subset then.
With a small subset (8 clips of ~4 seconds each):
1 chunk: https://vocaroo.com/15lY8pR1WRhb 2 chunks: https://vocaroo.com/19R30vtl8gjn 4 chunks:…