psammites
  • Joined on 2023-03-11
psammites commented on issue mrq/ai-voice-cloning#285 2023-06-27 18:44:36 +00:00
Training with German audio files, speaks English. How to make German models?

How many epochs did you train and what was the final loss_gpt_total?

psammites commented on issue mrq/ai-voice-cloning#284 2023-06-27 01:19:46 +00:00
Unable to complete training.

That's a PyTorch error; if you google the message, you will find some suspected causes and possible workarounds.

psammites commented on issue mrq/ai-voice-cloning#282 2023-06-26 20:35:45 +00:00
Model sounds bad and nothing like the original

Depends on the batch size/step count, but in your case yes. Looking at the final line of your training log:

23-06-24 17:35:30.808 - INFO: Training Metrics: {"loss_text_ce": 4.19026517868042,…
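If you want to pull the final metrics out of a log like this without reading it by eye, something along these lines might work. It is only a sketch: it assumes each metrics line ends with a complete JSON object after "Training Metrics:", and the helper names (parse_metrics, last_metrics) are made up for illustration, not part of the repo.

```python
import json
import re

# Assumption: a metrics line looks like
#   <timestamp> - INFO: Training Metrics: {...json...}
# with the JSON object running to the end of the line.
METRICS_RE = re.compile(r"Training Metrics:\s*(\{.*\})\s*$")


def parse_metrics(line):
    """Return the metrics dict from one log line, or None if it has none."""
    match = METRICS_RE.search(line)
    if match is None:
        return None
    return json.loads(match.group(1))


def last_metrics(lines):
    """Return the metrics from the last matching line, or None."""
    found = None
    for line in lines:
        metrics = parse_metrics(line)
        if metrics is not None:
            found = metrics
    return found
```

Passing an open log file to last_metrics (it iterates line by line) and reading the "loss_gpt_total" key from the result would give the value asked about in these threads, provided the log really has that layout.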

psammites commented on issue mrq/ai-voice-cloning#282 2023-06-26 14:40:55 +00:00
Model sounds bad and nothing like the original

Hmm, I can't find anything wrong with the audio samples, but 500 steps is probably too few for the number of samples you have. You might need to do around 3-4x more to get something that matches…
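As a rough way to reason about step counts, the relationship between steps, batch size and epochs can be sketched like this. It assumes one step processes one batch and that a partial final batch still counts as a step; the trainer may count differently (for example with gradient accumulation), and the function names are hypothetical.

```python
import math


def steps_per_epoch(num_samples, batch_size):
    """Steps needed to see every sample once (assumes partial batches count)."""
    return math.ceil(num_samples / batch_size)


def epochs_for_steps(total_steps, num_samples, batch_size):
    """How many passes over the dataset a given step count amounts to."""
    return total_steps / steps_per_epoch(num_samples, batch_size)


def scaled_step_range(current_steps, low_factor=3, high_factor=4):
    """Step counts for 'around 3-4x more' than the current run."""
    return current_steps * low_factor, current_steps * high_factor
```

With the 500 steps mentioned above, scaled_step_range(500) gives the range to aim for; plug your own sample count and batch size into epochs_for_steps to see how many epochs that corresponds to.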

psammites commented on issue mrq/ai-voice-cloning#283 2023-06-26 04:14:30 +00:00
Large dataset finetuning

Taking a wild guess that the problem is trying to process UTF encoded bopomofo using the default tokenizer. You might need to look at the example in models/tokenizers/japanese.json and write…

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-26 01:08:25 +00:00
Missing dataset: whisper.json.

Also, I have a question: the documentation says ffmpeg is needed for training, but it never says where to put ffmpeg after downloading it.

I believe you need to install it via pip so it can…

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-25 19:52:57 +00:00
Missing dataset: whisper.json.

"[WinError 2] The system cannot find the file specified" is the same error as before, so it looks like the problem is not related to the cloning software but more likely something to do with your…

psammites commented on issue mrq/ai-voice-cloning#282 2023-06-25 16:22:09 +00:00
Model sounds bad and nothing like the original

Please upload your training data for the model (excluding the finetune folder).

psammites commented on issue mrq/ai-voice-cloning#281 2023-06-25 16:19:37 +00:00
Attempting to restart training doesn't actually restart the training.

What did you set for Resume State Path when you tried to resume the training?

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-25 16:17:34 +00:00
Missing dataset: whisper.json.

To install the normal whisper, I did what the GitHub repo said: pip install git+https://github.com/openai/whisper.git . That worked; in the screenshot you can see it says it loaded…

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-25 01:58:55 +00:00
Missing dataset: whisper.json.

Hmm, it's a valid .wav file... it should be able to convert it from there. Can you try running whisperx on it (or just whisper, if that's what you have installed) and see if it throws an error?

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-25 00:33:30 +00:00
Missing dataset: whisper.json.

Please run ffprobe on the file and post the results.
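If running it by hand is awkward, a small wrapper like the one below could collect the same information as JSON. This is a sketch, not something the repo provides: it assumes ffprobe is installed and on PATH, the helper names are hypothetical, and the flags used (-v error, -show_format, -show_streams, -of json) are standard ffprobe options.

```python
import json
import shutil
import subprocess


def build_ffprobe_cmd(path):
    """Command line asking ffprobe for container and stream info as JSON."""
    return ["ffprobe", "-v", "error", "-show_format", "-show_streams",
            "-of", "json", path]


def probe(path):
    """Run ffprobe on path and return its parsed JSON report."""
    if shutil.which("ffprobe") is None:
        raise FileNotFoundError("ffprobe was not found on PATH")
    result = subprocess.run(build_ffprobe_cmd(path),
                            capture_output=True, text=True)
    if result.returncode != 0:
        # ffprobe writes its diagnostics to stderr.
        raise RuntimeError(result.stderr.strip() or "ffprobe failed")
    return json.loads(result.stdout)
```

Calling probe on the audio file and posting the "streams" part of the result (codec, sample rate, channels) is usually enough to tell whether the file itself is the problem.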

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-24 16:48:31 +00:00
Missing dataset: whisper.json.

It's the second-to-last message that indicates the root of the problem. It can't find the file, so it can't transcribe anything, so the whisper.json never gets made.
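A quick way to narrow down which "file" is missing is to check both the audio path and any external programs the transcription step might call. The sketch below is an assumption about what could be missing, not a diagnosis of this particular log; find_missing is a hypothetical helper and the default tool list (ffmpeg) is a guess.

```python
import shutil
from pathlib import Path


def find_missing(audio_path, tools=("ffmpeg",)):
    """List what cannot be found: the audio file and/or required executables."""
    problems = []
    if not Path(audio_path).is_file():
        problems.append(f"audio file not found: {audio_path}")
    for tool in tools:
        # shutil.which returns None when the program is not on PATH.
        if shutil.which(tool) is None:
            problems.append(f"executable not on PATH: {tool}")
    return problems
```

An empty list means both the file and the listed programs were found, in which case the cause lies elsewhere.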

psammites commented on issue mrq/ai-voice-cloning#279 2023-06-24 04:39:23 +00:00
Missing dataset: whisper.json.

Something may have gone wrong with the transcription, please post your console log.

psammites commented on issue mrq/ai-voice-cloning#274 2023-06-23 15:41:42 +00:00
Can't use whisperX.

The error is that the large model isn't supported

According to the whisperx documentation, large is aliased to large-v2.

psammites commented on issue mrq/ai-voice-cloning#267 2023-06-23 05:16:01 +00:00
Are conditioning latents harder to generate for larger datasets?

Can you tell me how to generate the cond_latents files?

Click the (Re)Compute Voice Latents button.

psammites commented on issue mrq/ai-voice-cloning#278 2023-06-22 22:20:19 +00:00
I ran the update.bat and now it won't run (screen shot attached)

Activate the venv and then run pip install -r requirements.txt.

If that doesn't fix it, run the setup script again.

psammites commented on issue mrq/ai-voice-cloning#268 2023-06-22 22:18:21 +00:00
Install halts

Delete the repo, re-clone, then try again.

psammites commented on issue mrq/ai-voice-cloning#274 2023-06-21 20:27:59 +00:00
Can't use whisperX.

What's the error? Please post your console log.

psammites commented on issue mrq/ai-voice-cloning#273 2023-06-20 13:05:21 +00:00
Maybe dumb question; Could running multiple instances share the same models/resources?

Unless your time is worthless, the amount of development (not just coding but testing and debugging) this would require would likely cost more than buying a second GPU.