psammites
  • Joined on 2023-03-11
psammites commented on issue mrq/ai-voice-cloning#174 2023-03-25 20:23:28 +00:00
Fresh install and I get this error when recording into the microphone

It's not an error, just a warning. Does it still work?

psammites commented on issue mrq/ai-voice-cloning#173 2023-03-25 20:22:23 +00:00
Recommendations for generating latents and finetunes?

What's the recommendation on number of epochs when training for a dataset of 200 vs something like 1000 clips (assuming they're all cut down between 1 and 11 seconds and transcribed properly)?

psammites commented on issue mrq/ai-voice-cloning#172 2023-03-25 03:06:06 +00:00
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)

I had actually assumed that, since I couldn't get the program to work for me, I could just skip this step by manually transcribing each .wav file in the whisper.json file manually; line by line.…

psammites commented on issue mrq/ai-voice-cloning#160 2023-03-24 22:29:00 +00:00
Can't train a single good model

After you've trained a model am I correct in saying that the voice chunks should be set to 0 when you're using that model?

IFUIC when set to 0 it'll attempt to calculate a size automatically…

psammites commented on issue mrq/ai-voice-cloning#171 2023-03-24 21:21:08 +00:00
Blank error when attempting to prepare dataset for training.

It makes sense, since there's no actual option to refresh the "dataset source" list, there was no way for me to select my voice from the list

"Refresh Voice List" on the Generate tab will…

psammites commented on issue mrq/ai-voice-cloning#171 2023-03-24 20:47:54 +00:00
Blank error when attempting to prepare dataset for training.

Hmm. What's in the directory for the voice you're attempting to prepare the dataset from? Are the files valid .wav's?

psammites commented on issue mrq/ai-voice-cloning#171 2023-03-24 20:34:18 +00:00
Blank error when attempting to prepare dataset for training.

What do you have set as your Whisper Backend?

psammites commented on issue mrq/ai-voice-cloning#170 2023-03-24 19:27:46 +00:00
Error involving zipfile upon attempting to resume training.

The .pth files are actually zips. See if you can open your 300_gpt.pth in 7z or similiar archive program. If it's corrupted you might be out of luck.

psammites commented on issue mrq/ai-voice-cloning#166 2023-03-24 16:30:59 +00:00
utf-8 codec can't decod ebyte 0x81 in position 2

Somehow I missed the [Training] [2023-03-23T01:40:04.035070] ModuleNotFoundError: No module named 'dlas' bit above. You might need to re-run the setup script. If that doesn't fix it I could try…

psammites commented on issue mrq/ai-voice-cloning#166 2023-03-24 03:51:26 +00:00
utf-8 codec can't decod ebyte 0x81 in position 2

If you have Notepad++ you can open up those two files, then go to Encoding>Convert to UTF8, save them and see if there's any difference.

psammites commented on issue mrq/ai-voice-cloning#169 2023-03-24 03:31:40 +00:00
Training starts, then immediately stops and reports as "finished".

Huh, that's weird. Here's a log for some training I did earlier today to compare with:

Spawning process:  ./train.sh ./training/HyeonSeo/train.yaml
[Training] [2023-03-23T13:12:10.328080]…
psammites commented on issue mrq/ai-voice-cloning#169 2023-03-24 03:08:31 +00:00
Training starts, then immediately stops and reports as "finished".

And the lr_scheduler one (Though it didn't show in this instance):

This this one occurs for me also, but AFAIK it's harmless, the only thing there that looks weird to me is:

[Training]…

psammites commented on issue mrq/ai-voice-cloning#160 2023-03-23 18:48:17 +00:00
Can't train a single good model

Use a small subset then.

With a small subset (8 clips of ~4 seconds each):

1 chunk: https://vocaroo.com/15lY8pR1WRhb 2 chunks: https://vocaroo.com/19R30vtl8gjn 4 chunks: https://vocaroo.c

psammites commented on issue mrq/ai-voice-cloning#160 2023-03-23 17:15:58 +00:00
Can't train a single good model

Too large. Start small and increase upwards.

Large data set, smaller values OOM.

psammites commented on issue mrq/ai-voice-cloning#160 2023-03-23 15:18:27 +00:00
Can't train a single good model

...because you need to click (Re)compute Voice Latents when you want to regenerate them.

<face palm emoji>

Anyway, with regenerating the latents between each:

512 chunks:…

psammites commented on issue mrq/ai-voice-cloning#160 2023-03-22 22:51:11 +00:00
Can't train a single good model

Regardless of semantics, the same principle I've preached applies: play around with it

Okay:

sneed@FMRLYCHKS:~/ai-voice-cloning/results/HyeonSeo$ ll
total 849648
drwxrwxrwx 1 sneed…
psammites opened issue mrq/ai-voice-cloning#167 2023-03-22 18:14:54 +00:00
"No such file or directory: 'train.sh'" after updating to f822c87344d2ce01bd077b48d9db42740c51ea04
psammites commented on issue mrq/ai-voice-cloning#160 2023-03-22 15:05:48 +00:00
Can't train a single good model

Which goes back to the main thing I keep telling you all: play around with the damn voice latent chunk size slider. The defaults will never, ever be a catch-all size.

On the Wiki you…

psammites commented on issue mrq/ai-voice-cloning#166 2023-03-22 07:32:14 +00:00
utf-8 codec can't decod ebyte 0x81 in position 2

Likely to be invalid UTF8 characters in your train.txt or validation.txt files. Are you training a language other than English?