psammites

psammites commented on issue mrq/ai-voice-cloning#152

2023-06-06 14:17:20 +00:00

VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

There's a relatively new TTS called Balacoon, aimed at low-end devices. I tried it out on my desktop and it was faster than RT.

How was the quality?

psammites commented on issue mrq/ai-voice-cloning#244

2023-05-28 17:47:17 +00:00

Step by step data prep and training/finetuning guide?

A large chunk of memory is already being used or set aside, or what, I'm not sure.

Enable Do Not Load TTS On Startup and restart.

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-25 15:04:49 +00:00

Out of memory errors and using whisperX

As noted in issue #68 from the whisperx repo, "with VAD" is now the default, so you could try changing utils.py to just call transcribe() and see if that works. I always prepare my datasets…

psammites commented on issue mrq/ai-voice-cloning#244

2023-05-25 14:57:12 +00:00

Step by step data prep and training/finetuning guide?

Hmm. The only thing that sticks out to me is that the audio is mono. I don't see any reason why that should be a problem but all my samples are stereo. Can you try and reproduce the fault with…
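
For anyone wanting to check whether their own clips are mono or stereo, the channel count can be read straight from the WAV header. This is only a minimal sketch using Python's standard `wave` module, assuming the clips are uncompressed PCM WAV files; `channel_count` is a hypothetical helper name for illustration, not something from the repo.

```python
import wave

def channel_count(path):
    """Return the number of channels in a PCM WAV file (1 = mono, 2 = stereo)."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels()
```

Calling `channel_count` on each clip in a dataset folder gives a quick way to spot clips that differ from the rest.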

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-25 04:34:22 +00:00

Out of memory errors and using whisperX

There's probably some way to tinker with your current install to get it working but I think the most efficient thing to do is wipe it (save your datasets, of course), reclone the repo, and run the…

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-24 15:11:22 +00:00

Out of memory errors and using whisperX

I'm still searching for how to include this authorization token the error asks for, assuming it actually is the problem. There doesn't seem to be an obvious way.

It's [in the Wiki](https://git.e…

psammites commented on issue mrq/ai-voice-cloning#152

2023-05-24 04:35:25 +00:00

VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

Isn't that just for finetuning a wav2vec2 model?

That's what they are, as far as I can tell.

psammites commented on issue mrq/ai-voice-cloning#244

2023-05-24 04:00:09 +00:00

Step by step data prep and training/finetuning guide?

Can you run ffprobe on the clips and post the output?
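
If there are a lot of clips, the ffprobe run can be scripted. The sketch below is one possible way to do it, assuming `ffprobe` is on the PATH; the helper names `build_ffprobe_cmd` and `probe` are made up for illustration, while the flags (`-v error`, `-show_format`, `-show_streams`, `-of json`) are standard ffprobe options.

```python
import json
import subprocess

def build_ffprobe_cmd(path):
    """Build an ffprobe command that reports container and stream info as JSON."""
    return [
        "ffprobe",
        "-v", "error",      # only log errors, hide the banner
        "-show_format",     # container-level info (duration, bit rate, ...)
        "-show_streams",    # per-stream info (codec, sample rate, channels, ...)
        "-of", "json",      # machine-readable output
        path,
    ]

def probe(path):
    """Run ffprobe on one clip and return the parsed JSON output."""
    result = subprocess.run(
        build_ffprobe_cmd(path), capture_output=True, text=True, check=True
    )
    return json.loads(result.stdout)
```

The `streams` entries in the returned dictionary carry fields such as `sample_rate` and `channels`, which are usually the first things worth comparing between clips.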

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-24 03:58:23 +00:00

Out of memory errors and using whisperX

After activating the venv does pip list installed show whisperx?
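
The same check can be done from Python inside the activated venv. A minimal sketch using the standard `importlib.metadata` module; it assumes the package is registered under the distribution name `whisperx`, and `installed_version` is a hypothetical helper name.

```python
from importlib import metadata

def installed_version(package):
    """Return the installed version of a package in this environment, or None."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None

# Assumption: the distribution name is "whisperx"
print("whisperx:", installed_version("whisperx") or "not installed")
```

If this prints "not installed" while the command works elsewhere, the venv that is active is probably not the one the package was installed into.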

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-23 22:07:44 +00:00

Out of memory errors and using whisperX

Try activating the venv in the directory you cloned the repo into and then git submodule update --remote

psammites commented on issue mrq/ai-voice-cloning#244

2023-05-23 16:59:27 +00:00

Step by step data prep and training/finetuning guide?

I adjusted the batch size down to 4 as suggested when validating the training file and even down to 2 with the gradient accumulation size at 1 but it still gives the same error. I shrank down my…

psammites commented on issue mrq/ai-voice-cloning#249

2023-05-23 16:57:14 +00:00

Out of memory errors and using whisperX

Can you run whisperx from the command line? (Not through importing it in a python session, just from the prompt.)

psammites commented on issue mrq/ai-voice-cloning#152

2023-05-23 16:55:16 +00:00

VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)