psammites
  • Joined on 2023-03-11
psammites commented on issue mrq/ai-voice-cloning#152 2023-06-06 14:17:20 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

There's a relatively new TTS called Balacoon, aimed at low end devices. I tried it out on my desktop and it was faster than RT.

How was the quality?

psammites commented on issue mrq/ai-voice-cloning#244 2023-05-28 17:47:17 +00:00
Step by step data prep and training/finetuning guide?

A large chunk of memory is already being used or set aside or what I'm not sure.

Enable Do Not Load TTS On Startup and restart.

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-25 15:04:49 +00:00
Out of memory errors and using whisperX

As noted in issue #68 from the whisperx repo "with VAD" is now the default, so you could try changing utils.py to just call transcribe() and see if that works. I always prepare my datasets…

psammites commented on issue mrq/ai-voice-cloning#244 2023-05-25 14:57:12 +00:00
Step by step data prep and training/finetuning guide?

Hmm. The only thing that sticks out to me is that the audio is mono. I don't see any reason why that should be a problem but all my samples are stereo. Can you try and reproduce the fault with…

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-25 04:34:22 +00:00
Out of memory errors and using whisperX

There's probably some way to tinker with your current install to get it working but I think the most efficient thing to do is wipe it (save your datasets, of course), reclone the repo, and run the…

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-24 15:11:22 +00:00
Out of memory errors and using whisperX

Im still searching for how to include this authorization token the error asks for assuming it actually is the problem. There doesn't seem to be an obvious way.

It's [in the Wiki](https://git.e

psammites commented on issue mrq/ai-voice-cloning#152 2023-05-24 04:35:25 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

Isn't that just for finetuning a wav2vec2 model?

That's what they are, as far as I can tell.

psammites commented on issue mrq/ai-voice-cloning#244 2023-05-24 04:00:09 +00:00
Step by step data prep and training/finetuning guide?

Can you run ffprobe on the clips and post the output?

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-24 03:58:23 +00:00
Out of memory errors and using whisperX

After activating the venv does pip list installed show whisperx?

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-23 22:07:44 +00:00
Out of memory errors and using whisperX

Try activating the venv in the directory you cloned the repo into and then git submodule update –remote

psammites commented on issue mrq/ai-voice-cloning#244 2023-05-23 16:59:27 +00:00
Step by step data prep and training/finetuning guide?

I adjusted the batch size down to 4 as suggested when validating the training file and even down to 2 with the gradient accumulation size at 1 but it still gives the same error. I shrank down my…

psammites commented on issue mrq/ai-voice-cloning#249 2023-05-23 16:57:14 +00:00
Out of memory errors and using whisperX

Can you run whisperx from the command line? (Not though importing it in a python session, just from the prompt.)

psammites commented on issue mrq/ai-voice-cloning#152 2023-05-23 16:55:16 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

https://github.com/facebookresearch/fairseq/blob/main/examples/mms

facebook released some models, not sure how to use it tho

It's right there in the [TTS](https://github.com/facebookres

psammites commented on issue mrq/ai-voice-cloning#232 2023-05-09 13:50:46 +00:00
ModuleNotFoundError: No module named 'dlas' when trying to run Training

Try git submodule update --remote

psammites commented on issue mrq/ai-voice-cloning#232 2023-05-09 02:17:00 +00:00
ModuleNotFoundError: No module named 'dlas' when trying to run Training

Did you run the setup script?

psammites commented on issue mrq/ai-voice-cloning#230 2023-05-05 15:10:01 +00:00
tqdm.write() got an unexpected keyword argument 'desc'

Can you run whisperx from the command line?

psammites commented on issue mrq/ai-voice-cloning#227 2023-05-04 13:20:00 +00:00
Error when setting CVVP value.

Are you using a voice sample or random? If the former, what's the file size of the latents?

psammites commented on issue mrq/ai-voice-cloning#227 2023-05-03 16:02:26 +00:00
Error when setting CVVP value.

Do you Slimmer Computed Latents turned on? If so that'll stop CVVP from working.

psammites commented on issue mrq/ai-voice-cloning#221 2023-05-01 01:06:26 +00:00
Getting total gibberish when finetuning on a new language

I don't think the IPA tokenizer would be required for Dutch. What's your loss graph look like?

psammites commented on issue mrq/ai-voice-cloning#221 2023-04-30 11:33:47 +00:00
Getting total gibberish when finetuning on a new language

How long have you trained the model for?