Are you trying to create a tokenizer that handles Devanagari, or are you romanizing words like "is kadar"?
This process would be a lot of work for one person, unless there's a better method.
Almost all of what you've proposed above can be done automatically. whisperx
produces millisecond-granu…
If you check the training/\<voice name\>/finetune/models/
directory, is there an 800_gpt.pth
file there? Can you use it to generate samples?
I've never used a Colab notebook; you might want to check the issue tracker on whisperx's repo and see if someone over there knows.
You can use --model_dir
to point it at the location where your model files are stored.
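For example (a sketch, not a definitive invocation — the audio file, model name, and path below are placeholders):

```shell
# Transcribe with whisperx, telling it to load/store model weights
# in a custom directory instead of the default cache.
# "audio.wav", "large-v2", and "/path/to/models" are placeholders.
whisperx audio.wav --model large-v2 --model_dir /path/to/models
```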
I've never been able to get whispercpp working, so no idea there. For whisperx I would try installing it separately using the instructions in the repo: https://github.com/m-bain/whisperX
Hmm. I see why the first run took extremely long: it had to generate the latents for that voice and model. But that doesn't explain why it took so long the second time. Try changing "Sample…
Post your console log.
I would need to do wav2vec2 alignment on the transcription text and do my own segmenting to pare down the audio. I suppose it's simple enough, since I could maybe co-opt whisperX to do just that, but I…
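The segmenting half of that could be fairly mechanical: once an aligner (whisperX's wav2vec2 step, or anything similar) has produced word-level timestamps, group them into clips. A minimal sketch, assuming `(word, start, end)` tuples as input — the function name and thresholds are made up for illustration:

```python
def segment_words(words, max_gap=0.5, max_len=10.0):
    """Group (word, start, end) tuples into clips, starting a new clip
    when the pause before a word exceeds max_gap seconds or the clip
    would grow past max_len seconds."""
    clips = []
    current = []
    for word, start, end in words:
        if current and (start - current[-1][2] > max_gap
                        or end - current[0][1] > max_len):
            clips.append(current)
            current = []
        current.append((word, start, end))
    if current:
        clips.append(current)
    return clips
```

Splitting at long pauses keeps clip boundaries on natural silences, which matters for training data quality.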
Forgive me for butting in, but how come you haven't worked on building a more varied dataset, then? There are hundreds of hours of video game dialogue & podcasts available for you to build a more…
That's the right one. You can try git restore models/.template.dlas.yaml
but it's odd that it isn't there already. Have you run update.bat?
I don't have a mic so I can't check, but you might need to adjust your microphone settings so that the recording sample rate is 22050 Hz.
```
Collecting numba
  Using cached numba-0.56.4.tar.gz (2.4 MB)
  Preparing metadata (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py egg_info…
```