deviandice
  • Joined on 2023-02-18
deviandice commented on issue mrq/ai-voice-cloning#152 2023-03-29 00:13:35 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

And you can only really get good compute with Ada cards (4070Ti and up) or multiple Ampere cards

I mean, if that's the case, I'll have a second 3090 with NVLink sometime next month, so maybe…

deviandice commented on issue mrq/ai-voice-cloning#152 2023-03-28 13:48:56 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

For zero-shot inferencing applications, diversity (ick) is a HUGE factor in having a good model. There's only so much data to sample from when trying to mimic voices. I worry that when I finally…

deviandice commented on issue mrq/ai-voice-cloning#85 2023-03-07 16:59:22 +00:00
Long prompts and random voice

Each new line restarts the voice process. IMO, if you find a line you like, you should use that as your voice.

Otherwise you're asking for a complete system rework.

deviandice commented on issue mrq/ai-voice-cloning#61 2023-03-07 14:24:00 +00:00
Memory Leak

CUDA can keep things cached. I have torch.cuda.empty_cache() added to get_device so it triggers every time the TTS system is reloaded. Supposedly stuff can remain cached, which can cause weird…
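
Roughly what that looks like, as a minimal sketch. The name get_device_name and its body are assumptions for illustration and are not the actual get_device from tortoise/utils/device.py; the only real API call relied on is torch.cuda.empty_cache().

```python
try:
    import torch  # optional here so the sketch still runs without PyTorch
except ImportError:
    torch = None


def get_device_name():
    """Hypothetical helper: pick a device and clear the CUDA cache first."""
    if torch is not None and torch.cuda.is_available():
        # Release unused cached blocks held by PyTorch's caching allocator.
        # Tensors that are still referenced are not freed by this call.
        torch.cuda.empty_cache()
        return "cuda"
    return "cpu"


print(get_device_name())
```

Since empty_cache() only releases memory that nothing references any more, a real leak (objects still held somewhere) would need to be tracked down separately.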

deviandice opened issue mrq/ai-voice-cloning#84 2023-03-07 14:13:34 +00:00
20% Inference Speed increase for Large VRAM (3090+) GPUs
deviandice pushed to main at deviandice/tortoise-tts 2023-03-07 14:05:29 +00:00
e650800447 Update 'tortoise/utils/device.py'
deviandice created repository deviandice/tortoise-tts 2023-03-07 12:56:58 +00:00
deviandice created repository deviandice/ai-voice-cloning 2023-03-07 12:55:45 +00:00
deviandice commented on issue mrq/ai-voice-cloning#78 2023-03-07 11:23:53 +00:00
Implement BigVGAN Full Fat

There's a separate config file for it. Here's the raw JSON.

Also, funny joke ;)

config.json
{
    "resblock": "1",
    "num_gpus": 0,
    "batch_size": 32,
    "learning_rate":…
deviandice opened issue mrq/ai-voice-cloning#78 2023-03-06 23:43:00 +00:00
Implement BigVGAN Full Fat
deviandice commented on issue mrq/ai-voice-cloning#74 2023-03-06 15:55:39 +00:00
Issues with einops on Windows

You can jury-rig it a bit. It has to do with whisperX. If you're not using it, just do the following in PowerShell. I had the same issue on Linux and this worked for me.

./venv/scripts/activate…
deviandice commented on issue mrq/ai-voice-cloning#59 2023-03-05 03:03:46 +00:00
Set Whisper default to Base-EN

Yeah, that sounds like a good middle ground. It's only the English model that gets this benefit anyway.

deviandice created repository deviandice/DL-Art-School 2023-03-04 17:56:14 +00:00
deviandice opened issue mrq/ai-voice-cloning#59 2023-03-04 01:27:34 +00:00
Set Whisper default to Base-EN
deviandice commented on issue mrq/ai-voice-cloning#52 2023-03-03 10:12:17 +00:00
Implement BigVGAN

Thanks for implementing this so quickly, and it's pretty neato that it's having a noticeable effect.

deviandice opened issue mrq/ai-voice-cloning#52 2023-03-03 04:21:28 +00:00
BigVGAN
deviandice commented on issue mrq/ai-voice-cloning#3 2023-02-18 23:28:31 +00:00
FileNotFoundError immediately after starting training

I had this error on Windows. I fixed it by dropping ffmpeg.exe into the root folder of the repo.
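
If you want to check this before starting training, here is a small sketch. It assumes the FileNotFoundError comes from ffmpeg not being found; find_ffmpeg is a made-up helper name and not part of the repo. It looks in a folder first, then falls back to PATH.

```python
import shutil
from pathlib import Path


def find_ffmpeg(repo_root="."):
    """Hypothetical helper: return a path to ffmpeg, or None if not found."""
    root = Path(repo_root)
    # First look for a copy dropped directly into the given folder.
    for name in ("ffmpeg.exe", "ffmpeg"):
        candidate = root / name
        if candidate.is_file():
            return str(candidate.resolve())
    # Otherwise fall back to whatever is on PATH (None if nothing is found).
    return shutil.which("ffmpeg")


if __name__ == "__main__":
    location = find_ffmpeg(".")
    print(location if location else "ffmpeg not found in this folder or on PATH")
```

Run it from the root folder of the repo. If it reports that nothing was found, copying ffmpeg.exe there (as above) or adding ffmpeg to PATH are the two things to try.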