• https://git.ecker.tech/ aims to provide a place to share my efforts while maintaining true ownership of my code, as I do not trust GitHub.

    XMR: 4B9TQdkAkBFYrbj5ztvTx89e5LpucPeTSPzemCihdDi9EBnx7btn8RDNZTBz2zihWsjMnDkzn5As1LU6gLv3KQy8BLsZ8SG

  • Joined on 2022-10-10
mrq pushed to master at mrq/vall-e 2023-09-07 23:18:49 +00:00
8837bc34d7 added option to specify parameters to freeze per-model in YAML (because I need to see about committing atrocities with converting an AR into an AR+NAR)
mrq commented on issue mrq/ai-voice-cloning#373 2023-09-07 22:20:50 +00:00
RuntimeError: CUDA error: the launch timed out and was terminated CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.

Is this because the input is too long? Should I split it up with the line delimiter?

No stop tokens found in one of the generated voice clips. This typically means the spoken audio is too…

mrq pushed to master at mrq/vall-e 2023-09-07 22:11:01 +00:00
c47fc3274e added backwards compat flag
mrq pushed to master at mrq/vall-e 2023-09-07 22:07:33 +00:00
ab5134f385 tweaks and fixes
mrq pushed to master at mrq/vall-e 2023-09-07 21:47:14 +00:00
b2c2dec291 added homebrewed per-RVQ-bin embedding solutions
mrq pushed to master at mrq/vall-e 2023-09-07 14:13:12 +00:00
e7a67410d1 oops
mrq pushed to master at mrq/vall-e 2023-09-07 01:32:13 +00:00
712808494f added support for optional prodigy optimizer (https://github.com/konstmish/prodigy) although it consumes a lot more VRAM per parameter
mrq commented on issue mrq/ai-voice-cloning#152 2023-09-07 00:55:54 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

I feel rather silly.

I imagine the lifeiteng/vall-e implementation had the right idea with having an (almost) single model that handles both AR and NAR…

mrq pushed to master at mrq/vall-e 2023-09-07 00:46:34 +00:00
7ce06432fd fixed the AR+NAR dual model, the resp_emb has to be split up (classifier might too)
mrq pushed to master at mrq/vall-e 2023-09-07 00:14:21 +00:00
0e15aa7e33 fixed the AR+NAR dual model, the resp_emb has to be split up (classifier might too)
mrq pushed to master at mrq/vall-e 2023-09-06 23:58:05 +00:00
100ca6b7d0 added option to use SGD optimizer through the YAML, added option to pass in additional optimizer parameters through the YAML, added experimental unified AR+NAR model (does not seem fruitful in testing)
mrq commented on issue mrq/ai-voice-cloning#152 2023-09-06 03:06:45 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

mmm... I guess I'm due for a bit of an update.

  • 15% through my dataset's "epoch" (out of ~5640894 samples, I think I should have a better metric that should calculate how many iterations it will…
mrq pushed to master at mrq/vall-e 2023-09-05 20:37:09 +00:00
451726fdd5 added ability to disable activation checkpointing through the YAML (it is very VRAM intensive at double layer size)
mrq commented on issue mrq/ai-voice-cloning#370 2023-09-05 18:00:44 +00:00
No Module named Deepspeed

DeepSpeed isn't available on Windows natively without going through a nightmare of hoops to get it compiled and installed. The best bet is to have an environment under WSL2 but you're still under…

mrq pushed to master at mrq/ai-voice-cloning 2023-09-04 22:09:19 +00:00
5f80ee9b38 set use-deepspeed to false because it's not a dependency and installing it as a dependency under windows is a huge nightmare
mrq pushed to master at mrq/tortoise-tts 2023-09-04 22:07:41 +00:00
d7e6914fb8 Merge pull request 'main' (#47) from ken11o2/tortoise-tts:main into main
b7c7fd1c5f add arg use_deepspeed
2478dc255e update TextToSpeech
18adfaf785 add use_deepspeed to constructor and update method post_init_gpt2_config
ac97c17bf7 add use_deepspeed
mrq pushed to master at mrq/ai-voice-cloning 2023-09-04 22:04:06 +00:00
29c270d1cc master (#369)
mrq merged pull request mrq/ai-voice-cloning#369 2023-09-04 22:04:04 +00:00
master
mrq commented on pull request mrq/ai-voice-cloning#369 2023-09-04 20:04:18 +00:00
master

ai_voice_cloning_edited_to_use_deepspeed.zip

Resubmit the PR without this file, please.

mrq merged pull request mrq/tortoise-tts#47 2023-09-04 20:01:18 +00:00
main