• https://git.ecker.tech/ aims to provide a place to share my efforts while maintaining true ownership of my code, as I do not trust GitHub.

    XMR: 4B9TQdkAkBFYrbj5ztvTx89e5LpucPeTSPzemCihdDi9EBnx7btn8RDNZTBz2zihWsjMnDkzn5As1LU6gLv3KQy8BLsZ8SG

  • Joined on 2022-10-10
mrq pushed to master at mrq/vall-e 2023-09-16 00:07:42 +00:00
22ffaf3a33 have loss for the NAR not-ignore the text prompt, I imagine this should help the NAR and explain why it's always had a bit of an issue with training
mrq commented on issue mrq/ai-voice-cloning#384 2023-09-15 21:03:34 +00:00
Why so many models? And about a thousand of other questions :)
  1. I understood that technically this is fine tuning a model. But which model are we fine tuning exactly?

With the default backend selected (Tortoise), you're finetuning Tortoise's…

mrq commented on issue mrq/ai-voice-cloning#386 2023-09-15 19:24:44 +00:00
Implementing XTTS by coqui?

Think I got a great understanding of how it functions. It'd be better to simply "subjugate" (for lack of a better term) the XTTS model

mrq commented on issue mrq/ai-voice-cloning#386 2023-09-15 13:42:34 +00:00
Implementing XTTS by coqui?

its very fast and the quality is pretty good also though the cloning isnt perfect ... but idk if you can figure out how to improve it or make it fine tunable..or use its inference to improve…

mrq commented on issue mrq/ai-voice-cloning#152 2023-09-15 13:20:18 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

Don't expect any updates for a while.

Both the full sized model and double-deepend models are being retrained from scratch and not stitched and glued from existing ARs to the monolithic…

mrq commented on issue mrq/ai-voice-cloning#383 2023-09-14 22:07:58 +00:00
SSL: CERTIFICATE_VERIFY_FAILED

Possibly a brief issue with either HuggingFace, or something funny either with your Python install or your connection (for example, ProtonVPN under some settings will treat catbox.moe as "unsafe"…

mrq commented on issue mrq/ai-voice-cloning#384 2023-09-14 22:04:27 +00:00
Why so many models? And about a thousand of other questions :)

about 700 clips (is that too much? First question haha)

The more the better. If I remember right, I've been able to get some finetune results with ~50.

I have 101 output models in the…

mrq commented on issue mrq/ai-voice-cloning#382 2023-09-13 21:01:52 +00:00
Blocked while training attempt - name 'str2optimizer8bit_blockwise' is not defined"

According to https://github.com/TimDettmers/bitsandbytes/issues/62 you should have a bunch of messages from BitsAndBytes hinting at a misconfigured system.

The solution seems to be passing in…

mrq pushed to master at mrq/vall-e 2023-09-13 18:18:16 +00:00
4aef798135 added picking final candidate based on sum of score instead of first candidate (this changes nothing).
mrq commented on issue mrq/ai-voice-cloning#381 2023-09-13 17:38:41 +00:00
training model from scratch

I haven't done any tests involving training a TorToiSe model from scratch, as per the original repo's README:

These models were trained on…

mrq pushed to master at mrq/vall-e 2023-09-13 02:27:06 +00:00
23a5fdd645 implemented a naive beam search (I really should be taking a break)
mrq pushed to master at mrq/vall-e 2023-09-13 02:21:35 +00:00
ee389e1992 implemented a naive beam search (I really should be taking a break)
mrq pushed to master at mrq/vall-e 2023-09-12 21:03:41 +00:00
a6ae344e5b some comments
mrq pushed to master at mrq/vall-e 2023-09-12 20:53:51 +00:00
d07c63b9d8 unified more things with training the AR+NAR monolothic model
mrq commented on issue mrq/ai-voice-cloning#377 2023-09-12 14:23:28 +00:00
Attempting to run training -- libcudart.so not t found

[Training] [2023-09-11T23:20:33.324092] local_state[k] = v[grad_accum_step] [Training] [2023-09-11T23:20:33.324092] IndexError: list index out of range

Your batch size isn't evenly…

mrq pushed to master at mrq/vall-e 2023-09-11 19:12:47 +00:00
40ef34e1ca this embedding class definitely works, and migrating from the previous embedding weights seems to work.
mrq commented on issue mrq/ai-voice-cloning#152 2023-09-11 19:11:19 +00:00
VALL-E Integration (and In Response To TorToiSe: a Quick Retrospective)

@mrq have you tried https://github.com/Plachtaa/vallex-webui ? the author say they use https://github.com/lifeiteng/vall-e for training code

I gave it a cursory glance and I find it rather…

mrq pushed to master at mrq/vall-e 2023-09-11 01:32:25 +00:00
a1f250ffac set default max_levels for NAR to 0 and implicitly set it to max resps levels because the previous way was implicitly assuming all models were outputting at 1+7 RVQ bins.
mrq pushed to master at mrq/vall-e 2023-09-10 20:49:45 +00:00
671dca88ee throw error when no reference audio is provided in the web UI because someone keeps doing that in the HF space
mrq pushed to master at mrq/vall-e 2023-09-10 18:49:07 +00:00
ba71020318 added option to limit (or exceed) inferenced RVQ-bin levels through the NAR