vall-e/vall_e
2024-06-29 23:42:30 -05:00
emb added summing of external embeddings (at this point I don't think any amount of cope bandaids will get DAC to train nicely; I think the RVQ levels of the NAR tend to add too much noise if they're not accurate) 2024-06-29 23:42:30 -05:00
engines change from chunk-slicing paths for distributed dataloader to instead interleave 2024-06-29 10:10:35 -05:00
ext fixes 2024-06-04 00:07:00 -05:00
models added summing of external embeddings (at this point I don't think any amount of cope bandaids will get DAC to train nicely; I think the RVQ levels of the NAR tend to add too much noise if they're not accurate) 2024-06-29 23:42:30 -05:00
utils cleaned up subjugated audio embedding into a flag, flag can also have it include the original, underlying embedding as well (it seems to do better when set to inclusive) 2024-06-29 21:46:35 -05:00
__init__.py Rewrite init 2023-08-02 21:53:35 +00:00
__main__.py backport fix from tortoise_tts with local trainer + loading state when training lora 2024-06-25 13:41:29 -05:00
config.py cleaned up subjugated audio embedding into a flag, flag can also have it include the original, underlying embedding as well (it seems to do better when set to inclusive) 2024-06-29 21:46:35 -05:00
data.py ugh 2024-06-29 22:14:35 -05:00
export.py actually make deepspeed work with LoRAs 2024-06-17 13:55:37 -05:00
inference.py backport fix from tortoise_tts with local trainer + loading state when training lora 2024-06-25 13:41:29 -05:00
plot.py ugh 2024-06-09 11:39:43 -05:00
samplers.py ban stop token for NAR levels (because sometimes it gets sampled and causes problems) 2024-06-17 22:14:43 -05:00
train.py nasty bandaid because some of my DAC dataset only has 8 RVQ levels instead of the full 9 2024-06-29 10:16:37 -05:00
webui.py validated that inferencing works, changed some defaults (NAR benefits from greedy sampling) 2024-06-09 17:11:38 -05:00
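
The commit message on the `engines` entry mentions switching the distributed dataloader from chunk-slicing paths to interleaving them. The sketch below illustrates the general difference between the two ways of splitting a path list across ranks. It is not the repository's code: the function names `chunk_slice` and `interleave`, the `rank` and `world_size` parameters, and the sample paths are all hypothetical.

```python
def chunk_slice(paths, rank, world_size):
    """Give each rank one contiguous block of the path list."""
    per_rank = -(-len(paths) // world_size)  # ceiling division
    return paths[rank * per_rank:(rank + 1) * per_rank]


def interleave(paths, rank, world_size):
    """Give each rank every world_size-th path, starting at its own rank."""
    return paths[rank::world_size]


if __name__ == "__main__":
    # Hypothetical path names, for illustration only.
    paths = [f"speaker_{i}" for i in range(7)]
    for rank in range(3):
        print(rank, chunk_slice(paths, rank, 3), interleave(paths, rank, 3))
```

With a contiguous split, the last rank can end up with a noticeably shorter block, and if the list is sorted each rank only sees one region of it. Interleaving keeps the per-rank counts within one item of each other and spreads neighbouring paths across ranks, which is one plausible motivation for such a change.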