vall-e/vall_e
2024-08-03 09:01:37 -05:00
..
emb
engines tweaks for the NAR-len model, maybe 2024-08-03 08:40:39 -05:00
ext
models tweaked initial NAR pass's initial token embeddings to use a different value, or osmething 2024-08-03 09:01:37 -05:00
utils oversight with using resize_modules 2024-08-02 20:28:49 -05:00
__init__.py
__main__.py
config.py it actually wasn't working because Engines.__init__() automatically moves the entire module to the requested device, which was being called after offloading the model in the test trainer (and it seems I cant do it without injecting a bunch of shit in modeling_llama.py) 2024-08-01 20:56:28 -05:00
data.py fixes, throw an exception when using NAR only model with non-unified position IDs, since for some reason it outputs garbage for the NAR 2024-08-02 22:25:49 -05:00
demo.py
export.py fix weird regression in handling checkpoints when backend is local, but deepspeed checkpoints are in (it was handled with LoRA loading but not real loading...) 2024-07-30 22:15:56 -05:00
inference.py fix weird regression in handling checkpoints when backend is local, but deepspeed checkpoints are in (it was handled with LoRA loading but not real loading...) 2024-07-30 22:15:56 -05:00
plot.py
samplers.py
train.py
webui.py