vall-e

mrq/vall-e

History

mrq 9e1989be1b tweaked initial NAR pass's initial token embeddings to use a different value, or osmething		2024-08-03 09:01:37 -05:00
..
emb
engines	tweaks for the NAR-len model, maybe	2024-08-03 08:40:39 -05:00
ext
models	tweaked initial NAR pass's initial token embeddings to use a different value, or osmething	2024-08-03 09:01:37 -05:00
utils	oversight with using resize_modules	2024-08-02 20:28:49 -05:00
__init__.py
__main__.py
config.py	it actually wasn't working because Engines.__init__() automatically moves the entire module to the requested device, which was being called after offloading the model in the test trainer (and it seems I cant do it without injecting a bunch of shit in modeling_llama.py)	2024-08-01 20:56:28 -05:00
data.py	fixes, throw an exception when using NAR only model with non-unified position IDs, since for some reason it outputs garbage for the NAR	2024-08-02 22:25:49 -05:00
demo.py
export.py	fix weird regression in handling checkpoints when backend is local, but deepspeed checkpoints are in (it was handled with LoRA loading but not real loading...)	2024-07-30 22:15:56 -05:00
inference.py	fix weird regression in handling checkpoints when backend is local, but deepspeed checkpoints are in (it was handled with LoRA loading but not real loading...)	2024-07-30 22:15:56 -05:00
plot.py
samplers.py
train.py
webui.py