vall-e/vall_e/models
2024-11-13 10:17:52 -06:00
..
arch This better work 2024-11-09 18:04:59 -06:00
__init__.py unified nar.py into ar_nar.py 2024-11-10 12:19:48 -06:00
ar_nar.py do not pass timestep token/embedding since it doesn't seem to matter at all after all, fixed training masking rate to 80% because a paper said so 2024-11-13 09:07:10 -06:00
base.py better causal-ness for split loss calc, and also do masking for NAR-len for it 2024-11-13 10:17:52 -06:00
experimental.py moved prints to use logger, edited readme (fused_attn doesnt seem stable for training) 2024-08-29 13:27:16 -05:00
lora.py