mrq/vall-e
vall-e/vall_e/models (be83ddabaa)
mrq | be83ddabaa | better causal-ness for split loss calc, and also do masking for NAR-len for it | 2024-11-13 10:17:52 -06:00
Name | Last commit | Date
arch | This better work | 2024-11-09 18:04:59 -06:00
__init__.py | unified nar.py into ar_nar.py | 2024-11-10 12:19:48 -06:00
ar_nar.py | do not pass timestep token/embedding since it doesn't seem to matter at all after all, fixed training masking rate to 80% because a paper said so | 2024-11-13 09:07:10 -06:00
base.py | better causal-ness for split loss calc, and also do masking for NAR-len for it | 2024-11-13 10:17:52 -06:00
experimental.py | moved prints to use logger, edited readme (fused_attn doesnt seem stable for training) | 2024-08-29 13:27:16 -05:00
lora.py