This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
a174c33db6
vall-e
/
vall_e
/
engines
History
mrq
ceecac6ffe
I think I made resp_parallel_training=True faster with loss factoring?
2025-02-26 23:13:32 -06:00
..
__init__.py
lol
2025-02-26 10:49:06 -06:00
base.py
I think I made resp_parallel_training=True faster with loss factoring?
2025-02-26 23:13:32 -06:00
deepspeed.py
added muon optimizer through kludge hacks because it necessitates a second optimizer in tandum that seems to only sometimes work with deepspeed
2025-02-23 11:22:13 -06:00