mrq/vall-e
vall-e/vall_e/utils @ 5e9d1a5302
Latest commit: 2dd80a03ff by mrq (2025-03-06 17:07:29 -06:00): stuff for interfacing with the loss scaler value (because I want to cap it)
ext: made muon actually work by actually utilizing param groups (thanks APOLLO for reminding me this is the sane way to handle this split) (2025-02-26 10:39:13 -06:00)
__init__.py: tweaks (2025-03-02 22:36:25 -06:00)
distributed.py
io.py: agony (2024-12-21 22:52:10 -06:00)
ml.py: a birdie tells me i should probably use a different optimizer (also preliminary support for native sparse attention but I don't know if I'll use it) (2025-03-04 14:53:02 -06:00)
pattern.py
sampler.py: tweaks to bucket sampling (2024-11-13 11:09:24 -06:00)
trainer.py: stuff for interfacing with the loss scaler value (because I want to cap it) (2025-03-06 17:07:29 -06:00)
utils.py: tweaks (2025-03-02 22:36:25 -06:00)