This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
7075c2a5f0
vall-e
/
vall_e
/
utils
History
mrq
f3c59c3e7e
cleaner replacement code (because I realized BitNet had an implementation for it too), added calculating gradient norm and performing gradient clipping in local trainer (non-deepspeed)
2024-03-01 20:18:43 -06:00
..
__init__.py
Rewrite init
2023-08-02 21:53:35 +00:00
distributed.py
logger broke for some reason, added flag to just tqdm.write instead, make cfg.bitsandbytes.bitnet==True yamls denoted since I'm sure they're not interoperable
2024-03-01 10:32:35 -06:00
sampler.py
added per-speaker samplers
2023-09-03 21:27:13 -05:00
trainer.py
logger broke for some reason, added flag to just tqdm.write instead, make cfg.bitsandbytes.bitnet==True yamls denoted since I'm sure they're not interoperable
2024-03-01 10:32:35 -06:00
utils.py
nasty hotfix for transformer's Mixtral throwing an error when batch sizes > 1
2024-01-26 19:41:12 -06:00
wrapper.py
cleaner replacement code (because I realized BitNet had an implementation for it too), added calculating gradient norm and performing gradient clipping in local trainer (non-deepspeed)
2024-03-01 20:18:43 -06:00