This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
cce929e136
vall-e
/
vall_e
/
utils
History
mrq
cce929e136
nasty hotfix for transformer's Mixtral throwing an error when batch sizes > 1
2024-01-26 19:41:12 -06:00
..
__init__.py
distributed.py
added torchscale XMOE integration (because Mixtral 8x7B seems very promising and I want to see if it works)
2023-12-20 18:45:58 -06:00
sampler.py
trainer.py
nasty hotfix for transformer's Mixtral throwing an error when batch sizes > 1
2024-01-26 19:41:12 -06:00
utils.py
nasty hotfix for transformer's Mixtral throwing an error when batch sizes > 1
2024-01-26 19:41:12 -06:00
wrapper.py
tweaks to try and get deepspeed quantized inferencing, validating bitsandbytes and deepspeed quantization, nothing seems to work
2023-10-12 22:21:43 -05:00