mrq/vall-e
39f961abcd
vall-e/vall_e/engines
mrq
fe0f235335
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having require_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
__init__.py
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having require_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
base.py
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having require_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
deepspeed.py
local training backend should be a bit more aware of variable batch sizes, maybe
2024-06-28 22:39:05 -05:00