mrq/vall-e
vall-e/vall_e/models
mrq
fe0f235335
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having requires_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
..
arch
mamba2-hf using vasqu/mamba2-torch because it lets me use mamba2 without triton ops (training with my 4xV100s is not happy with mamba2 because of triton)
2024-06-14 19:42:17 -05:00
__init__.py
sanity cleanup: moved experimental features under their own thing
2024-06-30 10:37:33 -05:00
ar_nar.py
allow loading a different model within the web UI (apparently I did not have the web UI in the documentation)
2024-07-15 19:59:48 -05:00
base.py
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having requires_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
experimental.py
sanity cleanup
2024-07-04 15:58:08 -05:00
lora.py
mechanism to store the model config inside the weights and load them, some other things to allow LoRA training on the RetNet (gradient checkpointing will gripe about inputs not having requires_grad and nothing seems to remedy it)
2024-07-16 18:23:13 -05:00
nar.py
allow loading a different model within the web UI (apparently I did not have the web UI in the documentation)
2024-07-15 19:59:48 -05:00