This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
87db03dd93
vall-e
/
vall_e
/
models
History
mrq
87db03dd93
trim the input prompt to 3 seconds when training NAR tasks (marked as experimental; the paper mentions doing so, but I don't know how much this would harm the retention heads)
2023-10-09 22:03:58 -05:00
..
__init__.py
added option to use SGD optimizer through the YAML, added option to pass in additional optimizer parameters through the YAML, added experimental unified AR+NAR model (does not seem fruitful in testing)
2023-09-06 18:58:35 -05:00
adaln.py
Tweaks
2023-08-02 22:06:39 +00:00
ar_nar.py
trim the input prompt to 3 seconds when training NAR tasks (marked as experimental; the paper mentions doing so, but I don't know how much this would harm the retention heads)
2023-10-09 22:03:58 -05:00
ar.py
restructured some things with the model to remove dead weights
2023-09-20 19:10:59 -05:00
base.py
reduced dynamic temperature threshold to > 1.0, as it seems to not quite be useful for audio LMs, sped up any sampling that touches logits by copying them to CPU first, as accessing tensors on the GPU is slow as balls)
2023-10-09 14:46:17 -05:00
nar.py
trim the input prompt to 3 seconds when training NAR tasks (marked as experimental; the paper mentions doing so, but I don't know how much this would harm the retention heads)
2023-10-09 22:03:58 -05:00
retnet.py
restructured some things with the model to remove dead weights
2023-09-20 19:10:59 -05:00
transformer.py
added ability to disable activation checkpointing through the YAML (it is very VRAM intensive at double layer size)
2023-09-05 15:38:21 -05:00