This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
a6bfe43590
vall-e
/
vall_e
/
models
History
mrq
a6bfe43590
added mirostat sampling (given a partially trained model, it got far decent output than I expected, need to test on a better trained model)
2023-09-18 18:55:41 -05:00
..
__init__.py
added option to use SGD optimizer through the YAML, added option to pass in additional optimizer parameters through the YAML, added experimental unified AR+NAR model (does not seem fruitful in testing)
2023-09-06 18:58:35 -05:00
adaln.py
ar_nar.py
added mirostat sampling (given a partially trained model, it got far decent output than I expected, need to test on a better trained model)
2023-09-18 18:55:41 -05:00
ar.py
added mirostat sampling (given a partially trained model, it got far decent output than I expected, need to test on a better trained model)
2023-09-18 18:55:41 -05:00
base.py
added mirostat sampling (given a partially trained model, it got far decent output than I expected, need to test on a better trained model)
2023-09-18 18:55:41 -05:00
nar.py
added mirostat sampling (given a partially trained model, it got far decent output than I expected, need to test on a better trained model)
2023-09-18 18:55:41 -05:00
retnet.py
somewhat got recurrent forward working (it's as accurate as chunkwise forward: it's not accurate at all), added option to use AMP instead of blanket setting the weight's dtype
2023-09-01 20:58:29 -05:00
transformer.py
added ability to disable activation checkpointing through the YAML (it is very VRAM intensive at double layer size)
2023-09-05 15:38:21 -05:00