This website requires JavaScript.
Explore
Help
Register
Sign In
mrq
/
vall-e
Watch
5
Star
9
Fork
0
You've already forked vall-e
Code
Issues
8
Pull Requests
Packages
Projects
Releases
Wiki
Activity
b2907ae7e0
vall-e
/
vall_e
/
models
History
mrq
b2907ae7e0
seems that my PromEmbedding/RespEmbedding doesn't actually work all that well, naively using dedicated MultiEmbeddings for AR/NAR in the monolithic model is the best way to go
2023-09-08 01:03:24 -05:00
..
__init__.py
added option to use SGD optimizer through the YAML, added option to pass in additional optimizer parameters through the YAML, added experimental unified AR+NAR model (does not seem fruitful in testing)
2023-09-06 18:58:35 -05:00
adaln.py
ar_nar.py
seems that my PromEmbedding/RespEmbedding doesn't actually work all that well, naively using dedicated MultiEmbeddings for AR/NAR in the monolithic model is the best way to go
2023-09-08 01:03:24 -05:00
ar.py
tweaks and fixes
2023-09-07 17:08:38 -05:00
base.py
seems that my PromEmbedding/RespEmbedding doesn't actually work all that well, naively using dedicated MultiEmbeddings for AR/NAR in the monolithic model is the best way to go
2023-09-08 01:03:24 -05:00
nar.py
tweaks and fixes
2023-09-07 17:08:38 -05:00
retnet.py
somewhat got recurrent forward working (it's as accurate as chunkwise forward: it's not accurate at all), added option to use AMP instead of blanket setting the weight's dtype
2023-09-01 20:58:29 -05:00
transformer.py
added ability to disable activation checkpointing through the YAML (it is very VRAM intensive at double layer size)
2023-09-05 15:38:21 -05:00