vall-e/vall_e
2024-11-21 13:18:11 -06:00
..
emb
engines oops 2024-11-20 19:21:03 -06:00
ext
models added mixed modality AR+NAR-len to generate a short prefix through the AR, then inference with said prefix through the NAR-len (need to experiment with it more to ensure that the masked off tokens are the only tokens getting updated) 2024-11-20 14:22:12 -06:00
utils
__init__.py
__main__.py moved stuff in the web UI around (un-experimented the max NAR-len steps because its kind of important to adjust this value for better sounding audio / quicker generated audio) 2024-11-20 20:37:33 -06:00
config.py I cannot believe it's not actually called Wand DB (added wandb logging support since I think it would have been a much better way to look at my metrics) 2024-11-20 16:10:47 -06:00
data.py added more harvard sentences to load from a text file 2024-11-21 13:18:11 -06:00
demo.py moved stuff in the web UI around (un-experimented the max NAR-len steps because its kind of important to adjust this value for better sounding audio / quicker generated audio) 2024-11-20 20:37:33 -06:00
export.py
inference.py added more harvard sentences to load from a text file 2024-11-21 13:18:11 -06:00
plot.py
samplers.py
train.py
webui.py moved stuff in the web UI around (un-experimented the max NAR-len steps because its kind of important to adjust this value for better sounding audio / quicker generated audio) 2024-11-20 20:37:33 -06:00