vall-e/vall_e
2024-11-19 10:30:05 -06:00
..
emb
engines
ext
models cleaned up classifier-free guidance logit processing (in order to try and cope with a bad nar-len model) 2024-11-19 10:30:05 -06:00
utils default set cfg strength to 3.0 since the reference model is updated 2024-11-17 10:23:40 -06:00
__init__.py
__main__.py
config.py normalize sampler index by batch size (if not using batched sampler), add option to cap out utterances for a speaker, some other things 2024-11-18 12:46:50 -06:00
data.py oops 2024-11-18 14:12:26 -06:00
demo.py set option to set training masking ratio (I don't think for tts a fixed masking ratio is beneficial since the magic of the AR+NAR is being able to still reference the prior sequence of tokens for predicting things) 2024-11-17 17:04:07 -06:00
export.py two weeks of agony concludes 2024-11-18 21:29:28 -06:00
inference.py two weeks of agony concludes 2024-11-18 21:29:28 -06:00
plot.py
samplers.py cleaned up classifier-free guidance logit processing (in order to try and cope with a bad nar-len model) 2024-11-19 10:30:05 -06:00
train.py default set cfg strength to 3.0 since the reference model is updated 2024-11-17 10:23:40 -06:00
webui.py default set cfg strength to 3.0 since the reference model is updated 2024-11-17 10:23:40 -06:00