vall-e/vall_e
2025-02-24 21:03:23 -06:00
..
emb borrowed muon since it might better work under deepspeed and not require cruft (even though it really does not like the masked-NAR, also make the masked-NAR faux-causal since it might better help out for cfg.model.version >= 7 2025-02-23 17:23:24 -06:00
engines ugh 2025-02-23 21:20:36 -06:00
models when the 2025-02-24 21:03:23 -06:00
utils fixed grad norm and loss scale not reporting for local trainer 2025-02-23 19:08:26 -06:00
__init__.py Rewrite init 2023-08-02 21:53:35 +00:00
__main__.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00
config.py another experimental flag 2025-02-24 13:50:41 -06:00
data.py ugh 2025-02-23 12:31:07 -06:00
demo.py sanity checks (and I realized that the model actually had langs set to 4 in the yaml for KO/ZH so................ 2024-12-19 19:08:57 -06:00
export.py 2024-12-26 21:42:17 -06:00
inference.py added muon optimizer through kludge hacks because it necessitates a second optimizer in tandum that seems to only sometimes work with deepspeed 2025-02-23 11:22:13 -06:00
metrics.py instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons 2024-12-18 23:43:11 -06:00
plot.py very, very naive layerskip speculative sampling (it just checks if the current layer's state is good enough) 2024-11-02 11:49:05 -05:00
samplers.py agony 2025-02-12 00:18:24 -06:00
train.py maybe fix NaNs being thrown for immature models at fp16 for training evals 2025-02-24 18:25:54 -06:00
webui.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00