vall-e/vall_e
2025-02-23 11:36:32 -06:00
..
emb fixes fixes fixes (a quarter of my recently processed audio returned zero'd tensors......) 2025-02-22 09:07:33 -06:00
engines separate mask token and stop token because this might cause issues 2025-02-23 11:36:32 -06:00
models separate mask token and stop token because this might cause issues 2025-02-23 11:36:32 -06:00
utils separate mask token and stop token because this might cause issues 2025-02-23 11:36:32 -06:00
__init__.py
__main__.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00
config.py fixes fixes fixes (a quarter of my recently processed audio returned zero'd tensors......) 2025-02-22 09:07:33 -06:00
data.py (finally) added parallel AR for cfg.model.version >= 7 (nvidia/audio-codec-44khz is being a pain and it might require training purely AR first......) 2025-02-23 08:31:03 -06:00
demo.py
export.py 2024-12-26 21:42:17 -06:00
inference.py added muon optimizer through kludge hacks because it necessitates a second optimizer in tandum that seems to only sometimes work with deepspeed 2025-02-23 11:22:13 -06:00
metrics.py
plot.py
samplers.py agony 2025-02-12 00:18:24 -06:00
train.py oops 2025-01-05 23:53:17 -06:00
webui.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00