vall-e/vall_e
2025-02-28 22:15:37 -06:00
..
emb borrowed muon since it might better work under deepspeed and not require cruft (even though it really does not like the masked-NAR, also make the masked-NAR faux-causal since it might better help out for cfg.model.version >= 7 2025-02-23 17:23:24 -06:00
engines lol 2025-02-28 22:15:37 -06:00
models the learning rate scheduler pill is a tough pill to swallow 2025-02-28 22:12:19 -06:00
utils the learning rate scheduler pill is a tough pill to swallow 2025-02-28 22:12:19 -06:00
__init__.py
__main__.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00
config.py the learning rate scheduler pill is a tough pill to swallow 2025-02-28 22:12:19 -06:00
data.py fixes... 2025-02-28 18:53:07 -06:00
demo.py ugh 2025-02-28 01:06:38 -06:00
export.py 2024-12-26 21:42:17 -06:00
inference.py segregated experimental changes into its own streamlined file to avoid breaking the existing model, and it can pivot to the cleaned up code if it actually works (nothing is working) 2025-02-26 21:26:13 -06:00
metrics.py instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons 2024-12-18 23:43:11 -06:00
plot.py
samplers.py agony 2025-02-12 00:18:24 -06:00
train.py require minimum of 1 second durations for training because of my slop code auto-transposing that I don't wanna fix right now 2025-02-26 22:00:33 -06:00
webui.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00