vall-e

mrq 69c1d2991f updated mixtral backend (need this for something else)		2025-01-20 21:50:56 -06:00
emb	actually do speaker verification	2024-12-17 10:11:14 -06:00
engines	more work (the wall is non-causal decoding......)	2024-12-22 20:11:31 -06:00
models	updated mixtral backend (need this for something else)	2025-01-20 21:50:56 -06:00
utils	agony	2024-12-21 22:52:10 -06:00
__init__.py
__main__.py	added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work	2025-01-12 21:52:49 -06:00
config.py	experimental	2025-01-05 19:05:00 -06:00
data.py	oops	2025-01-05 23:53:17 -06:00
demo.py	sanity checks (and I realized that the model actually had langs set to 4 in the yaml for KO/ZH so................	2024-12-19 19:08:57 -06:00
export.py		2024-12-26 21:42:17 -06:00
inference.py	added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work	2025-01-12 21:52:49 -06:00
metrics.py	instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons	2024-12-18 23:43:11 -06:00
plot.py
samplers.py	sort batches to try and reduce number of padded tokens in batched inference (also commented out F5 samples getting added to the demo page because I would have to regenerate them)	2024-12-11 22:45:38 -06:00
train.py	oops	2025-01-05 23:53:17 -06:00
webui.py	added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work	2025-01-12 21:52:49 -06:00