vall-e

mrq 59f56ad099 cleaup	2024-12-24 23:14:32 -06:00
..
emb	actually do speaker verification	2024-12-17 10:11:14 -06:00
engines	more work (the wall is non-causal decoding......)	2024-12-22 20:11:31 -06:00
models	cleaup	2024-12-24 23:14:32 -06:00
utils	agony	2024-12-21 22:52:10 -06:00
__init__.py	Rewrite init	2023-08-02 21:53:35 +00:00
__main__.py	doc update, added automatically deducing language from a given text, also checks if the input is already phonemized text to allow direct control without being cringe (procrastinating adding WER/SIM-O)	2024-12-07 22:34:25 -06:00
config.py	ugh	2024-12-20 17:13:37 -06:00
data.py	corrected export.py's --hf	2024-12-20 15:17:13 -06:00
demo.py	sanity checks (and I realized that the model actually had langs set to 4 in the yaml for KO/ZH so................	2024-12-19 19:08:57 -06:00
export.py	ugh	2024-12-22 16:15:24 -06:00
inference.py	added extremely barebones vall_e.cpp so I can stop having to juggle this file around so much	2024-12-21 10:57:02 -06:00
metrics.py	instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons	2024-12-18 23:43:11 -06:00
plot.py	very, very naive layerskip speculative sampling (it just checks if the current layer's state is good enough)	2024-11-02 11:49:05 -05:00
samplers.py	sort batches to try and reduce number of padded tokens in batched inference (also commented out F5 samples getting added to the demo page because I would have to regenerate them)	2024-12-11 22:45:38 -06:00
train.py	remove nan checks because it causes problems in distributed training because I'm not syncing between GPUs (and nan losses gets ignored anyways with loss scaling)	2024-12-15 09:42:54 -06:00
webui.py	exposed additional task (ns, sr, vc) (vc is experimental)	2024-12-20 11:15:29 -06:00