vall-e/vall_e
2024-12-27 18:16:57 -06:00
..
emb actually do speaker verification 2024-12-17 10:11:14 -06:00
engines more work (the wall is non-causal decoding......) 2024-12-22 20:11:31 -06:00
models 2024-12-26 21:42:17 -06:00
utils agony 2024-12-21 22:52:10 -06:00
__init__.py
__main__.py doc update, added automatically deducing language from a given text, also checks if the input is already phonemized text to allow direct control without being cringe (procrastinating adding WER/SIM-O) 2024-12-07 22:34:25 -06:00
config.py ugh 2024-12-20 17:13:37 -06:00
data.py 2024-12-26 21:42:17 -06:00
demo.py sanity checks (and I realized that the model actually had langs set to 4 in the yaml for KO/ZH so................ 2024-12-19 19:08:57 -06:00
export.py 2024-12-26 21:42:17 -06:00
inference.py when you do more training thinking the original model that can do NS/SR got deleted but it was actually a string not having its quotes in the right place....... 2024-12-27 18:16:57 -06:00
metrics.py instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons 2024-12-18 23:43:11 -06:00
plot.py
samplers.py sort batches to try and reduce number of padded tokens in batched inference (also commented out F5 samples getting added to the demo page because I would have to regenerate them) 2024-12-11 22:45:38 -06:00
train.py remove nan checks because it causes problems in distributed training because I'm not syncing between GPUs (and nan losses gets ignored anyways with loss scaling) 2024-12-15 09:42:54 -06:00
webui.py 2024-12-26 21:42:17 -06:00