vall-e

History

mrq 9090c34f10 cringe script to process seed-tts-eval's eval dataset into something i can easily use		2024-12-17 22:47:12 -06:00
..
emb	actually do speaker verification	2024-12-17 10:11:14 -06:00
engines	remove nan checks because it causes problems in distributed training because I'm not syncing between GPUs (and nan losses gets ignored anyways with loss scaling)	2024-12-15 09:42:54 -06:00
models	actually do speaker verification	2024-12-17 10:11:14 -06:00
utils	actually do speaker verification	2024-12-17 10:11:14 -06:00
__init__.py
__main__.py
config.py
data.py	tweaks to prompt duration to allow me to divorce how i use it for training with how I'm using it for the demo page, and demo page tweaks to make my life easier	2024-12-17 19:33:04 -06:00
demo.py	cringe script to process seed-tts-eval's eval dataset into something i can easily use	2024-12-17 22:47:12 -06:00
export.py
inference.py	tweaks to prompt duration to allow me to divorce how i use it for training with how I'm using it for the demo page, and demo page tweaks to make my life easier	2024-12-17 19:33:04 -06:00
metrics.py	actually do proper wer/cer calculation by un-normalizing the scores	2024-12-17 14:22:30 -06:00
plot.py
samplers.py
train.py	remove nan checks because it causes problems in distributed training because I'm not syncing between GPUs (and nan losses gets ignored anyways with loss scaling)	2024-12-15 09:42:54 -06:00
webui.py	really shoddy voice conversion implementation (it sort of works...)	2024-12-16 22:54:53 -06:00