vall-e/vall_e
2025-03-11 20:33:09 -05:00
..
emb could have sworn this worked before, might have broke it when i decoupled from omegaconf 2025-03-01 19:30:26 -06:00
engines one more time one more time (this normalization isn't a spook) 2025-03-07 19:32:42 -06:00
models len prediction for new model (and remove logit normalization since it kills inferencing) 2025-03-11 20:33:09 -05:00
utils stuff for interfacing with the loss scaler value (because I want to cap it) 2025-03-06 17:07:29 -06:00
__init__.py
__main__.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00
config.py len prediction for new model (and remove logit normalization since it kills inferencing) 2025-03-11 20:33:09 -05:00
data.py another optimization (within the dataloader because the similar utterance sampler was mondo slow) 2025-03-08 17:10:50 -06:00
demo.py ugh 2025-02-28 01:06:38 -06:00
export.py 2024-12-26 21:42:17 -06:00
inference.py len prediction for new model (and remove logit normalization since it kills inferencing) 2025-03-11 20:33:09 -05:00
metrics.py instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons 2024-12-18 23:43:11 -06:00
plot.py
samplers.py agony 2025-02-12 00:18:24 -06:00
train.py ugh 2025-03-06 17:19:27 -06:00
webui.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00