vall-e/vall_e
2025-03-08 17:10:50 -06:00
emb could have sworn this worked before, might have broke it when i decoupled from omegaconf 2025-03-01 19:30:26 -06:00
engines one more time one more time (this normalization isn't a spook) 2025-03-07 19:32:42 -06:00
models another optimization (within the dataloader because the similar utterance sampler was mondo slow) 2025-03-08 17:10:50 -06:00
utils stuff for interfacing with the loss scaler value (because I want to cap it) 2025-03-06 17:07:29 -06:00
__init__.py Rewrite init 2023-08-02 21:53:35 +00:00
__main__.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00
config.py oops 2025-03-07 18:57:25 -06:00
data.py another optimization (within the dataloader because the similar utterance sampler was mondo slow) 2025-03-08 17:10:50 -06:00
demo.py ugh 2025-02-28 01:06:38 -06:00
export.py 2024-12-26 21:42:17 -06:00
inference.py segregated experimental changes into its own streamlined file to avoid breaking the existing model, and it can pivot to the cleaned up code if it actually works (nothing is working) 2025-02-26 21:26:13 -06:00
metrics.py instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons 2024-12-18 23:43:11 -06:00
plot.py very, very naive layerskip speculative sampling (it just checks if the current layer's state is good enough) 2024-11-02 11:49:05 -05:00
samplers.py agony 2025-02-12 00:18:24 -06:00
train.py ugh 2025-03-06 17:19:27 -06:00
webui.py added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work 2025-01-12 21:52:49 -06:00