vall-e

mrq 2fb2b732fc wow that was fast	2025-03-04 23:17:18 -06:00
emb	could have sworn this worked before, might have broke it when i decoupled from omegaconf	2025-03-01 19:30:26 -06:00
engines	a birdie tells me i should probably use a different optimizer (also preliminary support for native sparse attention but I don't know if I'll use it)	2025-03-04 14:53:02 -06:00
models	wow that was fast	2025-03-04 23:17:18 -06:00
utils	a birdie tells me i should probably use a different optimizer (also preliminary support for native sparse attention but I don't know if I'll use it)	2025-03-04 14:53:02 -06:00
__init__.py	Rewrite init	2023-08-02 21:53:35 +00:00
__main__.py	added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work	2025-01-12 21:52:49 -06:00
config.py	could have sworn this worked before, might have broke it when i decoupled from omegaconf	2025-03-01 19:30:26 -06:00
data.py	tweaks	2025-03-02 22:36:25 -06:00
demo.py	ugh	2025-02-28 01:06:38 -06:00
export.py		2024-12-26 21:42:17 -06:00
inference.py	segregated experimental changes into its own streamlined file to avoid breaking the existing model, and it can pivot to the cleaned up code if it actually works (nothing is working)	2025-02-26 21:26:13 -06:00
metrics.py	instead just compute a bunch of stuff on the transcriptions to store later in different names so I can just retrieve what I want, also added tongue twisters for nefarious reasons	2024-12-18 23:43:11 -06:00
plot.py	very, very naive layerskip speculative sampling (it just checks if the current layer's state is good enough)	2024-11-02 11:49:05 -05:00
samplers.py	agony	2025-02-12 00:18:24 -06:00
train.py	require minimum of 1 second durations for training because of my slop code auto-transposing that I don't wanna fix right now	2025-02-26 22:00:33 -06:00
webui.py	added option to playback audio directly, removed no-phonemize option since I swear it worked in testing but it doesn't actually work	2025-01-12 21:52:49 -06:00