vall-e/vall_e
2024-05-11 16:47:19 -05:00
..
emb oops 2024-05-09 22:33:40 -05:00
engines remove redundant loss, tweak readme 2024-05-11 15:02:47 -05:00
ext backwards compat for old YAMLs with models, option to set flash attention 2 for Llama (and derivatives), included syncdoth/RetNets torchscale retnet for shits and grins, etc. 2024-04-16 10:02:31 -05:00
models ugh 2024-05-11 16:47:19 -05:00
utils ugh 2024-05-11 16:47:19 -05:00
__init__.py Rewrite init 2023-08-02 21:53:35 +00:00
__main__.py deprecate sole AR/NAR model by only keeping the AR+NAR (the beauty of no one using this is that I can break compat as much as I want), add tone token for when I classify my dataset with tone/emotion in the future, some other things 2024-04-15 19:54:32 -05:00
config.py sanitizing 2024-05-11 16:31:05 -05:00
data.py might just be better to explicitly define prompt duration ranges, especially under a "train small contexts then increase it" training paradigm 2024-05-11 09:50:54 -05:00
export.py cleanup, use deepspeed inferencing pathway if requested 2023-10-09 15:24:04 -05:00
inference.py added option to specify frames per second for the given audio representation (Encodec is 75Hz, DAC is 41Hz (at 24K sources)) 2024-05-04 12:05:41 -05:00
plot.py deprecate sole AR/NAR model by only keeping the AR+NAR (the beauty of no one using this is that I can break compat as much as I want), add tone token for when I classify my dataset with tone/emotion in the future, some other things 2024-04-15 19:54:32 -05:00
samplers.py separated samplers into its own file, don't bother copying the logits back to the GPU after sampling, it's not necessary 2023-10-11 12:25:31 -05:00
train.py simple DDP wrapper (for my NVlink test) 2024-05-04 11:48:26 -05:00
webui.py backwards compat for old YAMLs with models, option to set flash attention 2 for Llama (and derivatives), included syncdoth/RetNets torchscale retnet for shits and grins, etc. 2024-04-16 10:02:31 -05:00