vall-e/vall_e/models
2024-05-04 13:13:52 -05:00
..
__init__.py deprecate sole AR/NAR model by only keeping the AR+NAR (the beauty of no one using this is that I can break compat as much as I want), add tone token for when I classify my dataset with tone/emotion in the future, some other things 2024-04-15 19:54:32 -05:00
adaln.py
ar_nar.py added option to specify frames per second for the given audio representation (Encodec is 75Hz, DAC is 41Hz (at 24K sources)) 2024-05-04 12:05:41 -05:00
base.py forgot to disable verbose flag 2024-05-04 13:13:52 -05:00
retnet_hf.py
retnet_ts.py backwards compat for old YAMLs with models, option to set flash attention 2 for Llama (and derivatives), included syncdoth/RetNets torchscale retnet for shits and grins, etc. 2024-04-16 10:02:31 -05:00
retnet.py
transformer.py