vall-e/vall_e/models/arch
2024-11-23 09:45:23 -06:00
..
attention
__init__.py
bitnet.py
llama.py fixed training tqdm being stubborn 2024-11-23 09:45:23 -06:00
mamba.py temporarily dropping support for xformers because it's breaking when using an attention mask (which i dont remember commenting it out when being passed), default to not use wandb because it's being a pain when doing tests and not actual sessionsS) 2024-11-22 11:29:12 -06:00
mixtral.py
retnet.py
transformer.py