vall-e/vall_e/models
2024-08-06 20:42:39 -05:00
arch do not include SDPA attention if there's no available SDPA backends 2024-08-06 20:42:39 -05:00
__init__.py
ar_nar.py add adapted MixtralAttention for when I make a bad decision to actually train a MoE 2024-08-04 22:03:22 -05:00
ar.py fix issue with sft and shared tensors... 2024-08-04 19:56:21 -05:00
base.py do not include SDPA attention if there's no available SDPA backends 2024-08-06 20:42:39 -05:00
experimental.py
lora.py
nar.py