vall-e

mrq/vall-e

Author	SHA1	Message	Date
mrq	d423bc03c2	fixed attentions for MoE	2024-08-27 17:02:42 -05:00
mrq	d636edd3a2	added flash_attn LlamaAttention (including flash_attn==1.0.9)	2024-08-18 20:51:14 -05:00
mrq	debcc93e7e	add adapted MixtralAttention for when I make a bad decision to actually train a MoE	2024-08-04 22:03:22 -05:00
mrq	b2194b859a	re-added loading multiple models because I'm now entertaining having split AR/NAR models again (and need a way to load both at once)	2024-06-06 09:48:43 -05:00
mrq	ff6fe6f1bc	cleanup	2024-06-05 20:30:43 -05:00