DL-Art-School

History

James Betker ee8ceed6da rework tfd13 further - use a gated activation layer for both attention & convs - add a relativistic learned position bias. I believe this is similar to the T5 position encodings but it is simpler and learned - get rid of prepending to the attention matrix - this doesn't really work that well. the model eventually learns to attend one of its heads to these blocks but why not just concat if it is doing that?		2022-07-20 23:28:29 -06:00
..
.idea
data	Support aac datatypes	2022-07-01 00:44:20 -06:00
models	rework tfd13 further	2022-07-20 23:28:29 -06:00
scripts	add assertions to mel generator script	2022-07-19 11:23:54 -06:00
trainer	rework tfd13 further	2022-07-20 23:28:29 -06:00
utils	misc	2022-07-08 00:37:53 -06:00
multi_modal_train.py
process_video.py
requirements.txt
sweep.py
test.py
train.py	misc	2022-07-20 10:19:02 -06:00
use_discriminator_as_filter.py