James Betker
|
89bd40d39f
|
eval bug fix
|
2022-06-10 13:51:06 -06:00 |
|
James Betker
|
84469f3538
|
get rid of encoder checkpointing
|
2022-06-10 10:50:34 -06:00 |
|
James Betker
|
97b32dd39d
|
try to make tfd8 be able to be trained e2e in quantizer mode
|
2022-06-10 10:40:56 -06:00 |
|
James Betker
|
e78c4b422c
|
tfd8
|
2022-06-10 09:24:41 -06:00 |
|
James Betker
|
d98b895307
|
loss aware fix and report gumbel temperature
|
2022-06-09 21:56:47 -06:00 |
|
James Betker
|
c61cd64bc9
|
network updates
|
2022-06-08 09:26:59 -06:00 |
|
James Betker
|
602df0abbc
|
revert changes to dietattentionblock
|
2022-06-05 10:06:17 -06:00 |
|
James Betker
|
51d1908e94
|
update
|
2022-06-05 09:35:43 -06:00 |
|
James Betker
|
f9ebcf11d8
|
fix2
|
2022-06-05 01:31:37 -06:00 |
|
James Betker
|
aac92b01b3
|
fix
|
2022-06-05 01:27:28 -06:00 |
|
James Betker
|
38d8b17d18
|
tfd8 gets real verbose grad norm metrics
|
2022-06-04 23:09:54 -06:00 |
|
James Betker
|
0a9d4d4afc
|
bunch of new stuff
|
2022-06-04 22:23:08 -06:00 |
|