• https://git.ecker.tech/ aims to provide a place to share my efforts while maintaining true ownership of my code, as I do not trust GitHub.

    XMR: 4B9TQdkAkBFYrbj5ztvTx89e5LpucPeTSPzemCihdDi9EBnx7btn8RDNZTBz2zihWsjMnDkzn5As1LU6gLv3KQy8BLsZ8SG

  • Joined on 2022-10-10
mrq pushed to master at mrq/vall-e 2025-03-04 20:51:51 +00:00
mrq pushed to master at mrq/vall-e 2025-03-04 20:48:06 +00:00
1cd24f3381 a birdie tells me i should probably use a different optimizer (also preliminary support for native sparse attention but I don't know if I'll use it)
mrq pushed to master at mrq/vall-e 2025-03-03 19:16:41 +00:00
0451f75e33 now that the new model seems a little more promising, i can re-document things non-cynically
mrq pushed to master at mrq/vall-e 2025-03-03 04:31:27 +00:00
3f1070f575 tweaks
mrq pushed to master at mrq/vall-e 2025-03-02 02:58:22 +00:00
4afa4ccce5 at wits end (parhaps the semantic token approach is the toughest pill to swallow)
mrq pushed to master at mrq/vall-e 2025-03-02 01:25:25 +00:00
1d3290b023 could have sworn this worked before, might have broke it when i decoupled from omegaconf
mrq pushed to master at mrq/vall-e 2025-03-01 23:43:50 +00:00
17094b8002 reticulating splines
mrq pushed to master at mrq/vall-e 2025-03-01 04:10:34 +00:00
mrq pushed to master at mrq/vall-e 2025-03-01 04:09:50 +00:00
mrq pushed to master at mrq/vall-e 2025-03-01 04:07:15 +00:00
ddc49c89c5 the learning rate scheduler pill is a tough pill to swallow
mrq pushed to master at mrq/vall-e 2025-03-01 04:03:05 +00:00
94861677d3 the learning rate scheduler pill is a tough pill to swallow
mrq pushed to master at mrq/vall-e 2025-03-01 00:48:04 +00:00
b97faa8173 fixes...
mrq pushed to master at mrq/vall-e 2025-03-01 00:01:34 +00:00
mrq pushed to master at mrq/vall-e 2025-02-28 23:51:51 +00:00
a174c33db6 a gorillionth time's the charm (aka: the encoder/decoder pill is a tough pill to swallow)
mrq pushed to master at mrq/vall-e 2025-02-28 07:02:03 +00:00
mrq pushed to master at mrq/vall-e 2025-02-28 07:00:41 +00:00
mrq pushed to master at mrq/vall-e 2025-02-28 06:59:25 +00:00
180a4eac1b ughh
mrq pushed to master at mrq/vall-e 2025-02-28 06:06:02 +00:00
mrq pushed to master at mrq/vall-e 2025-02-28 05:59:50 +00:00
mrq pushed to master at mrq/vall-e 2025-02-28 05:54:53 +00:00
93feb5660f do not like that