vall-e/vall_e/utils
2025-03-06 17:07:29 -06:00
ext made muon actually work by actually utilizing param groups (thanks APOLLO for reminding me this is the sane way to handle this split) 2025-02-26 10:39:13 -06:00
__init__.py tweaks 2025-03-02 22:36:25 -06:00
distributed.py moved prints to use logger, edited readme (fused_attn doesn't seem stable for training) 2024-08-29 13:27:16 -05:00
io.py agony 2024-12-21 22:52:10 -06:00
ml.py a birdie tells me i should probably use a different optimizer (also preliminary support for native sparse attention but I don't know if I'll use it) 2025-03-04 14:53:02 -06:00
pattern.py oops, kept forgetting to actually pass in lang/tone tokens (despite not really using these at the moment) 2024-07-18 14:18:34 -05:00
sampler.py tweaks to bucket sampling 2024-11-13 11:09:24 -06:00
trainer.py stuff for interfacing with the loss scaler value (because I want to cap it) 2025-03-06 17:07:29 -06:00
utils.py tweaks 2025-03-02 22:36:25 -06:00