vall-e/vall_e/engines
2024-12-12 17:12:59 -06:00
..
__init__.py APOLLO cringe (doesn't want to work with deepspeed) 2024-12-12 00:31:58 -06:00
base.py actually save the optimizer for the local engine backend because safetensors doesn't save it 2024-12-12 17:12:59 -06:00
deepspeed.py added knowledge distillation in the trainer (sadly it is not agnostic because of the grave mistake of further processing the batch within the forward pass, so subsequent calls do not match......) 2024-12-05 23:05:52 -06:00