DL-Art-School/codes
James Betker a9629f7022 Try out using the GPT tokenizer rather than nv_tacotron
This results in a significant compression of the text domain, I'm curious what the
effect on speech quality will be.
2021-12-22 14:03:18 -07:00
..
.idea
data Try out using the GPT tokenizer rather than nv_tacotron 2021-12-22 14:03:18 -07:00
models Try out using the GPT tokenizer rather than nv_tacotron 2021-12-22 14:03:18 -07:00
scripts gpt_tts_hf inference fixes 2021-12-22 13:22:15 -07:00
trainer move speech utils 2021-12-16 20:47:37 -07:00
utils Add use_gpt_tts script 2021-12-16 23:28:54 -07:00
multi_modal_train.py More adjustments to support distributed training with teco & on multi_modal_train 2020-10-27 20:58:03 -06:00
process_video.py misc 2021-01-23 13:45:17 -07:00
requirements.txt Remove obsolete lucidrains DALLE stuff, re-create it in a dedicated folder 2021-12-22 13:44:02 -07:00
test.py Add FID evaluator for diffusion models 2021-06-14 09:14:30 -06:00
train.py Fix mel terminator 2021-12-18 17:18:06 -07:00
use_discriminator_as_filter.py Various mods to support better jpeg image filtering 2021-06-25 13:16:15 -06:00