DL-Art-School/codes
James Betker d9936df363 Add gpt_tts dataset and implement inference
- Adds a script which preprocesses quantized mels given a DVAE
- Adds a dataset which can consume preprocessed qmels
- Reworks GPT TTS to consume the outputs of that dataset (removes logic to add padding and start/end tokens)
- Adds inference to gpt_tts
2021-08-04 00:44:04 -06:00
..
.idea IDEA update 2020-05-19 09:35:26 -06:00
data Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
models Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
scripts Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
trainer Get dalle-style TTS to "work" 2021-08-03 21:08:27 -06:00
utils Add waveglow & inference capabilities to audio generator 2021-07-08 23:07:36 -06:00
multi_modal_train.py More adjustments to support distributed training with teco & on multi_modal_train 2020-10-27 20:58:03 -06:00
process_video.py misc 2021-01-23 13:45:17 -07:00
requirements.txt Mods to support vqvae in audio mode (1d) 2021-07-20 08:36:46 -06:00
test_image_patch_classifier.py More refactoring 2020-12-18 09:18:34 -07:00
test.py Add FID evaluator for diffusion models 2021-06-14 09:14:30 -06:00
train.py Get dalle-style TTS to "work" 2021-08-03 21:08:27 -06:00
use_discriminator_as_filter.py Various mods to support better jpeg image filtering 2021-06-25 13:16:15 -06:00