DL-Art-School/codes
James Betker d9936df363 Add gpt_tts dataset and implement inference
- Adds a script which preprocesses quantized mels given a DVAE
- Adds a dataset which can consume preprocessed qmels
- Reworks GPT TTS to consume the outputs of that dataset (removes logic to add padding and start/end tokens)
- Adds inference to gpt_tts
2021-08-04 00:44:04 -06:00
.idea
data Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
models Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
scripts Add gpt_tts dataset and implement inference 2021-08-04 00:44:04 -06:00
trainer Get dalle-style TTS to "work" 2021-08-03 21:08:27 -06:00
utils Add waveglow & inference capabilities to audio generator 2021-07-08 23:07:36 -06:00
multi_modal_train.py
process_video.py
requirements.txt Mods to support vqvae in audio mode (1d) 2021-07-20 08:36:46 -06:00
test_image_patch_classifier.py
test.py
train.py Get dalle-style TTS to "work" 2021-08-03 21:08:27 -06:00
use_discriminator_as_filter.py Various mods to support better jpeg image filtering 2021-06-25 13:16:15 -06:00