DL-Art-School

James Betker d9936df363 Add gpt_tts dataset and implement inference	2021-08-04 00:44:04 -06:00
- Adds a script which preprocesses quantized mels given a DVAE
- Adds a dataset which can consume preprocessed qmels
- Reworks GPT TTS to consume the outputs of that dataset (removes logic to add padding and start/end tokens)
- Adds inference to gpt_tts
.idea
data	Add gpt_tts dataset and implement inference	2021-08-04 00:44:04 -06:00
models	Add gpt_tts dataset and implement inference	2021-08-04 00:44:04 -06:00
scripts	Add gpt_tts dataset and implement inference	2021-08-04 00:44:04 -06:00
trainer	Get dalle-style TTS to "work"	2021-08-03 21:08:27 -06:00
utils	Add waveglow & inference capabilities to audio generator	2021-07-08 23:07:36 -06:00
multi_modal_train.py	More adjustments to support distributed training with teco & on multi_modal_train	2020-10-27 20:58:03 -06:00
process_video.py	misc	2021-01-23 13:45:17 -07:00
requirements.txt	Mods to support vqvae in audio mode (1d)	2021-07-20 08:36:46 -06:00
test_image_patch_classifier.py	More refactoring	2020-12-18 09:18:34 -07:00
test.py	Add FID evaluator for diffusion models	2021-06-14 09:14:30 -06:00
train.py	Get dalle-style TTS to "work"	2021-08-03 21:08:27 -06:00
use_discriminator_as_filter.py	Various mods to support better jpeg image filtering	2021-06-25 13:16:15 -06:00