forked from ecker/DL-Art-School
- Adds a script which preprocesses quantized mels given a DVAE
- Adds a dataset which can consume preprocessed qmels
- Reworks GPT TTS to consume the outputs of that dataset (removes logic to add padding and start/end tokens)
- Adds inference to gpt_tts

| Name |
|---|
| .idea |
| data |
| models |
| scripts |
| trainer |
| utils |
| multi_modal_train.py |
| process_video.py |
| requirements.txt |
| test_image_patch_classifier.py |
| test.py |
| train.py |
| use_discriminator_as_filter.py |