DL-Art-School/codes
James Betker 48e3ee9a5b Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information
The mels should still retain some short-range positional information that the model
can use, for example for tone and frequencies.
2021-12-20 19:05:56 -07:00
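The idea in the commit message above can be illustrated with a small sketch. This is not the code from commit 48e3ee9a5b: the function name `shuffle_time_blocks`, the `block_size` parameter, and the choice to shuffle in contiguous blocks are all assumptions made for illustration. It uses only the standard library so it runs without PyTorch. On a tensor, the same effect would typically come from indexing the time dimension with a permutation.

```python
import random


def shuffle_time_blocks(frames, block_size, rng=None):
    """Shuffle a sequence of frames in contiguous blocks along the time axis.

    Hypothetical helper, not the repository's implementation. Order inside
    each block is preserved (short-range structure survives), while the
    order of the blocks is randomized (long-range position is destroyed).
    """
    if block_size < 1:
        raise ValueError("block_size must be >= 1")
    rng = rng or random.Random()
    # Split the time axis into consecutive blocks; the last may be shorter.
    blocks = [frames[i:i + block_size] for i in range(0, len(frames), block_size)]
    rng.shuffle(blocks)
    # Flatten the shuffled blocks back into a single frame sequence.
    return [frame for block in blocks for frame in block]


# Toy stand-in for a mel spectrogram: 8 time frames with 2 values each.
# The first value of each frame is its original time index.
mel = [[float(t), float(t) + 0.5] for t in range(8)]
shuffled = shuffle_time_blocks(mel, block_size=2, rng=random.Random(0))
print([int(frame[0]) for frame in shuffled])
```

With `block_size=1` every frame moves independently and all positional information is lost. Larger blocks keep more local context, which is one way to read the commit's note that short-range information should remain usable.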
.idea
data Various fixes to gpt_tts_hf 2021-12-16 23:28:44 -07:00
models Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information 2021-12-20 19:05:56 -07:00
scripts Fix gpt_tts_hf inference 2021-12-20 17:45:26 -07:00
trainer move speech utils 2021-12-16 20:47:37 -07:00
utils Add use_gpt_tts script 2021-12-16 23:28:54 -07:00
multi_modal_train.py
process_video.py
requirements.txt Integrate with lr_quantizer 2021-11-23 19:48:22 -07:00
test.py
train.py Fix mel terminator 2021-12-18 17:18:06 -07:00
use_discriminator_as_filter.py Various mods to support better jpeg image filtering 2021-06-25 13:16:15 -06:00