DL-Art-School

History

James Betker 48e3ee9a5b Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information The mels should still retain some short-range positional information the model can use for tone and frequencies, for example.		2021-12-20 19:05:56 -07:00
..
.idea
data	Various fixes to gpt_tts_hf	2021-12-16 23:28:44 -07:00
models	Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information	2021-12-20 19:05:56 -07:00
scripts	Fix gpt_tts_hf inference	2021-12-20 17:45:26 -07:00
trainer	move speech utils	2021-12-16 20:47:37 -07:00
utils	Add use_gpt_tts script	2021-12-16 23:28:54 -07:00
multi_modal_train.py	More adjustments to support distributed training with teco & on multi_modal_train	2020-10-27 20:58:03 -06:00
process_video.py	misc	2021-01-23 13:45:17 -07:00
requirements.txt	Integrate with lr_quantizer	2021-11-23 19:48:22 -07:00
test.py	Add FID evaluator for diffusion models	2021-06-14 09:14:30 -06:00
train.py	Fix mel terminator	2021-12-18 17:18:06 -07:00
use_discriminator_as_filter.py	Various mods to support better jpeg image filtering	2021-06-25 13:16:15 -06:00