DL-Art-School/codes/models/gpt_voice
James Betker 48e3ee9a5b Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information
The mels should still retain some short-range positional information the model can use
for tone and frequencies, for example.
2021-12-20 19:05:56 -07:00
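The commit message above describes shuffling the conditioning inputs along the positional axis while keeping some short-range structure. This listing does not show how gpt_tts_hf.py does that, so the following is only a minimal sketch of one way to get a similar effect, not the repository's implementation: it permutes fixed-size chunks of frames along the time axis and preserves the order inside each chunk. The name shuffle_chunks and the chunk_size parameter are hypothetical, and plain Python lists stand in for mel frames.

```python
import random


def shuffle_chunks(frames, chunk_size, rng=None):
    """Permute fixed-size chunks of `frames` along the time axis.

    Hypothetical helper, not taken from the repository. Order inside each
    chunk is kept, so short-range structure survives while the absolute
    position of each chunk is randomized.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be >= 1")
    if rng is None:
        rng = random.Random()
    # Split the sequence into consecutive chunks (the last one may be shorter).
    chunks = [frames[i:i + chunk_size] for i in range(0, len(frames), chunk_size)]
    # Shuffle the chunk order in place, then flatten back to one sequence.
    rng.shuffle(chunks)
    return [frame for chunk in chunks for frame in chunk]


if __name__ == "__main__":
    # Integers stand in for mel frames so the reordering is easy to see.
    frames = list(range(12))
    print(shuffle_chunks(frames, chunk_size=4, rng=random.Random(0)))
```

With chunk_size=1 this degenerates to a full per-frame shuffle, which would discard local ordering entirely. Larger chunks keep more local context at the cost of leaving more positional information available to the model.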
dvae_arch_playground Add norming to discretization_loss 2021-10-06 17:10:50 -06:00
__init__.py
gpt_asr_hf.py Fix gpt_tts_hf inference 2021-12-20 17:45:26 -07:00
gpt_asr_hf2.py One last fix for gpt_asr_hf2 2021-12-02 21:19:28 -07:00
gpt_asr.py Check in GPT with new inference methods (but not the backing code..) 2021-10-29 17:21:40 -06:00
gpt_audio_segmentor.py Stop dataset - attempt #2 2021-08-18 18:29:38 -06:00
gpt_tts_hf.py Shuffle conditioning inputs along the positional axis to reduce fitting on prosody and other positional information 2021-12-20 19:05:56 -07:00
gpt_tts.py
lucidrains_dvae.py Record codes more often 2021-12-07 09:22:45 -07:00
lucidrains_gpt.py Fix inference mode for lucidrains_gpt 2021-10-30 16:59:18 -06:00
mini_encoder.py Further simplify diffusion_vocoder and make noise_surfer work 2021-10-26 08:54:30 -06:00
pixelshuffle_1d.py
reversible.py Add support for distilling gpt_asr 2021-10-27 13:10:07 -06:00
unet_diffusion_vocoder_with_ref.py misc 2021-12-11 08:17:26 -07:00