DL-Art-School/codes/models/audio/tts
2022-04-01 15:53:45 -06:00
tacotron2
__init__.py
ctc_code_generator.py
ctc_code_generator2.py
diffusion_encoder.py drop full layers in layerdrop, not half layers 2022-03-23 17:15:08 -06:00
lucidrains_dvae.py
mini_encoder.py
transformer_builders.py
unet_diffusion_tts_flat.py drop full layers in layerdrop, not half layers 2022-03-23 17:15:08 -06:00
unet_diffusion_tts_flat0.py prep flat0 for feeding from autoregressive_latent_converter 2022-04-01 15:53:45 -06:00
unet_diffusion_tts5.py
unet_diffusion_tts6.py
unet_diffusion_tts7.py support x-transformers in text_voice_clip and support relative positional embeddings 2022-03-26 22:48:10 -06:00
unet_diffusion_tts8.py
unet_diffusion_tts9.py tts9: fix position embeddings snafu 2022-03-22 11:41:32 -06:00
unet_diffusion_vocoder_with_ref.py
unet_diffusion_vocoder.py
unified_voice2.py
voice_voice_clip.py
w2v_matcher.py