DL-Art-School/codes/models/audio/tts
2022-05-27 11:12:03 -06:00
..
tacotron2 support tts typing 2022-04-16 23:36:57 -06:00
__init__.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
autoregressive_codegen.py Align autoregressive text using start and stop tokens 2022-04-08 09:41:59 -06:00
autoregressive_codegen2.py cg2 2022-04-06 21:24:36 -06:00
ctc_code_generator.py ressurect ctc code gen with some cool new ideas 2022-05-24 14:02:33 -06:00
diffusion_encoder.py drop full layers in layerdrop, not half layers 2022-03-23 17:15:08 -06:00
lucidrains_dvae.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
mini_encoder.py reverse univnet classifier 2022-04-20 21:37:55 -06:00
random_latent_converter.py Add a trainable network for converting a normal distribution into a latent space 2022-05-02 09:47:30 -06:00
transformer_builders.py undo relative 2022-04-08 16:32:52 -06:00
transformer_diffusion_tts.py propagate type 2022-05-27 11:12:03 -06:00
unet_diffusion_tts_flat.py Add a trainable network for converting a normal distribution into a latent space 2022-05-02 09:47:30 -06:00
unet_diffusion_tts7.py support x-transformers in text_voice_clip and support relative positional embeddings 2022-03-26 22:48:10 -06:00
unet_diffusion_tts9.py tts9: fix position embeddings snafu 2022-03-22 11:41:32 -06:00
unet_diffusion_tts10.py adf update 2022-05-27 09:25:53 -06:00
unet_diffusion_vocoder_with_ref.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
unet_diffusion_vocoder.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
unified_voice2.py clean up unified voice 2022-05-09 14:45:49 -06:00
unified_voice3.py uv3 2022-05-13 17:57:47 -06:00
voice_voice_clip.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
w2v_matcher.py Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00