Name | Last commit message | Last commit date
tacotron2 | support tts typing | 2022-04-16 23:36:57 -06:00
__init__.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00
autoregressive_codegen.py | Align autoregressive text using start and stop tokens | 2022-04-08 09:41:59 -06:00
autoregressive_codegen2.py | cg2 | 2022-04-06 21:24:36 -06:00
ctc_code_generator.py | resurrect ctc code gen with some cool new ideas | 2022-05-24 14:02:33 -06:00
diffusion_encoder.py | drop full layers in layerdrop, not half layers | 2022-03-23 17:15:08 -06:00
lucidrains_dvae.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00
mini_encoder.py | reverse univnet classifier | 2022-04-20 21:37:55 -06:00
random_latent_converter.py | Add a trainable network for converting a normal distribution into a latent space | 2022-05-02 09:47:30 -06:00
transformer_builders.py | undo relative | 2022-04-08 16:32:52 -06:00
transformer_diffusion_tts.py | tfd5 | 2022-05-28 22:27:04 -06:00
transformer_diffusion_tts2.py | tfd6 | 2022-05-30 09:09:42 -06:00
unet_diffusion_tts_flat.py | Add a trainable network for converting a normal distribution into a latent space | 2022-05-02 09:47:30 -06:00
unet_diffusion_tts7.py | support x-transformers in text_voice_clip and support relative positional embeddings | 2022-03-26 22:48:10 -06:00
unet_diffusion_tts9.py | tts9: fix position embeddings snafu | 2022-03-22 11:41:32 -06:00
unet_diffusion_vocoder_with_ref.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00
unet_diffusion_vocoder.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00
unified_voice2.py | uv2 add alignment head | 2022-06-14 15:18:58 -06:00
unified_voice3.py | uv3 | 2022-05-13 17:57:47 -06:00
voice_voice_clip.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00
w2v_matcher.py | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00