DL-Art-School/codes/models
James Betker 009a1e8404 Add a new diffusion_vocoder that should be trainable faster
This new one has a "cheating" top layer, that does not feed down into the unet encoder,
but does consume the outputs of the unet. This cheater only operates on half of the input,
while the rest of the unet operates on the full input. This limits the dimensionality of this last
layer, on the assumption that these last layers consume by far the most computation and memory,
but do not require the full input context.

Losses are only computed on half of the aggregate input.
2022-01-11 17:26:07 -07:00
..
byol Fix byol_model_wrapper to function with audio inputs 2021-08-05 22:20:22 -06:00
classifiers Fix error & add nonfinite warning 2021-11-09 23:58:41 -07:00
diffusion Fixes 2021-12-18 16:45:38 -07:00
fixup_resnet More refactoring 2020-12-18 09:18:34 -07:00
flownet2@db2b7899ea Update flownet submodule 2020-10-24 11:59:00 -06:00
glean Glean mods 2020-12-27 12:25:06 -07:00
gpt_voice Add a new diffusion_vocoder that should be trainable faster 2022-01-11 17:26:07 -07:00
lucidrains Make performer code functional 2022-01-09 22:32:50 -07:00
optical_flow Add PWCNet for human optical flow 2021-01-25 08:25:44 -07:00
segformer Fix error & add nonfinite warning 2021-11-09 23:58:41 -07:00
spleeter Improvements to splitter 2021-09-09 23:34:56 -06:00
srflow Migrate generators to dynamic model registration 2020-12-24 23:02:10 -07:00
stylegan Various fixes 2021-07-14 00:08:42 -06:00
switched_conv Add switchnorm to gumbel_quantizer 2021-09-24 18:49:25 -06:00
tacotron2 Initial implementation of audio_with_noise dataset 2021-10-21 16:45:19 -06:00
vqvae Fix (?) use_gpt_tts for unified_voice 2022-01-05 20:09:31 -07:00
waveglow Add waveglow & inference capabilities to audio generator 2021-07-08 23:07:36 -06:00
__init__.py Lots of new discriminator nets 2020-11-10 16:06:54 -07:00
arch_util.py Clean stuff up, move more things into arch_util 2021-10-20 21:19:25 -06:00
audio_resnet.py Add audio augmentation to wavfile_dataset, utility to test audio similary 2021-08-05 22:14:49 -06:00
clip.py Add generic CLIP model based off of x_clip 2022-01-08 19:08:01 -07:00
discriminator_vgg_arch.py Fix codes when inferring from dvae 2021-10-17 22:51:17 -06:00
feature_arch.py More refactoring 2020-12-18 09:18:34 -07:00
lightweight_gan.py Various fixes 2021-07-14 00:08:42 -06:00
ResGen_arch.py More refactoring 2020-12-18 09:18:34 -07:00
RRDBNet_arch.py More cleanup 2021-09-29 14:24:49 -06:00
spinenet_arch.py Migrate generators to dynamic model registration 2020-12-24 23:02:10 -07:00