James Betker
|
92e7e57f81
|
Update diffusion_noise_surfer to support audio
|
2021-09-01 08:34:47 -06:00 |
|
James Betker
|
3e073cff85
|
Set kernel_size in diffusion_vocoder
|
2021-09-01 08:33:46 -06:00 |
|
James Betker
|
dabd87246d
|
Add unet_diffusion_vocoder
|
2021-08-31 14:38:33 -06:00 |
|
James Betker
|
909754cc27
|
Add find_faulty_files.py
|
2021-08-25 18:00:43 -06:00 |
|
James Betker
|
cfd284f425
|
Fix up some stuff that allows the MEL to be computed on-GPU
|
2021-08-13 18:35:55 -06:00 |
|
James Betker
|
cdee31c60b
|
GPT_ASR
|
2021-08-13 15:02:18 -06:00 |
|
James Betker
|
04d14b3acc
|
No batch factors for eval
|
2021-08-09 16:02:01 -06:00 |
|
James Betker
|
82fc69abfa
|
Add "pure" evaluator
Which simply computes the training loss against an eval dataset
|
2021-08-09 14:58:35 -06:00 |
|
James Betker
|
b43683b772
|
Add lucidrains_dvae
|
2021-08-06 12:03:46 -06:00 |
|
James Betker
|
3ca51e80b2
|
Only fix weird path bug in windows
|
2021-08-05 22:21:25 -06:00 |
|
James Betker
|
5037220ac7
|
Mods to support contrastive learning on audio files
|
2021-08-05 05:57:04 -06:00 |
|
James Betker
|
341f28dd82
|
It works!
|
2021-08-04 20:07:51 -06:00 |
|
James Betker
|
4c98b9703f
|
Get dalle-style TTS to "work"
|
2021-08-03 21:08:27 -06:00 |
|
James Betker
|
2814307eee
|
Alterations to support VQVAE on mel spectrograms
|
2021-08-01 07:54:21 -06:00 |
|
James Betker
|
965f6e6b52
|
Fixes to weight_decay in adamw
|
2021-07-31 15:58:41 -06:00 |
|
James Betker
|
0c9e75bc69
|
Improvements to GptTts
|
2021-07-31 15:57:57 -06:00 |
|
James Betker
|
96e90e7047
|
Add support for a gaussian-diffusion-based wave tacotron
|
2021-07-26 16:27:31 -06:00 |
|
James Betker
|
97d7cbbc34
|
Additional work for audio xformer (which doesnt really do a great job)
|
2021-07-23 10:58:14 -06:00 |
|
James Betker
|
2325e7a88c
|
Allow inference for vqvae
|
2021-07-20 10:40:05 -06:00 |
|
James Betker
|
d81386c1be
|
Mods to support vqvae in audio mode (1d)
|
2021-07-20 08:36:46 -06:00 |
|
James Betker
|
5584cfcc7a
|
tacotron2 work
|
2021-07-14 21:41:57 -06:00 |
|
James Betker
|
be2745f42d
|
Add waveglow & inference capabilities to audio generator
|
2021-07-08 23:07:36 -06:00 |
|
James Betker
|
1ff434218e
|
tacotron2, ready for prime time!
|
2021-07-08 22:13:44 -06:00 |
|
James Betker
|
86fd3ad7fd
|
Initial checkin of nvidia tacotron model & dataset
These two are tested, full support for training to come.
|
2021-07-06 11:11:35 -06:00 |
|
James Betker
|
6fd16ea9c8
|
Add meta-anomaly detection, colorjitter augmentation
|
2021-06-29 13:41:55 -06:00 |
|
James Betker
|
a57ed8e960
|
Various mods to support better jpeg image filtering
|
2021-06-25 13:16:15 -06:00 |
|
James Betker
|
e7890dc0ba
|
Misc fixes for diffusion nets
|
2021-06-21 10:38:07 -06:00 |
|
James Betker
|
8e3a33e001
|
Fix a bug where non-rank-0 is computing FID before all images are saved.
|
2021-06-16 16:27:09 -06:00 |
|
James Betker
|
68cbbed886
|
Add some cool diffusion testing scripts
|
2021-06-16 16:26:36 -06:00 |
|
James Betker
|
ae8de0cb9d
|
fid saving images across all rank fix
|
2021-06-15 10:31:07 -06:00 |
|
James Betker
|
6a75bd0777
|
Another fix
|
2021-06-14 09:51:44 -06:00 |
|
James Betker
|
54bff35171
|
Fix issue where eval was not being used by all ddp processes
|
2021-06-14 09:50:04 -06:00 |
|
James Betker
|
60079a1572
|
Fix saver in distributed mode
|
2021-06-14 09:41:06 -06:00 |
|
James Betker
|
545f2db170
|
Distributed FID dataset across processes
|
2021-06-14 09:33:44 -06:00 |
|
James Betker
|
6b32c87dcb
|
Try to make diffusion fid more deterministic
|
2021-06-14 09:27:43 -06:00 |
|
James Betker
|
5b4f86293f
|
Add FID evaluator for diffusion models
|
2021-06-14 09:14:30 -06:00 |
|
James Betker
|
9cfe840872
|
Attempt to fix syncing multiple times when doing gradient accumulation
|
2021-06-13 14:30:30 -06:00 |
|
James Betker
|
1cd75dfd33
|
Fix ddp bug
|
2021-06-13 10:25:23 -06:00 |
|
James Betker
|
3e3ad7825f
|
Add support for training an EMA network alongside the main networks
|
2021-06-12 21:01:41 -06:00 |
|
James Betker
|
696f320820
|
Get rid of feature networks
|
2021-06-11 20:50:07 -06:00 |
|
James Betker
|
65c474eecf
|
Various changes to fix testing
|
2021-06-11 15:31:10 -06:00 |
|
James Betker
|
aea12e1b9c
|
Fix cat eval hack
|
2021-06-09 17:05:11 -06:00 |
|
James Betker
|
2ad2b56438
|
Don't do wandb except on rank 0
|
2021-06-06 16:52:07 -06:00 |
|
James Betker
|
7c5478bc2c
|
Formatting issue with gdi
|
2021-06-06 16:35:37 -06:00 |
|
James Betker
|
692e9c417b
|
Support diffusion unet
|
2021-06-06 13:57:22 -06:00 |
|
James Betker
|
16cd92acd5
|
hack
|
2021-06-05 14:23:41 -06:00 |
|
James Betker
|
80d4404367
|
A few fixes:
- Output better prediction of xstart from eps
- Support LossAwareSampler
- Support AdamW
|
2021-06-05 13:40:32 -06:00 |
|
James Betker
|
7d45132f60
|
fdsa
|
2021-06-04 21:26:54 -06:00 |
|
James Betker
|
6c8c8087d5
|
asdf
|
2021-06-04 21:24:48 -06:00 |
|
James Betker
|
bf811f80c1
|
GD mods & fixes
- Report variational loss separately
- Report model prediction from injector
- Log these things
- Use respacing like guided diffusion
|
2021-06-04 17:13:16 -06:00 |
|