James Betker
|
a3da7f186e
|
add tfd audio diffusion
|
2022-06-12 13:59:22 -06:00 |
|
James Betker
|
f691f5faa1
|
f
|
2022-05-27 13:47:05 -06:00 |
|
James Betker
|
031769150d
|
clip adf test dataset
|
2022-05-27 13:42:52 -06:00 |
|
James Betker
|
31dec016e0
|
adf
|
2022-05-27 12:28:04 -06:00 |
|
James Betker
|
3db862dd32
|
adf update
|
2022-05-27 09:25:53 -06:00 |
|
James Betker
|
d8925ccde5
|
few things with gap filling
|
2022-05-06 14:33:44 -06:00 |
|
James Betker
|
3cad1b8114
|
more fixes
|
2022-04-11 15:18:44 -06:00 |
|
James Betker
|
6dea7da7a8
|
another fix
|
2022-04-11 12:29:43 -06:00 |
|
James Betker
|
f2c172291f
|
fix audio_diffusion_fid for autoregressive latent inputs
|
2022-04-11 12:08:15 -06:00 |
|
James Betker
|
6fc4f49e86
|
some dumb stuff
|
2022-04-07 11:32:34 -06:00 |
|
James Betker
|
9c79fec734
|
update adf
|
2022-03-24 21:20:29 -06:00 |
|
James Betker
|
b0d2827fad
|
flat0
|
2022-03-24 11:30:40 -06:00 |
|
James Betker
|
be5f052255
|
misc
|
2022-03-22 11:40:56 -06:00 |
|
James Betker
|
1ad18d29a8
|
Flat fixes
|
2022-03-21 14:43:52 -06:00 |
|
James Betker
|
c14fc003ed
|
flat diffusion
|
2022-03-17 17:45:27 -06:00 |
|
James Betker
|
8b376e63d9
|
More improvements
|
2022-03-16 10:16:34 -06:00 |
|
James Betker
|
54202aa099
|
fix mel normalization
|
2022-03-16 09:26:55 -06:00 |
|
James Betker
|
8437bb0c53
|
fixes
|
2022-03-15 23:52:48 -06:00 |
|
James Betker
|
f563a8dd41
|
fixes
|
2022-03-15 21:43:00 -06:00 |
|
James Betker
|
1e3a8554a1
|
updates to audio_diffusion_fid
|
2022-03-15 11:35:09 -06:00 |
|
James Betker
|
7929fd89de
|
Refactor audio-style models into the audio folder
|
2022-03-15 11:06:25 -06:00 |
|
James Betker
|
c4e4cf91a0
|
add support for the original vocoder to audio_diffusion_fid; also add a new "intelligibility" metric
|
2022-03-08 15:53:27 -07:00 |
|
James Betker
|
382681a35d
|
Load diffusion_fid DVAE into the correct cuda device
|
2022-03-04 13:42:14 -07:00 |
|
James Betker
|
58019a2ce3
|
audio diffusion fid updates
|
2022-03-03 21:53:32 -07:00 |
|
James Betker
|
db0c3340ac
|
Implement guidance-free diffusion in eval
And a few other fixes
|
2022-03-01 11:49:36 -07:00 |
|
James Betker
|
2134f06516
|
Implement conditioning-free diffusion at the eval level
|
2022-02-27 15:11:42 -07:00 |
|
James Betker
|
7c17c8e674
|
gurgl
|
2022-02-23 21:28:24 -07:00 |
|
James Betker
|
81017d9696
|
put frechet_distance on cuda
|
2022-02-23 21:21:13 -07:00 |
|
James Betker
|
9a7bbf33df
|
f
|
2022-02-23 18:03:38 -07:00 |
|
James Betker
|
b7319ab518
|
Support vocoder type diffusion in audio_diffusion_fid
|
2022-02-23 17:25:16 -07:00 |
|
James Betker
|
58f6c9805b
|
adf
|
2022-02-22 23:12:58 -07:00 |
|
James Betker
|
7b12799370
|
Reformat mel_text_clip for use in eval
|
2022-02-19 20:37:26 -07:00 |
|
James Betker
|
102142d1eb
|
f
|
2022-02-11 11:05:13 -07:00 |
|
James Betker
|
40b08a52d0
|
dafuk
|
2022-02-11 11:01:31 -07:00 |
|
James Betker
|
46b97049dc
|
Fix eval
|
2022-02-11 10:59:32 -07:00 |
|
James Betker
|
d1d1ae32a1
|
audio diffusion frechet distance measurement!
|
2022-02-10 22:55:46 -07:00 |
|