DL-Art-School

Author	SHA1	Message	Date
James Betker	db38672dae	precompute diffusion embeddings for from_codes	2022-05-22 06:45:57 -06:00
James Betker	ea21a8b107	Update music_diffusion_fid to support waveform diffusion from codes	2022-05-22 05:23:54 -06:00
James Betker	c9c16e3b01	misc updates	2022-05-19 13:39:32 -06:00
James Betker	3e97abc8a9	update flat0 to break out timestep-independent inference steps	2022-04-01 14:38:53 -06:00
James Betker	d186414566	More spring cleaning	2022-03-16 12:04:00 -06:00
James Betker	0419a64107	misc	2022-03-15 10:36:34 -06:00
James Betker	d2bdeb6f20	misc audio support	2022-03-08 15:52:26 -07:00
James Betker	93a3302819	Push training_state data to CPU memory before saving it For whatever reason, keeping this on GPU memory just doesn't work. When you load it, it consumes a large amount of GPU memory and that utilization doesn't go away. Saving to CPU should fix this.	2022-03-04 17:57:33 -07:00
James Betker	e1052a5e32	Move log consensus to train for efficiency	2022-03-04 13:41:32 -07:00
James Betker	3ff878ae85	Accumulate loss & grad_norm metrics from all entities within a distributed graph	2022-03-04 12:01:16 -07:00
James Betker	f490eaeba7	Shuffle optimizer states back and forth between cpu memory during steps	2022-03-04 10:38:51 -07:00
James Betker	d1d1ae32a1	audio diffusion frechet distance measurement!	2022-02-10 22:55:46 -07:00
James Betker	7badbf1b4d	update usage scripts	2022-01-25 17:57:26 -07:00
James Betker	49edffb6ad	Revise device mapping	2022-01-24 15:08:13 -07:00
James Betker	f4484fd155	Add "dataset_debugger" support This allows the datasets themselves compile statistics and report them via tensorboard and wandb.	2022-01-06 12:38:20 -07:00
James Betker	61cd351b71	update unified	2022-01-06 09:48:11 -07:00
James Betker	b24a51f0aa	Check in speech2speech CLIP inference tool	2021-12-29 00:19:44 -07:00
James Betker	dee34f096c	Add use_gpt_tts script	2021-12-16 23:28:54 -07:00
James Betker	b2d8fbcfc0	build a better speech synthesis toolset	2021-12-09 22:59:56 -07:00
James Betker	973f47c525	misc nonfunctional	2021-11-22 17:16:39 -07:00
James Betker	3b65241b6b	Get rid of printing grad names (didn't work very well..)	2021-11-01 18:44:05 -06:00
James Betker	87364b890f	Add custom clip_grad_norm that prints out the param names in error.	2021-11-01 11:12:20 -06:00
James Betker	0822792d79	Fix options.py bug	2021-10-29 14:47:31 -06:00
James Betker	2afea126d7	mod trainer to be very explicit about the fact that loading models and state together dont work, but allow it	2021-10-28 22:32:42 -06:00
James Betker	b1248e7114	Get rid of filter_urbansounds	2021-10-21 16:46:04 -06:00
James Betker	06ea6191a9	Initial implementation of audio_with_noise dataset	2021-10-21 16:45:19 -06:00
James Betker	55b58fb67f	Clean up codebase Remove stuff that I'm likely not going to use again (or generally failed experiments)	2021-09-29 09:21:44 -06:00
James Betker	ac57cdc794	Add scheduling to quantizer, enable cudnn_benchmarking to be disabled	2021-09-24 17:01:36 -06:00
James Betker	f78ce9d924	Get diffusion_dvae ready for prime time!	2021-09-16 22:43:10 -06:00
James Betker	570ed327ed	Stop dataset - attempt #2	2021-08-18 18:29:38 -06:00
James Betker	82fc69abfa	Add "pure" evaluator Which simply computes the training loss against an eval dataset	2021-08-09 14:58:35 -06:00
James Betker	be2745f42d	Add waveglow & inference capabilities to audio generator	2021-07-08 23:07:36 -06:00
James Betker	3e3ad7825f	Add support for training an EMA network alongside the main networks	2021-06-12 21:01:41 -06:00
James Betker	6084915af8	Support gaussian diffusion models Adds support for GD models, courtesy of some maths from openai. Also: - Fixes requirement for eval{} even when it isn't being used - Adds support for denormalizing an imagenet norm	2021-06-02 21:47:32 -06:00
James Betker	b374dcdd46	update vqvae to double codebook size for bottom quantizer	2021-01-23 13:47:07 -07:00
James Betker	dac7d768fa	test uresnet playground mods	2021-01-23 13:46:43 -07:00
James Betker	07168ecfb4	Enable vqvae to use a switched_conv variant	2021-01-09 20:53:14 -07:00
James Betker	5a8156026a	Did anyone ask for k-means clustering? This is so cool...	2021-01-07 22:37:41 -07:00
James Betker	193cdc6636	Move discriminators to the create_model paradigm Also cleans up a lot of old discriminator models that I have no intention of using again.	2021-01-01 15:56:09 -07:00
James Betker	8f18b2709e	Get rid of CUDA_VISIBLE_DEVICES It is not clear to me what the purpose of this is, but it has recently started causing failures.	2020-12-31 10:13:58 -07:00
James Betker	8f0984cacf	Add sr_fid evaluator	2020-12-30 20:18:58 -07:00
James Betker	10fdfa1563	Migrate generators to dynamic model registration	2020-12-24 23:02:10 -07:00
James Betker	e82f4552db	Update other docs with dumb config options	2020-12-18 16:21:28 -07:00
James Betker	d875ca8342	More refactor changes	2020-12-18 09:24:31 -07:00
James Betker	5640e4efe4	More refactoring	2020-12-18 09:18:34 -07:00
James Betker	b905b108da	Large cleanup Removed a lot of old code that I won't be touching again. Refactored some code elements into more logical places.	2020-12-18 09:10:44 -07:00
James Betker	2f0a52b7db	misc changes	2020-12-18 08:53:45 -07:00
James Betker	0cf52ef52c	latent work	2020-11-06 20:38:23 -07:00
James Betker	658a267bab	More work on SSIM/PSNR approximators - Add a network that accomodates this style of approximator while retaining structure - Migrate to SSIM approximation - Add a tool to visualize how these approximators are working - Fix some issues that came up while doign this work	2020-11-03 08:09:58 -07:00
James Betker	74738489b9	Fixes and additional support for progressive zoom	2020-10-30 09:59:54 -06:00

1 2

86 Commits