DL-Art-School

Author	SHA1	Message	Date
James Betker	d5fa059594	Add capability to have old discriminators serve as feature networks	2020-07-31 14:59:54 -06:00
James Betker	6b45b35447	Allow multi_step_lr_scheduler to load a new LR schedule when restoring state	2020-07-31 11:21:11 -06:00
James Betker	e37726f302	Add feature_model for training custom feature nets	2020-07-31 11:20:39 -06:00
James Betker	7629cb0e61	Add FDPL Loss New loss type that can replace PSNR loss. Works against the frequency domain and focuses on frequency features loss during hr->lr conversion.	2020-07-30 20:47:57 -06:00
James Betker	85ee64b8d9	Turn down feadisc intensity Honestly - this feature is probably going to be removed soon, so backwards compatibility is not a huge deal anymore.	2020-07-27 15:28:55 -06:00
James Betker	ebb199e884	Get rid of safety valve (probably being encountered in val)	2020-07-26 22:51:59 -06:00
James Betker	d09ed4e5f7	Misc fixes	2020-07-26 22:44:24 -06:00
James Betker	c54784ae9e	Fix feature disc log item error	2020-07-26 22:25:59 -06:00
James Betker	9a8f227501	Allow separate dataset to pushed in for GAN-only training	2020-07-26 21:44:45 -06:00
James Betker	b06e1784e1	Fix SRG4 & switch disc "fix". hehe.	2020-07-25 17:16:54 -06:00
James Betker	e6e91a1d75	Add SRG4 Back to the idea that maybe what we need is a hybrid approach between pure switches and RDB.	2020-07-24 20:32:49 -06:00
James Betker	3320ad685f	Fix mega_batch_factor not set for test	2020-07-24 12:26:44 -06:00
James Betker	c50cce2a62	Add an abstract, configurabler weight scheduling class and apply it to the feature weight	2020-07-23 17:03:54 -06:00
James Betker	9ccf771629	Fix feature validation, wrong device Only shows up in distributed training for some reason.	2020-07-23 10:16:34 -06:00
James Betker	bba283776c	Enable find_unused_parameters for DistributedDataParallel attention_norm has some parameters which are not used to compute grad, which is causing failures in the distributed case.	2020-07-23 09:08:13 -06:00
James Betker	dbf6147504	Add switched discriminator The logic is that the discriminator may be incapable of providing a truly targeted loss for all image regions since it has to be too generic (basically the same argument for the switched generator). So add some switches in! See how it works!	2020-07-22 20:52:59 -06:00
James Betker	106b8da315	Assert that temperature is set properly in eval mode.	2020-07-22 20:50:59 -06:00
James Betker	c74b9ee2e4	Add a way to disable grad on portions of the generator graph to save memory	2020-07-22 11:40:42 -06:00
James Betker	e3adafbeac	Add convert_model.py and a hacky way to add extra layers to a model	2020-07-22 11:39:45 -06:00
James Betker	7f7e17e291	Update feature discriminator further Move the feature/disc losses closer and add a feature computation layer.	2020-07-20 20:54:45 -06:00
James Betker	46aa776fbb	Allow feature discriminator unet to only output closest layer to feature output	2020-07-19 19:05:08 -06:00
James Betker	8a9f215653	Huge set of mods to support progressive generator growth	2020-07-18 14:18:48 -06:00
James Betker	47a525241f	Make attention norm optional	2020-07-18 07:24:02 -06:00
James Betker	ad97a6a18a	Progressive SRG first check-in	2020-07-18 07:23:26 -06:00
James Betker	b08b1cad45	Fix feature decay	2020-07-16 23:27:06 -06:00
James Betker	3e7a83896b	Fix pixgan debugging issues	2020-07-16 11:45:19 -06:00
James Betker	a1bff64d1a	More fixes	2020-07-16 10:48:48 -06:00
James Betker	240f254263	More loss fixes	2020-07-16 10:45:50 -06:00
James Betker	6cfa67d831	Fix featuredisc broadcast error	2020-07-16 10:18:30 -06:00
James Betker	8d061a2687	Add u-net discriminator with feature output	2020-07-16 10:10:09 -06:00
James Betker	0c4c388e15	Remove dualoutputsrg Good idea, didn't pan out.	2020-07-16 10:09:24 -06:00
James Betker	4bcc409fc7	Fix loadSRG2 typo	2020-07-14 10:20:53 -06:00
James Betker	1e4083a35b	Apply temperature mods to all SRG models (Honestly this needs to be base classed at this point)	2020-07-14 10:19:35 -06:00
James Betker	7659bd6818	Fix temperature equation	2020-07-14 10:17:14 -06:00
James Betker	853468ef82	Allow legacy state_dicts in srg2	2020-07-14 10:03:45 -06:00
James Betker	1b1431133b	Add DualOutputSRG Also removes the old multi-return mechanism that Generators support. Also fixes AttentionNorm.	2020-07-14 09:28:24 -06:00
James Betker	a2285ff2ee	Scale anorm by transform count	2020-07-13 08:49:09 -06:00
James Betker	dd0bbd9a7c	Enable AttentionNorm on SRG2	2020-07-13 08:38:17 -06:00
James Betker	4c0f770f2a	Fix inverted temperature curve bug	2020-07-12 11:02:50 -06:00
James Betker	14d23b9d20	Fixes, do fake swaps less often in pixgan discriminator	2020-07-11 21:22:11 -06:00
James Betker	ba6187859a	err5	2020-07-10 23:02:56 -06:00
James Betker	902527dfaa	err4	2020-07-10 23:00:21 -06:00
James Betker	020b3361fa	err3	2020-07-10 22:57:34 -06:00
James Betker	b3a2c21250	err2	2020-07-10 22:52:02 -06:00
James Betker	716433db1f	err1	2020-07-10 22:50:56 -06:00
James Betker	0b7193392f	Implement unet disc The latest discriminator architecture was already pretty much a unet. This one makes that official and uses shared layers. It also upsamples one additional time and throws out the lowest upsampling result. The intent is to delete the old vgg pixdisc, but I'll keep it around for a bit since I'm still trying out a few models with it.	2020-07-10 16:24:42 -06:00
James Betker	812c684f7d	Update pixgan swap algorithm - Swap multiple blocks in the image instead of just one. The discriminator was clearly learning that most blocks have one region that needs to be fixed. - Relax block size constraints. This was in place to gaurantee that the discriminator signal was clean. Instead, just downsample the "loss image" with bilinear interpolation. The result is noisier, but this is actually probably healthy for the discriminator.	2020-07-10 15:56:14 -06:00
James Betker	33ca3832e1	Move ExpansionBlock to arch_util Also makes all processing blocks have a conformant signature. Alters ExpansionBlock to perform a processing conv on the passthrough before the conjoin operation - this will break backwards compatibilty with SRG2.	2020-07-10 15:53:41 -06:00
James Betker	5e8b52f34c	Misc changes	2020-07-10 09:45:48 -06:00
James Betker	5f2c722a10	SRG2 revival Big update to SRG2 architecture to pull in a lot of things that have been learned: - Use group norm instead of batch norm - Initialize the weights on the transformations low like is done in RRDB rather than using the scalar. Models live or die by their early stages, and this ones early stage is pretty weak - Transform multiplexer to use u-net like architecture. - Just use one set of configuration variables instead of a list - flat networks performed fine in this regard.	2020-07-09 17:34:51 -06:00
James Betker	12da993da8	More fixes...	2020-07-08 22:07:09 -06:00
James Betker	7d6eb28b87	More fixes	2020-07-08 22:00:57 -06:00
James Betker	b2507be13c	Fix up pixgan loss and pixdisc	2020-07-08 21:27:48 -06:00
James Betker	26a4a66d1c	Bug fixes and new gan mechanism - Removed a bunch of unnecessary image loggers. These were just consuming space and never being viewed - Got rid of support of artificial var_ref support. The new pixdisc is what i wanted to implement then - it's much better. - Add pixgan GAN mechanism. This is purpose-built for the pixdisc. It is intended to promote a healthy discriminator - Megabatchfactor was applied twice on metrics, fixed that Adds pix_gan (untested) which swaps a portion of the fake and real image with each other, then expects the discriminator to properly discriminate the swapped regions.	2020-07-08 17:40:26 -06:00
James Betker	4305be97b4	Update log metrics They should now be universal regardless of job configuration	2020-07-07 15:33:22 -06:00
James Betker	8a4eb8241d	SRG3 work Operates on top of a pre-trained SpineNET backbone (trained on CoCo 2017 with RetinaNet) This variant is extremely shallow.	2020-07-07 13:46:40 -06:00
James Betker	0acad81035	More SRG2 adjustments..	2020-07-06 22:40:40 -06:00
James Betker	086b2f0570	More bugs	2020-07-06 22:28:07 -06:00
James Betker	d4d4f85fc0	Bug fixes	2020-07-06 22:25:40 -06:00
James Betker	3c31bea1ac	SRG2 architectural changes	2020-07-06 22:22:29 -06:00
James Betker	9a1c3241f5	Switch discriminator to groupnorm	2020-07-06 20:59:59 -06:00
James Betker	6beefa6d0c	PixDisc - Add two more levels of losses coming from this gen at higher resolutions	2020-07-06 11:15:52 -06:00
James Betker	2636d3b620	Fix assertion error	2020-07-06 09:23:53 -06:00
James Betker	8f92c0a088	Interpolate attention well before softmax	2020-07-06 09:18:30 -06:00
James Betker	72f90cabf8	More pixdisc fixes	2020-07-05 22:03:16 -06:00
James Betker	909007ee6a	Add G_warmup Let the Generator get to a point where it is at least competing with the discriminator before firing off. Backwards from most GAN architectures, but this one is a bit different from most.	2020-07-05 21:58:35 -06:00
James Betker	a47a5dca43	Fix pixdisc bug	2020-07-05 21:57:52 -06:00
James Betker	d0957bd7d4	Alter weight initialization for transformation blocks	2020-07-05 17:32:46 -06:00
James Betker	16d1bf6dd7	Replace ConvBnRelus in SRG2 with Silus	2020-07-05 17:29:20 -06:00
James Betker	10f7e49214	Add ConvBnSilu to replace ConvBnRelu Relu produced good performance gains over LeakyRelu, but GAN performance degraded significantly. Try SiLU as an alternative to see if it's the leaky-ness we are looking for or the smooth activation curvature.	2020-07-05 13:39:08 -06:00
James Betker	9934e5d082	Move SRG1 to identical to new	2020-07-05 08:49:34 -06:00
James Betker	416538f31c	SRG1 conjoined except ConvBnRelu	2020-07-05 08:44:17 -06:00
James Betker	c58c2b09ca	Back to remove all biases (looks like a ConvBnRelu made its way in..)	2020-07-04 22:41:02 -06:00
James Betker	86cda86e94	Re-add biases, also add new init A/B testing where we lost our GAN competitiveness.	2020-07-04 22:24:42 -06:00
James Betker	b03741f30e	Remove all biases from generator Continuing to investigate loss of GAN competitiveness, this is a big difference between "old" SRG1 and "new".	2020-07-04 22:19:55 -06:00
James Betker	726e946e79	Turn BN off in SRG1 This wont work well but just testing if GAN performance comes back	2020-07-04 14:51:27 -06:00
James Betker	0ee39d419b	OrderedDict not needed	2020-07-04 14:09:27 -06:00
James Betker	9048105b72	Break out SRG1 as separate network Something strange is going on. These networks do not respond to discriminator gradients properly anymore. SRG1 did, however so reverting back to last known good state to figure out why.	2020-07-04 13:28:50 -06:00
James Betker	510b2f887d	Remove RDB from srg2 Doesnt seem to work so great.	2020-07-03 22:31:20 -06:00
James Betker	da4335c25e	Add a feature-based validation test	2020-07-03 15:18:57 -06:00
James Betker	703dec4472	Add SpineNet & integrate with SRG New version of SRG uses SpineNet for a switch backbone.	2020-07-03 12:07:31 -06:00
James Betker	3ed7a2b9ab	Move ConvBnRelu/Lelu to arch_util	2020-07-03 12:06:38 -06:00
James Betker	e9ee67ff10	Integrate RDB into SRG The last RDB for each cluster is switched.	2020-07-01 17:19:55 -06:00
James Betker	6ac6c95177	Fix scaling bug	2020-07-01 16:42:27 -06:00
James Betker	30653181ba	Experiment: get rid of post_switch_conv	2020-07-01 16:30:40 -06:00
James Betker	17191de836	Experiment: bring initialize_weights back again Something really strange going on here..	2020-07-01 15:58:13 -06:00
James Betker	d1d573de07	Experiment: new init and post-switch-conv	2020-07-01 15:25:54 -06:00
James Betker	480d1299d7	Remove RRDB with switching This idea never really panned out, removing it.	2020-07-01 12:08:32 -06:00
James Betker	e2398ac83c	Experiment: revert initialization changes	2020-07-01 12:08:09 -06:00
James Betker	78276afcaa	Experiment: Back to lelu	2020-07-01 11:43:25 -06:00
James Betker	b945021c90	SRG v2 - Move to Relu, rely on Module-based initialization	2020-07-01 11:33:32 -06:00
James Betker	604763be68	NSG r7 Converts the switching trunk to a VGG-style network to make it more comparable to SRG architectures.	2020-07-01 09:54:29 -06:00
James Betker	87f1e9c56f	Invert ResGen2 to operate in LR space	2020-06-30 20:57:40 -06:00
James Betker	e07d8abafb	NSG rev 6 - Disable style passthrough - Process multiplexers starting at base resolution	2020-06-30 20:47:26 -06:00
James Betker	3ce1a1878d	NSG improvements (r5) - Get rid of forwards(), it makes numeric_stability.py not work properly. - Do stability auditing across layers. - Upsample last instead of first, work in much higher dimensionality for transforms.	2020-06-30 16:59:57 -06:00
James Betker	75f148022d	Even more NSG improvements (r4)	2020-06-30 13:52:47 -06:00
James Betker	773753073f	More NSG improvements (v3) Move to a fully fixup residual network for the switch (no batch norms). Fix a bunch of other small bugs. Add in a temporary latent feed-forward from the bottom of the switch. Fix several initialization issues.	2020-06-29 20:26:51 -06:00
James Betker	4b82d0815d	NSG improvements - Just use resnet blocks for the multiplexer trunk of the generator - Every block initializes itself, rather than everything at the end - Cleans up some messy parts of the architecture, including unnecessary kernel sizes and places where BN is not used properly.	2020-06-29 10:09:51 -06:00
James Betker	978036e7b3	Add NestedSwitchGenerator An evolution of SwitchedResidualGenerator, this variant nests attention modules upon themselves to extend the representative capacity of the model significantly.	2020-06-28 21:22:05 -06:00
James Betker	c8a670842e	Missed networks.py in last commit	2020-06-25 18:36:06 -06:00

1 2 3 4 5 ...

262 Commits