DL-Art-School

Author	SHA1	Message	Date
James Betker	c68669e1e1	uv2 add alignment head	2022-06-14 15:18:58 -06:00
James Betker	b42b4e18de	clean up unified voice: remove unused code; fix inference model to use the terms "prior" and "posterior" to properly define the modeling order (they were inverted before); default some settings I never intend to change in the future	2022-05-09 14:45:49 -06:00
James Betker	c42c53e75a	Add a trainable network for converting a normal distribution into a latent space	2022-05-02 09:47:30 -06:00
James Betker	b712d3b72b	break out get_conditioning_latent from unified_voice	2022-05-01 23:04:44 -06:00
James Betker	b4549eed9f	uv2 fix	2022-04-20 00:27:38 -06:00
James Betker	8fe0dff33c	support tts typing	2022-04-16 23:36:57 -06:00
James Betker	8ea5c307fb	Fixes for training the diffusion model on autoregressive inputs	2022-04-11 11:02:44 -06:00
James Betker	032983e2ed	fix bug and allow position encodings to be trained separately from the rest of the model	2022-04-08 16:26:01 -06:00
James Betker	09ab1aa9bc	revert rotary embeddings work. I'm not really sure that this is going to work. I'd rather explore re-using what I've already trained.	2022-04-08 16:18:35 -06:00
James Betker	3f8d7955ef	unified_voice with rotary embeddings	2022-04-07 20:11:14 -06:00
James Betker	536511fc4b	unified_voice: relative position encodings	2022-03-22 11:41:13 -06:00
James Betker	e735d8e1fa	unified_voice fixes	2022-03-21 14:44:00 -06:00
James Betker	7929fd89de	Refactor audio-style models into the audio folder	2022-03-15 11:06:25 -06:00