mrq/vall-e

Author	SHA1	Message	Date
mrq	9384900ce6	revert the frankensteined "train one model but hotload the other" since it kept loading the last exported weights and I'm not supporting this usecase anymore anyways	2023-09-22 13:04:17 -05:00
mrq	22ffaf3a33	have loss for the NAR not-ignore the text prompt, I imagine this should help the NAR and explain why it's always had a bit of an issue with training	2023-09-15 19:08:44 -05:00
mrq	4aef798135	added picking final candidate based on sum of score instead of first candidate (this changes nothing).	2023-09-13 13:19:11 -05:00
mrq	23a5fdd645	implemented a naive beam search (I really should be taking a break)	2023-09-12 21:28:07 -05:00
mrq	b2907ae7e0	seems that my PromEmbedding/RespEmbedding doesn't actually work all that well, naively using dedicated MultiEmbeddings for AR/NAR in the monolithic model is the best way to go	2023-09-08 01:03:24 -05:00
mrq	57db3ccfa8	shuffled VALL-E continuous as a task tts-c instead, logic fixes for it	2023-09-02 12:23:40 -05:00
mrq	e40c0d34a0	somewhat got recurrent forward working (it's as accurate as chunkwise forward: it's not accurate at all), added option to use AMP instead of blanket setting the weight's dtype	2023-09-01 20:58:29 -05:00
mrq	7f4388e591	added total samples processed and tokens processed (len of text tokens + len of target response tokens)	2023-08-28 11:02:45 -05:00
mrq	87c4bfedba	added ability to mark models as disabled for training, and hotloading them for eval/validation (useful if training only one model, or training a model per GPU)	2023-08-27 12:26:12 -05:00
mrq	2d1a9f10c0	nightmare of spaghetti that might break compat; mechanism to increase RVQ bins of an existing model without retraining, keeps sampled proms/resps at max RVQ level and trim off excess levels according to what model receives them, some other things I already forgot (I really hope no one else has weights being baked right now)	2023-08-19 15:06:33 -05:00
mrq	6ca347e1e1	literally had a urethra moment before going to bed with a way to implement cse/nse tasks	2023-08-19 01:16:46 -05:00
mrq	508677fcd5	repaired auraloss loss calc during eval/val	2023-08-18 21:19:47 -05:00
mrq	fb4e816823	oops	2023-08-18 21:11:19 -05:00
mrq	2a71486cb6	preparing for SpeechX extensions	2023-08-18 20:58:07 -05:00
mrq	8e7f900210	forgot the =	2023-08-17 19:07:59 -05:00
mrq	3ff7cf8341	maybe fix evaluation dataset not being capped to cfg.evaluation.size	2023-08-17 18:56:37 -05:00
mrq	d7deaf6def	distributed training works now (hopefully)	2023-08-13 22:07:45 -05:00
mrq	2af09d0bef	fixed that mysterious discrepancy between the reported losses (I am so freaking mad, my piss is boiling, I had to interrupt halfway through an epoch)	2023-08-05 15:25:41 -05:00
mrq	d1b9770d41	set model to eval when inferencing (very important)	2023-08-05 04:29:05 +00:00
mrq	d89568a96e	some fixes for the local framework	2023-08-05 03:22:15 +00:00
mrq	012f54b7f1	another classic commit so i can copy it to another machine to gut out things and use the trainer bits for a side project that I should really get around to working on sooner than later	2023-08-04 14:21:30 -05:00
mrq	c85101403f	big cleanup	2023-08-03 20:26:36 -05:00
mrq	2e03e5ac93	Fixed an issue where having fairseq installed at all would brick logging	2023-08-02 22:57:10 -05:00
mrq	bf8cedc9dd	Rewrite init	2023-08-02 21:53:35 +00:00

24 Commits