Commit Graph

347 Commits

Author SHA1 Message Date
mrq
730a04708d added flag to disable preprocessing (because some IPAs will turn into ASCII, implicitly enable for using the specific ipa.json tokenizer vocab) 2023-03-16 04:24:32 +00:00
mrq
71cc43e65c added a flag (thanks gannybal) 2023-02-26 14:56:26 +00:00
James Betker
a1bbde8a43 few things 2022-07-26 11:52:03 -06:00
James Betker
4c3413d008 Support aac datatypes 2022-07-01 00:44:20 -06:00
James Betker
7a9c4310e8 support reading cheaters directly 2022-06-23 11:39:10 -06:00
James Betker
f0117150d0 produce correct clip_lengths.. 2022-06-21 20:21:12 -06:00
James Betker
0e5a3f4712 We don't need that encoder either.. 2022-06-19 23:24:42 -06:00
James Betker
a659cd865c All the stuff needed for cheater latent generation 2022-06-19 23:12:52 -06:00
James Betker
e67e82be2d misc 2022-06-09 21:14:48 -06:00
James Betker
d3a60633a3 codes generation script 2022-06-03 11:02:28 -06:00
James Betker
c0db85bf4f music quantizer 2022-05-31 21:06:54 -06:00
James Betker
5efeee6b97 fix type bug 2022-05-27 11:19:30 -06:00
James Betker
00e133afa9 support not_ew too 2022-05-25 08:58:23 -06:00
James Betker
5188866bd5 facepalm 2022-05-24 14:55:51 -06:00
James Betker
1d758c3bc8 try and make repeated failures recover better 2022-05-23 08:50:27 -06:00
James Betker
9f16b25ce5 introduce prepadlength 2022-05-23 08:21:27 -06:00
James Betker
37640e9759 Squelch annoying warning 2022-05-22 12:12:40 -06:00
James Betker
6a2c29f596 Fix inverted logic 2022-05-17 15:39:07 -06:00
James Betker
519151d83f m2v 2022-05-17 15:37:59 -06:00
James Betker
8202b9f39c some stuff 2022-05-15 21:50:54 -06:00
James Betker
eb64d18075 Fix phoneme tokenizer 2022-05-13 17:56:26 -06:00
James Betker
51f8c1bced phonetic dataset 2022-05-12 11:57:28 -06:00
James Betker
c85ab738c5 paired fix 2022-04-16 23:41:57 -06:00
James Betker
8fe0dff33c support tts typing 2022-04-16 23:36:57 -06:00
James Betker
48cb6a5abd misc 2022-04-16 20:28:04 -06:00
James Betker
628569af7b Another fix 2022-04-08 09:41:18 -06:00
James Betker
45804177b8 more stuff 2022-03-25 00:03:18 -06:00
James Betker
95ea0a592f More cleaning 2022-03-16 12:05:56 -06:00
James Betker
d186414566 More spring cleaning 2022-03-16 12:04:00 -06:00
James Betker
f563a8dd41 fixes 2022-03-15 21:43:00 -06:00
James Betker
7929fd89de Refactor audio-style models into the audio folder 2022-03-15 11:06:25 -06:00
James Betker
9bbbe26012 update audio_with_noise 2022-03-12 20:41:47 -07:00
James Betker
896accb71f data and prep improvements 2022-03-12 15:10:11 -07:00
James Betker
726e30c4f7 Update noise augmentation dataset to include voices that are appended at the end of another clip. 2022-03-09 09:43:10 -07:00
James Betker
38fd9fc985 Improve efficiency of audio_with_noise_dataset 2022-03-08 15:50:13 -07:00
James Betker
30ddac69aa lots of bad entries 2022-03-05 23:15:59 -07:00
James Betker
dcf98df0c2 ++ 2022-03-05 23:12:34 -07:00
James Betker
64d764ccd7 fml 2022-03-05 23:11:10 -07:00
James Betker
ef63ff84e2 pvd2 2022-03-05 23:08:39 -07:00
James Betker
1a05712764 pvd 2022-03-05 23:05:29 -07:00
James Betker
db0c3340ac Implement guidance-free diffusion in eval. And a few other fixes 2022-03-01 11:49:36 -07:00
James Betker
03752c1cd6 Report NaN 2022-02-22 23:09:37 -07:00
James Betker
af50afe222 pairedvoice: error out if clip is too short 2022-02-21 19:11:10 -07:00
James Betker
79e8f36d30 Convert CLIP models into new folder 2022-02-15 20:53:07 -07:00
James Betker
c24682c668 Record load times in fast_paired_dataset 2022-02-07 15:45:38 -07:00
James Betker
5ae816bead ctc gen checkin 2022-02-05 15:59:53 -07:00
James Betker
8fb147e8ab add an autoregressive ctc code generator 2022-02-04 11:00:15 -07:00
James Betker
4249681c4b Mods to support a autoregressive CTC code generator 2022-02-03 19:58:54 -07:00
James Betker
8c255811ad more fixes 2022-01-25 17:57:16 -07:00
James Betker
91b4b240ac dont pickle unique files 2022-01-21 00:02:06 -07:00