Commit Graph

386 Commits

Author SHA1 Message Date
mrq
ebbc85fb6a finetuned => finetunes 2023-02-18 19:41:21 +00:00
mrq
8dddb560e1 Merge pull request 'Using zfill in utils.pad' (#5) from lightmare/ai-voice-cloning:zfill into master
Reviewed-on: mrq/ai-voice-cloning#5
2023-02-18 19:29:57 +00:00
lightmare
4807072894 Using zfill in utils.pad 2023-02-18 19:09:25 +00:00
mrq
1f4cdcb8a9 rude 2023-02-18 17:23:44 +00:00
mrq
13d466baf5 notebook tweaked, drive mounts and symlinks folders so I can stop having to wait a gorillion years to import voices 2023-02-18 16:30:05 +00:00
mrq
996e5217d2 apparently anything after deactivate does not get ran, as it terminates the batch script. 2023-02-18 16:20:26 +00:00
mrq
cf758f4732 oops 2023-02-18 15:50:51 +00:00
mrq
843bfbfb96 Simplified generating training YAML, cleaned it up, training output is cleaned up and will "autoscroll" (only show the last 8 lines, refer to console for a full trace if needed) 2023-02-18 14:51:00 +00:00
mrq
0dd5640a89 forgot that call only worked if shell=True 2023-02-18 14:14:42 +00:00
mrq
2615cafd75 added dropdown to select autoregressive model for TTS, fixed a bug where the settings saveer constantly fires I hate gradio so much why are dropdown.change broken to contiuously fire and send an empty array 2023-02-18 14:10:26 +00:00
mrq
a9bd17c353 fixes #2 2023-02-18 13:07:23 +00:00
mrq
c26bda4d96 finally can get training to work under the web UI 2023-02-18 03:36:08 +00:00
mrq
809012c84d debugging in colab is pure cock and ball torture because sometimes the files don't actually update when edited, and sometimes they update after I restart the runtime, notebook can't use venv because I can't source it in a subprocess shell call 2023-02-18 03:31:44 +00:00
mrq
915ab5f65d fixes 2023-02-18 03:17:46 +00:00
mrq
602d477935 crunchbangs 2023-02-18 02:46:44 +00:00
mrq
650eada8d5 fix spawning training subprocess for unixes 2023-02-18 02:40:30 +00:00
mrq
d5c1433268 a bit of UI cleanup, import multiple audio files at once, actually shows progress when importing voices, hides audio metadata / latents if no generated settings are detected, preparing datasets shows its progress, saving a training YAML shows a message when done, training now works within the web UI, training output shows to web UI, provided notebook is cleaned up and uses a venv, etc. 2023-02-18 02:07:22 +00:00
mrq
c75d0bc5da pulls DLAS for any updates since I might be actually updating it, added option to not load TTS on initialization to save VRAM when training 2023-02-17 20:43:12 +00:00
mrq
a245dc43c0 small fixes 2023-02-17 20:18:57 +00:00
mrq
67208be022 just in case 2023-02-17 20:13:00 +00:00
mrq
ad4adc960f small fixes 2023-02-17 20:10:27 +00:00
mrq
f708909687 Wiki'd 2023-02-17 19:21:31 +00:00
mrq
bcec64af0f cleanup, "injected" dvae.pth to download through tortoise's model loader, so I don't need to keep copying it 2023-02-17 19:06:05 +00:00
mrq
13c9920b7f caveats while I tighten some nuts 2023-02-17 17:44:52 +00:00
mrq
8d268bc7a3 training added, seems to work, need to test it more 2023-02-17 16:29:27 +00:00
mrq
229be0bdb8 almost 2023-02-17 15:53:50 +00:00
mrq
f87764e7d0 Slight fix, getting close to be able to train from the web UI directly 2023-02-17 13:57:03 +00:00
mrq
8482131e10 oops x2 2023-02-17 06:25:00 +00:00
mrq
a16e6b150f oops 2023-02-17 06:11:04 +00:00
mrq
59d0f08244 https://arch.b4k.co/v/search/text/%22TAKE%20YOUR%20DAMN%20CLOTHES%20OFF%22/type/op/ 2023-02-17 06:06:50 +00:00
mrq
12933cfd60 added dropdown to select which whisper model to use for transcription, added note that FFMPEG is required 2023-02-17 06:01:14 +00:00
mrq
96e9acdeec added preparation of LJSpeech-esque dataset 2023-02-17 05:42:55 +00:00
mrq
9c0e4666d2 updated notebooks to use the new "main" setup 2023-02-17 03:30:53 +00:00
mrq
f8249aa826 tab to generate the training YAML 2023-02-17 03:05:27 +00:00
mrq
3a078df95e Initial refractor 2023-02-17 00:08:27 +00:00
mrq
0456f71ec3 Initial commit 2023-02-16 19:38:15 +00:00