lightmare
|
4807072894
|
Using zfill in utils.pad
|
2023-02-18 19:09:25 +00:00 |
|
|
1f4cdcb8a9
|
rude
|
2023-02-18 17:23:44 +00:00 |
|
|
cf758f4732
|
oops
|
2023-02-18 15:50:51 +00:00 |
|
|
843bfbfb96
|
Simplified generating training YAML, cleaned it up, training output is cleaned up and will "autoscroll" (only show the last 8 lines, refer to console for a full trace if needed)
|
2023-02-18 14:51:00 +00:00 |
|
|
0dd5640a89
|
forgot that call only worked if shell=True
|
2023-02-18 14:14:42 +00:00 |
|
|
2615cafd75
|
added dropdown to select autoregressive model for TTS, fixed a bug where the settings saveer constantly fires I hate gradio so much why are dropdown.change broken to contiuously fire and send an empty array
|
2023-02-18 14:10:26 +00:00 |
|
|
a9bd17c353
|
fixes #2
|
2023-02-18 13:07:23 +00:00 |
|
|
809012c84d
|
debugging in colab is pure cock and ball torture because sometimes the files don't actually update when edited, and sometimes they update after I restart the runtime, notebook can't use venv because I can't source it in a subprocess shell call
|
2023-02-18 03:31:44 +00:00 |
|
|
915ab5f65d
|
fixes
|
2023-02-18 03:17:46 +00:00 |
|
|
650eada8d5
|
fix spawning training subprocess for unixes
|
2023-02-18 02:40:30 +00:00 |
|
|
d5c1433268
|
a bit of UI cleanup, import multiple audio files at once, actually shows progress when importing voices, hides audio metadata / latents if no generated settings are detected, preparing datasets shows its progress, saving a training YAML shows a message when done, training now works within the web UI, training output shows to web UI, provided notebook is cleaned up and uses a venv, etc.
|
2023-02-18 02:07:22 +00:00 |
|
|
c75d0bc5da
|
pulls DLAS for any updates since I might be actually updating it, added option to not load TTS on initialization to save VRAM when training
|
2023-02-17 20:43:12 +00:00 |
|
|
ad4adc960f
|
small fixes
|
2023-02-17 20:10:27 +00:00 |
|
|
bcec64af0f
|
cleanup, "injected" dvae.pth to download through tortoise's model loader, so I don't need to keep copying it
|
2023-02-17 19:06:05 +00:00 |
|
|
13c9920b7f
|
caveats while I tighten some nuts
|
2023-02-17 17:44:52 +00:00 |
|
|
f87764e7d0
|
Slight fix, getting close to be able to train from the web UI directly
|
2023-02-17 13:57:03 +00:00 |
|
|
8482131e10
|
oops x2
|
2023-02-17 06:25:00 +00:00 |
|
|
a16e6b150f
|
oops
|
2023-02-17 06:11:04 +00:00 |
|
|
59d0f08244
|
https://arch.b4k.co/v/search/text/%22TAKE%20YOUR%20DAMN%20CLOTHES%20OFF%22/type/op/
|
2023-02-17 06:06:50 +00:00 |
|
|
12933cfd60
|
added dropdown to select which whisper model to use for transcription, added note that FFMPEG is required
|
2023-02-17 06:01:14 +00:00 |
|
|
96e9acdeec
|
added preparation of LJSpeech-esque dataset
|
2023-02-17 05:42:55 +00:00 |
|
|
9c0e4666d2
|
updated notebooks to use the new "main" setup
|
2023-02-17 03:30:53 +00:00 |
|
|
f8249aa826
|
tab to generate the training YAML
|
2023-02-17 03:05:27 +00:00 |
|
|
3a078df95e
|
Initial refractor
|
2023-02-17 00:08:27 +00:00 |
|