ai-voice-cloning

Author	SHA1	Message	Date
mrq	d5c1433268	a bit of UI cleanup, import multiple audio files at once, actually shows progress when importing voices, hides audio metadata / latents if no generated settings are detected, preparing datasets shows its progress, saving a training YAML shows a message when done, training now works within the web UI, training output shows to web UI, provided notebook is cleaned up and uses a venv, etc.	2023-02-18 02:07:22 +00:00
mrq	c75d0bc5da	pulls DLAS for any updates since I might be actually updating it, added option to not load TTS on initialization to save VRAM when training	2023-02-17 20:43:12 +00:00
mrq	ad4adc960f	small fixes	2023-02-17 20:10:27 +00:00
mrq	bcec64af0f	cleanup, "injected" dvae.pth to download through tortoise's model loader, so I don't need to keep copying it	2023-02-17 19:06:05 +00:00
mrq	13c9920b7f	caveats while I tighten some nuts	2023-02-17 17:44:52 +00:00
mrq	8d268bc7a3	training added, seems to work, need to test it more	2023-02-17 16:29:27 +00:00
mrq	f87764e7d0	Slight fix, getting close to be able to train from the web UI directly	2023-02-17 13:57:03 +00:00
mrq	8482131e10	oops x2	2023-02-17 06:25:00 +00:00
mrq	a16e6b150f	oops	2023-02-17 06:11:04 +00:00
mrq	59d0f08244	https://arch.b4k.co/v/search/text/%22TAKE%20YOUR%20DAMN%20CLOTHES%20OFF%22/type/op/	2023-02-17 06:06:50 +00:00
mrq	12933cfd60	added dropdown to select which whisper model to use for transcription, added note that FFMPEG is required	2023-02-17 06:01:14 +00:00
mrq	96e9acdeec	added preparation of LJSpeech-esque dataset	2023-02-17 05:42:55 +00:00
mrq	9c0e4666d2	updated notebooks to use the new "main" setup	2023-02-17 03:30:53 +00:00
mrq	f8249aa826	tab to generate the training YAML	2023-02-17 03:05:27 +00:00
mrq	3a078df95e	Initial refractor	2023-02-17 00:08:27 +00:00

15 Commits