# AI Voice Cloning This [repo](https://git.ecker.tech/mrq/ai-voice-cloning)/[rentry](https://rentry.org/AI-Voice-Cloning/) aims to serve as both a foolproof guide for setting up AI voice cloning tools for legitimate, local use on Windows/Linux, as well as a stepping stone for anons that genuinely want to play around with [TorToiSe](https://github.com/neonbjb/tortoise-tts). Similar to my own findings for Stable Diffusion image generation, this rentry may appear a little disheveled as I note my new findings with TorToiSe. Please keep this in mind if the guide seems to shift a bit or sound confusing. >\>Ugh... why bother when I can just abuse 11.AI? You're more than welcome to, but TorToiSe is shaping up to be a very promising tool, especially with finetuning now on the horizon. This is not endorsed by [neonbjb](https://github.com/neonbjb/). I do not expect this to run into any ethical issues, as it seems (like me), this is mostly for making funny haha vidya characters say funny lines. ## Documentation Please consult [the wiki](https://git.ecker.tech/mrq/ai-voice-cloning/wiki) for the documentation, including how to install, prepare voices for, and use the software. ## Bug Reporting If you run into any problems, please refer to the [issues you may encounter](https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Issues) wiki page first. Please don't hesitate to submit an issue. ## Changelogs Below will be a rather-loose changelogss, as I don't think I have a way to chronicle them outside of commit messages: ### `2023.02.22` * greatly reduced VRAM consumption through the use of [TimDettmers/bitsandbytes](https://github.com/TimDettmers/bitsandbytes) * cleaned up section of code that handled parsing output from training script * added button to reconnect to the training script's output (sometimes skips a line to update, but it's better than nothing) * actually update submodules from the update script (somehow forgot to pass `--remote`) ### `Before 2023.02.22` Refer to commit logs.