ai-voice-cloning/README.md

# AI Voice Cloning

This [repo](https://git.ecker.tech/mrq/ai-voice-cloning)/[rentry](https://rentry.org/AI-Voice-Cloning/) aims to serve as both a foolproof guide for setting up AI voice cloning tools for legitimate, local use on Windows/Linux, as well as a stepping stone for anons that genuinely want to play around with [TorToiSe](https://github.com/neonbjb/tortoise-tts).

Similar to my own findings for Stable Diffusion image generation, this rentry may appear a little disheveled as I note my new findings with TorToiSe. Please keep this in mind if the guide seems to shift a bit or sound confusing.

>\>Ugh... why bother when I can just abuse 11.AI?

You're more than welcome to, but TorToiSe is shaping up to be a very promising tool, especially with finetuning now on the horizon.

This is not endorsed by [neonbjb](https://github.com/neonbjb/). I do not expect this to run into any ethical issues, as it seems (like me), this is mostly for making funny haha vidya characters say funny lines.

## Documentation

Please consult [the wiki](https://git.ecker.tech/mrq/ai-voice-cloning/wiki) for the documentation, including how to install, prepare voices for, and use the software.

## Bug Reporting

If you run into any problems, please refer to the [issues you may encounter](https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Issues) wiki page first. Please don't hesitate to submit an issue.

## Changelogs

Below will be a rather-loose changelogss, as I don't think I have a way to chronicle them outside of commit messages:

### `2023.02.22`

* greatly reduced VRAM consumption through the use of [TimDettmers/bitsandbytes](https://github.com/TimDettmers/bitsandbytes)
* cleaned up section of code that handled parsing output from training script
* added button to reconnect to the training script's output (sometimes skips a line to update, but it's better than nothing)
* actually update submodules from the update script (somehow forgot to pass `--remote`)

### `Before 2023.02.22`

Refer to commit logs.
Initial refractor 2023-02-17 00:08:27 +00:00			`# AI Voice Cloning`
Initial commit 2023-02-16 19:38:15 +00:00
added preparation of LJSpeech-esque dataset 2023-02-17 05:42:55 +00:00			`This [repo](https://git.ecker.tech/mrq/ai-voice-cloning)/[rentry](https://rentry.org/AI-Voice-Cloning/) aims to serve as both a foolproof guide for setting up AI voice cloning tools for legitimate, local use on Windows/Linux, as well as a stepping stone for anons that genuinely want to play around with [TorToiSe](https://github.com/neonbjb/tortoise-tts).`
Initial refractor 2023-02-17 00:08:27 +00:00
			`Similar to my own findings for Stable Diffusion image generation, this rentry may appear a little disheveled as I note my new findings with TorToiSe. Please keep this in mind if the guide seems to shift a bit or sound confusing.`

			`>\>Ugh... why bother when I can just abuse 11.AI?`

added preparation of LJSpeech-esque dataset 2023-02-17 05:42:55 +00:00			`You're more than welcome to, but TorToiSe is shaping up to be a very promising tool, especially with finetuning now on the horizon.`
Initial refractor 2023-02-17 00:08:27 +00:00
			`This is not endorsed by [neonbjb](https://github.com/neonbjb/). I do not expect this to run into any ethical issues, as it seems (like me), this is mostly for making funny haha vidya characters say funny lines.`

Wiki'd 2023-02-17 19:21:31 +00:00			`## Documentation`
Initial refractor 2023-02-17 00:08:27 +00:00
finally can get training to work under the web UI 2023-02-18 03:36:08 +00:00			`Please consult [the wiki](https://git.ecker.tech/mrq/ai-voice-cloning/wiki) for the documentation, including how to install, prepare voices for, and use the software.`

			`## Bug Reporting`

huge success 2023-02-23 06:24:54 +00:00			`If you run into any problems, please refer to the [issues you may encounter](https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Issues) wiki page first. Please don't hesitate to submit an issue.`

			`## Changelogs`

			`Below will be a rather-loose changelogss, as I don't think I have a way to chronicle them outside of commit messages:`

			### `2023.02.22`

			`* greatly reduced VRAM consumption through the use of [TimDettmers/bitsandbytes](https://github.com/TimDettmers/bitsandbytes)`
			`* cleaned up section of code that handled parsing output from training script`
			`* added button to reconnect to the training script's output (sometimes skips a line to update, but it's better than nothing)`
			* actually update submodules from the update script (somehow forgot to pass `--remote`)

			### `Before 2023.02.22`

			`Refer to commit logs.`