(QoL improvements for) a multi-voice TTS system trained with an emphasis on quality
 
 
 
 
Go to file
mrq ac0a572679 arg to skip voice latents for grabbing voice lists (for preparing datasets) 2023-02-17 04:50:02 +07:00
config Added settings page, added checking for updates (disabled by default), some other things that I don't remember 2023-02-06 21:43:01 +07:00
convert modified conversion scripts to not give a shit about bitrate and formats since torchaudio.load handles all of that anyways, and it all gets resampled anyways 2023-02-15 04:44:14 +07:00
scripts Fix typos 2023-01-06 11:04:36 +07:00
tortoise arg to skip voice latents for grabbing voice lists (for preparing datasets) 2023-02-17 04:50:02 +07:00
voices Moved voices out of the tortoise folder because it kept being processed for setup.py 2023-02-10 20:11:56 +07:00
.gitignore added shell scripts for linux, wrapped sorted() for voice list, I guess 2023-02-06 21:54:31 +07:00
CITATION.cff update citation file 2022-04-28 23:11:09 +07:00
LICENSE Initial commit 2022-01-27 21:33:15 +07:00
MANIFEST.in Also include voices in the manifest 2022-05-31 10:31:50 +07:00
README.md fixed colab to use the new repo, reorder loading tortoise before the web UI for people who don't wait 2023-02-16 22:12:13 +07:00
README_OLD.md QoL fixes 2023-02-02 21:13:28 +07:00
list_devices.py added setting "device-override", less naively decide the number to use for results, some other thing 2023-02-15 21:51:22 +07:00
main.py oops 2023-02-17 01:46:38 +07:00
requirements.txt pip-ifying things 2023-02-16 19:48:06 +07:00
requirements_legacy.txt pip-ifying things 2023-02-16 19:48:06 +07:00
setup-cuda.bat pip-ifying things 2023-02-16 19:48:06 +07:00
setup-cuda.sh pip-ifying things 2023-02-16 19:48:06 +07:00
setup-directml.bat pip-ifying things 2023-02-16 19:48:06 +07:00
setup-rocm.sh pip-ifying things 2023-02-16 19:48:06 +07:00
setup.py pip-ifying things 2023-02-16 19:48:06 +07:00
start.bat Colab notebook (part 1) 2023-02-10 15:58:56 +07:00
start.sh Colab notebook (part 1) 2023-02-10 15:58:56 +07:00
tortoise_tts.ipynb updated notebooks to use the new "main" setup 2023-02-17 03:31:19 +07:00
update-force.bat Added option: listen path 2023-02-09 20:42:38 +07:00
update-force.sh Added option: listen path 2023-02-09 20:42:38 +07:00
update.bat modified how conditional latents are computed (before, it just happened to only bother reading the first 102400/24000=4.26 seconds per audio input, now it will chunk it all to compute latents) 2023-02-05 23:25:41 +07:00
update.sh Added option: listen path 2023-02-09 20:42:38 +07:00
webui.py actually for real fixed incrementing filenames because i had a regex that actually only worked if candidates or lines>1, cuda now takes priority over dml if you're a nut with both of them installed because you can just specify an override anyways 2023-02-16 01:06:32 +07:00

README.md

(QoL improvements for) TorToiSe

This repo is for my modifications to neonbjb/tortoise-tts.

For the original repo, please go to mrq/ai-voice-cloning.