Compare commits
5 Commits
| Author | SHA1 | Date |
|---|---|---|
| | 0e788f50e7 | |
| | a5bf81e37b | |
| | c7630f4e40 | |
| | 300c507347 | |
| | 6e7dd3f534 | |
@@ -1,6 +1,6 @@
 # AI Voice Cloning
 
-> **Note** This project is effectively abandonware due to requiring a rewrite. Please use [JarodMica/ai-voice-cloning](https://github.com/JarodMica/ai-voice-cloning).
+> **Note** This project has been in dire need of being rewritten from the ground up for some time. Apologies for any crust from my rather spaghetti code.
 
 This [repo](https://git.ecker.tech/mrq/ai-voice-cloning)/[rentry](https://rentry.org/AI-Voice-Cloning/) aims to serve as both a foolproof guide for setting up AI voice cloning tools for legitimate, local use on Windows/Linux, as well as a stepping stone for anons that genuinely want to play around with [TorToiSe](https://github.com/neonbjb/tortoise-tts).
 
@@ -16,4 +16,4 @@ Please consult [the wiki](https://git.ecker.tech/mrq/ai-voice-cloning/wiki) for
 
 ## Bug Reporting
 
 If you run into any problems, please refer to the [issues you may encounter](https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Issues) wiki page first.
@@ -38,24 +38,10 @@
 
 ],
 "source":[
-"!apt install python3.10-venv\n",
+"!apt install python3.8-venv\n",
 "!git clone https://git.ecker.tech/mrq/ai-voice-cloning/\n",
 "%cd /content/ai-voice-cloning\n",
-"# get local dependencies\n",
-"!git submodule init\n",
-"!git submodule update --remote\n",
-"# setup venv\n",
-"!python3 -m venv venv\n",
-"!source ./venv/bin/activate\n",
-"!python3 -m pip install --upgrade pip # just to be safe\n",
-"# CUDA\n",
-"!pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118\n",
-"# install requirements\n",
-"!python3 -m pip install -r ./modules/tortoise-tts/requirements.txt # install TorToiSe requirements\n",
-"!python3 -m pip install -e ./modules/tortoise-tts/ # install TorToiSe\n",
-"!python3 -m pip install -r ./modules/dlas/requirements.txt # instal DLAS requirements, last, because whisperx will break a dependency here\n",
-"!python3 -m pip install -e ./modules/dlas/ # install DLAS\n",
-"!python3 -m pip install -r ./requirements.txt # install local requirements"
+"!./setup-cuda.sh"
 ]
 },
 {
@@ -129,8 +115,7 @@
 "cell_type":"code",
 "source":[
 "%cd /content/ai-voice-cloning/\n",
-"!source ./venv/bin/activate\n",
-"!python3 ./src/main.py --share"
+"!./start.sh --share"
 ],
 "metadata":{
 "id":"QRA8jF3cF-YJ"
@@ -1,9 +1,5 @@
---extra-index-url https://download.pytorch.org/whl/cu118
-torch>=2.1.0
-torchvision
-torchaudio
-
-openai-whisper
+git+https://github.com/openai/whisper.git
 more-itertools
 ffmpeg-python
 gradio<=3.23.0
BIN
voices/1.wav
Normal file
Binary file not shown.
BIN
voices/2.wav
Normal file
Binary file not shown.
BIN
voices/3.wav
Normal file
Binary file not shown.
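
The hunk headers in this comparison (for example `-38,24 +38,10`) give the starting line and line count of each side of a hunk. As a rough illustration of how to read them, here is a minimal sketch assuming standard unified diff headers. The names `Hunk` and `parse_hunk_header` are hypothetical helpers written for this example, not part of the repository or of Gitea.

```python
import re
from typing import NamedTuple


class Hunk(NamedTuple):
    old_start: int  # first line of the hunk in the old file
    old_len: int    # number of old-file lines the hunk covers
    new_start: int  # first line of the hunk in the new file
    new_len: int    # number of new-file lines the hunk covers


# Assumption: accept one or two leading "@", since a scraped page may drop one.
HUNK_RE = re.compile(r"^@{1,2} -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def parse_hunk_header(line: str) -> Hunk:
    """Parse a unified diff hunk header into its four numbers."""
    match = HUNK_RE.match(line)
    if match is None:
        raise ValueError(f"not a hunk header: {line!r}")
    old_start, old_len, new_start, new_len = match.groups()
    # In unified diff format an omitted length means a single line.
    return Hunk(int(old_start), int(old_len or 1), int(new_start), int(new_len or 1))


hunk = parse_hunk_header("@@ -38,24 +38,10 @@")
print(hunk.old_len - hunk.new_len)  # 14
```

Read this way, the notebook setup cell covers 24 lines on the old side and 10 on the new side, which matches the manual dependency and venv steps being replaced by a single `!./setup-cuda.sh` call.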