psammites
  • Joined on 2023-03-11
psammites commented on issue mrq/ai-voice-cloning#322 2023-08-04 19:23:19 +00:00
Non-English Tokenizer

It depends on whether you intend to process input in bopomofo, pinyin, or Chinese characters (中文).

psammites commented on issue mrq/ai-voice-cloning#319 2023-07-30 16:53:35 +00:00
Transcription and diarization (speaker identification) - Easy dataset building?

It's supported by whisperx, see notes on Speaker Diarization in the README.

psammites commented on issue mrq/ai-voice-cloning#303 2023-07-11 12:55:48 +00:00
No module named 'rpds.rpds' I am not fully sure why

Activate the virtual environment with venv\Scripts\activate, then run pip install -r .\requirements.txt
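
If it's unclear whether the venv is actually active or which modules are still missing, a quick check along these lines might help. This is only a sketch: `in_virtualenv` and `missing_modules` are hypothetical helper names, not part of the repo, and the module list is just whatever you want to probe.

```python
import importlib
import sys


def in_virtualenv():
    """Return True when the running interpreter belongs to a virtual environment."""
    # Inside a venv, sys.prefix points at the venv while sys.base_prefix
    # still points at the base Python installation.
    return sys.prefix != sys.base_prefix


def missing_modules(names):
    """Return the subset of `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:  # ModuleNotFoundError is a subclass of ImportError
            missing.append(name)
    return missing


if __name__ == "__main__":
    print("virtualenv active:", in_virtualenv())
    print("missing:", missing_modules(["rpds", "tortoise"]))
```

If the first line prints False, the packages are probably going into the system Python rather than the venv.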

psammites commented on issue mrq/ai-voice-cloning#306 2023-07-11 12:52:40 +00:00
"System error" when using Bark as TTS

raise LibsndfileError(err, prefix="Error opening {0!r}: ".format(self.name))
soundfile.LibsndfileError: Error opening './training/tboiNarrator/audio/see 4ever 1_00000.wav': System error.

Mea…

psammites commented on issue mrq/ai-voice-cloning#302 2023-07-10 22:05:46 +00:00
Error with training can't find dlas module, tried manual install

Might be better to re-clone the whole repo and run setup again.

psammites commented on issue mrq/ai-voice-cloning#302 2023-07-09 17:30:24 +00:00
Error with training can't find dlas module, tried manual install

git submodule update --remote

psammites commented on issue mrq/ai-voice-cloning#295 2023-07-06 16:08:11 +00:00
training

That should be more than enough but wouldn't hurt to try 96 anyway.

psammites commented on issue mrq/ai-voice-cloning#295 2023-07-06 03:46:31 +00:00
training

How much VRAM do you have? If it's 8 GB or less, reduce the number of training elements to 96 and try a batch size of 32.

psammites commented on issue mrq/ai-voice-cloning#292 2023-07-05 19:13:41 +00:00
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x84 in position 0: invalid start byte

"Error no kernel image is available for execution on the device" likely indicates that you don't have CUDA installed correctly.

psammites commented on issue mrq/ai-voice-cloning#294 2023-07-05 18:55:51 +00:00
no module named "tortoise"

You can ignore the stuff about valle and bark, they're optional.

psammites commented on issue mrq/ai-voice-cloning#294 2023-07-05 18:08:24 +00:00
no module named "tortoise"

Did you run the appropriate setup script for your platform?

psammites commented on issue mrq/ai-voice-cloning#293 2023-07-05 16:18:46 +00:00
Suggestion on how to package as Docker container

There's one for vanilla Tortoise-TTS which would probably make a good starting point.

psammites commented on issue mrq/ai-voice-cloning#289 2023-07-04 17:14:33 +00:00
conditioning_length: 44000 is different to sample rate?

Changing the sample rate may not have any noticeable effect other than increasing training times, see notes regarding RVQ bins in #152.

psammites commented on issue mrq/ai-voice-cloning#290 2023-07-04 17:12:18 +00:00
No module named 'tortoise.api'

Did you run the appropriate setup script for your platform?

psammites commented on issue mrq/ai-voice-cloning#286 2023-06-29 14:07:54 +00:00
Deep voices

Both low and high-pitched voices come out closer to the median. Might improve with more training cycles but I usually just pitch-shift it with ffmpeg.
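
For reference, one way the ffmpeg pitch shift could be scripted is sketched below. It is not necessarily the exact command used here: it assumes the common `asetrate` / `aresample` / `atempo` filter chain, and `pitch_shift_command` and its parameters are hypothetical names for illustration. `sample_rate` has to match the actual rate of the input file.

```python
import subprocess


def pitch_shift_command(src, dst, semitones, sample_rate=22050):
    """Build an ffmpeg command that shifts pitch by `semitones`, keeping duration."""
    factor = 2 ** (semitones / 12)  # frequency ratio for the requested shift
    # asetrate reinterprets the sample rate (changes pitch and speed together),
    # aresample converts back to the original rate,
    # atempo undoes the speed change so the clip keeps its length.
    filters = (
        f"asetrate={round(sample_rate * factor)},"
        f"aresample={sample_rate},"
        f"atempo={1 / factor:.6f}"
    )
    return ["ffmpeg", "-i", src, "-af", filters, dst]


if __name__ == "__main__":
    # Example: lower a generated clip by two semitones (file names are placeholders).
    cmd = pitch_shift_command("generated.wav", "generated_low.wav", -2)
    subprocess.run(cmd, check=True)
```

Negative values deepen the voice, positive values raise it. Older ffmpeg builds limit `atempo` to the range 0.5 to 2.0, so this sketch is only meant for shifts within one octave.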

psammites commented on issue mrq/ai-voice-cloning#287 2023-06-29 14:00:57 +00:00
Voice Chunk Size

No, the opposite.