Blank error when attempting to prepare dataset for training. #171

Closed
opened 2023-03-24 20:23:57 +00:00 by The13thDrifter · 10 comments

Just like it says in the title: whenever I attempt to prepare a dataset for voice training, I get a completely blank error, so I have no idea where to begin fixing it. Here is the Command Prompt log.

Traceback (most recent call last):
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\gradio\routes.py", line 394, in run_predict
    output = await app.get_blocks().process_api(
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 1075, in process_api
    result = await self.call_function(
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 884, in call_function
    prediction = await anyio.to_thread.run_sync(
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\anyio\to_thread.py", line 31, in run_sync
    return await get_asynclib().run_sync_in_worker_thread(
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 937, in run_sync_in_worker_thread
    return await future
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 867, in run
    result = context.run(func, *args)
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\src\webui.py", line 201, in prepare_dataset_proxy
    message = transcribe_dataset( voice=voice, language=language, skip_existings=skip_existings, progress=progress )
  File "C:\Users\jenik\Documents\FourChanFork\ai-voice-cloning\src\utils.py", line 1270, in transcribe_dataset
    files = sorted( get_voices(load_latents=False)[voice] )
KeyError: ''

What do you have set as your Whisper Backend?

Edit: Did you run the appropriate setup script for your hardware?

Author

Yes, I ran the appropriate setup script (Windows CUDA). As for my Whisper Backend, it's set to openAI.

Hmm. What's in the directory for the voice you're attempting to prepare the dataset from? Are the files valid .wav's?

Author

The directory leads directly to where the .wav files are stored. I'm attempting to train a Cortana voice model from the Halo 1 voice files. The Wavs have all been converted to the correct format.

D:\SteamLibrary\steamapps\common\xVATrainer\resources\app\datasets\halo_cortana\wavs

Note that the 'wavs' at the end refers to a folder named 'wavs' that has the .wav files inside it. If it helps, the blank error originally came after a couple of seconds in which it looked to be processing. Now the error is instant.

Owner

> ai-voice-cloning\src\utils.py", line 1270 (https://git.ecker.tech/mrq/ai-voice-cloning/src/branch/master/src/utils.py#L1270)

Well, to start with, you're on an outdated version, so I suggest updating: line 1270 is nowhere near that code in the upstream version.

Owner

> D:\SteamLibrary\steamapps\common\xVATrainer\resources\app\datasets\halo_cortana\wavs

Per the documentation (https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Training#prepare-dataset):

> Dataset Source: a valid folder under ./voice/, as if you were using it to generate with.

It's not a text field, it's a dropdown of names in the voice folder.
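
For anyone landing here from the same traceback: the failing line, `get_voices(load_latents=False)[voice]`, looks like a plain dictionary lookup keyed by voice folder name, so any value that is not one of those names (here, apparently an empty string) would raise a bare `KeyError`. The sketch below only illustrates that idea and is not the project's code. `list_voices` and `resolve_voice` are hypothetical helpers, and the folder layout is an assumption based on the documentation quote above.

```python
from pathlib import Path
from typing import List


def list_voices(voice_root: Path) -> List[str]:
    """Return the names of the sub-folders of voice_root, sorted.

    Hypothetical helper: these names stand in for the dropdown entries.
    """
    if not voice_root.is_dir():
        return []
    return sorted(p.name for p in voice_root.iterdir() if p.is_dir())


def resolve_voice(voice: str, voice_root: Path) -> List[Path]:
    """Return the sorted .wav files of a voice, or raise a readable error.

    `voice` must be a folder name under voice_root, not a filesystem path.
    """
    available = list_voices(voice_root)
    if voice not in available:
        # A descriptive message instead of a bare KeyError: ''.
        raise ValueError(
            f"Unknown voice {voice!r}: expected one of {available} "
            f"(folder names under {voice_root}), not a filesystem path."
        )
    return sorted((voice_root / voice).glob("*.wav"))
```

With a folder such as `./voice/halo_cortana/` holding the .wav files, `resolve_voice("halo_cortana", Path("./voice"))` would return those files, while an empty string or a full `D:\...` path would produce a message naming the valid choices.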

Author

Ah shit, I must have skipped over that; sorry, it was way too early when I set this up. I'll move it over and check to see if the error repeats with the correct path.

Please keep me posted! I'm getting this on Colab, too. It makes sense, since there's no actual option to refresh the "dataset source" list, there was no way for me to select my voice from the list, so I just entered a path as well.

> It makes sense, since there's no actual option to refresh the "dataset source" list, there was no way for me to select my voice from the list

"Refresh Voice List" on the Generate tab will do it.

Author

Okay, I'm thinking it was user error. I've just updated the software and changed the .wav file location, and I'm no longer getting a blank error. I'm instead getting a 'missing whisper.json' error, but I know that the fix for that is located in the wiki. Thanks again for the help.

Reference: mrq/ai-voice-cloning#171