Error when preparing dataset #143

Closed
opened 2023-03-16 08:01:17 +00:00 by hman360 · 2 comments

A phoneme-related error is raised in utils.py at the end of the "Transcribe and Process" step.
Stack trace:

Traceback (most recent call last):
  File "D:\ai-voice-cloning\venv\lib\site-packages\gradio\routes.py", line 384, in run_predict
    output = await app.get_blocks().process_api(
  File "D:\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 1032, in process_api
    result = await self.call_function(
  File "D:\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 844, in call_function
    prediction = await anyio.to_thread.run_sync(
  File "D:\ai-voice-cloning\venv\lib\site-packages\anyio\to_thread.py", line 31, in run_sync
    return await get_asynclib().run_sync_in_worker_thread(
  File "D:\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 937, in run_sync_in_worker_thread
    return await future
  File "D:\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 867, in run
    result = context.run(func, *args)
  File "D:\ai-voice-cloning\src\webui.py", line 205, in prepare_dataset_proxy
    message = prepare_dataset( voice, use_segments=slice_audio, text_length=validation_text_length, audio_length=validation_audio_length )
  File "D:\ai-voice-cloning\src\utils.py", line 1277, in prepare_dataset
    phonemize = phonemize=args.tokenizer_json[-8:] == "ipa.json"
TypeError: 'NoneType' object is not subscriptable
Owner

As a temporary fix, go into Settings and select the default Tokenizer (./modules/tortoise/tortoise-tts/data/tokenizer.json or something), so that the tokenizer setting is no longer unset.

I'll push a fix when I get a chance.
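
For anyone hitting the same trace: the TypeError suggests that `args.tokenizer_json` was `None` when `prepare_dataset` tried to slice it. Below is a minimal sketch of a guard around that check, not necessarily what the eventual fix looks like. The helper name `uses_ipa_tokenizer` and the example paths are hypothetical and only serve to illustrate the idea.

```python
from typing import Optional


def uses_ipa_tokenizer(tokenizer_json: Optional[str]) -> bool:
    """Return True only when a tokenizer path is set and ends with 'ipa.json'.

    Hypothetical helper: it stands in for the failing expression
    `args.tokenizer_json[-8:] == "ipa.json"` from the traceback.
    """
    # Slicing None raises "TypeError: 'NoneType' object is not subscriptable",
    # so treat an unset (None or empty) tokenizer as "not IPA".
    if not tokenizer_json:
        return False
    # "ipa.json" is 8 characters long, so endswith() matches the [-8:] comparison.
    return tokenizer_json.endswith("ipa.json")


# Example paths are made up for illustration.
print(uses_ipa_tokenizer(None))
print(uses_ipa_tokenizer("./some/dir/ipa.json"))
print(uses_ipa_tokenizer("./some/dir/tokenizer.json"))
```

With a guard like this, an unset tokenizer simply disables phonemization instead of raising.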

Owner

Should be fixed in commit f9154c4db1.

Reference: mrq/ai-voice-cloning#143