whisper.json #177

Closed
opened 2023-03-26 12:42:40 +07:00 by alxis · 5 comments

Hello dev first of all thanks alot for this awesome tool , i just started using it but when i try to create dataset i get this error

Loading specialized model for language: en
Loading Whisper model: base.en
Loaded Whisper model
Failed to transcribe: ./voices\zelda\vocal_000.wav
Failed to transcribe: ./voices\zelda\vocal_001.wav
Failed to transcribe: ./voices\zelda\vocal_002.wav
Failed to transcribe: ./voices\zelda\vocal_003.wav
Failed to transcribe: ./voices\zelda\vocal_004.wav
Failed to transcribe: ./voices\zelda\vocal_005.wav
Failed to transcribe: ./voices\zelda\vocal_006.wav
Failed to transcribe: ./voices\zelda\vocal_007.wav
Failed to transcribe: ./voices\zelda\vocal_008.wav
Traceback (most recent call last):
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\routes.py", line 394, in run_predict
    output = await app.get_blocks().process_api(
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 1075, in process_api
    result = await self.call_function(
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 884, in call_function
    prediction = await anyio.to_thread.run_sync(
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\to_thread.py", line 31, in run_sync
    return await get_asynclib().run_sync_in_worker_thread(
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 937, in run_sync_in_worker_thread
    return await future
  File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 867, in run
    result = context.run(func, *args)
  File "D:\ai\ai-audio-clone\ai-voice-cloning\src\webui.py", line 208, in prepare_dataset_proxy
    message = prepare_dataset( voice, use_segments=slice_audio, text_length=validation_text_length, audio_length=validation_audio_length, progress=progress )
  File "D:\ai\ai-audio-clone\ai-voice-cloning\src\utils.py", line 1570, in prepare_dataset
    raise Exception(f"Missing dataset: {infile}")
Exception: Missing dataset: ./training/zelda//whisper.json
Hello dev first of all thanks alot for this awesome tool , i just started using it but when i try to create dataset i get this error ``` Loading specialized model for language: en Loading Whisper model: base.en Loaded Whisper model Failed to transcribe: ./voices\zelda\vocal_000.wav Failed to transcribe: ./voices\zelda\vocal_001.wav Failed to transcribe: ./voices\zelda\vocal_002.wav Failed to transcribe: ./voices\zelda\vocal_003.wav Failed to transcribe: ./voices\zelda\vocal_004.wav Failed to transcribe: ./voices\zelda\vocal_005.wav Failed to transcribe: ./voices\zelda\vocal_006.wav Failed to transcribe: ./voices\zelda\vocal_007.wav Failed to transcribe: ./voices\zelda\vocal_008.wav Traceback (most recent call last): File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\routes.py", line 394, in run_predict output = await app.get_blocks().process_api( File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 1075, in process_api result = await self.call_function( File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\gradio\blocks.py", line 884, in call_function prediction = await anyio.to_thread.run_sync( File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\to_thread.py", line 31, in run_sync return await get_asynclib().run_sync_in_worker_thread( File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 937, in run_sync_in_worker_thread return await future File "D:\ai\ai-audio-clone\ai-voice-cloning\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 867, in run result = context.run(func, *args) File "D:\ai\ai-audio-clone\ai-voice-cloning\src\webui.py", line 208, in prepare_dataset_proxy message = prepare_dataset( voice, use_segments=slice_audio, text_length=validation_text_length, audio_length=validation_audio_length, progress=progress ) File "D:\ai\ai-audio-clone\ai-voice-cloning\src\utils.py", line 1570, in prepare_dataset raise Exception(f"Missing dataset: {infile}") Exception: Missing dataset: ./training/zelda//whisper.json ```

Are those .wav files valid? You can use ffprobe to check.

Are those .wav files valid? You can use `ffprobe` to check.

they work fine in generating tab

they work fine in generating tab

Are those .wav files valid? You can use ffprobe to check.

how to do this

> Are those .wav files valid? You can use `ffprobe` to check. how to do this

how to do this

> how to do this ![](https://files.catbox.moe/w3ha7g.png)

thanks it was that ffmpeg was not set in the path as environment variable ,everything is working now

for anyone with same issue just download ffmpeg-6.0 build and set it up as environment variable

thanks it was that ffmpeg was not set in the path as environment variable ,everything is working now for anyone with same issue just download ffmpeg-6.0 build and set it up as environment variable
alxis closed this issue 2023-03-26 18:19:27 +07:00
Sign in to join this conversation.
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: mrq/ai-voice-cloning#177
There is no content yet.