The content of the generated sound is not correct #358

Open
opened 2023-08-30 12:48:05 +00:00 by Mo-Jiaxuan · 1 comment

Is anyone else experiencing the same problem? I've fine-tuned the model using a Japanese dataset, and when I use the trained model to generate the audio, it doesn't come back with what I've provided!
Where might this be caused by a problem?
I can provide any details of the training process

Is anyone else experiencing the same problem? I've fine-tuned the model using a Japanese dataset, and when I use the trained model to generate the audio, it doesn't come back with what I've provided! Where might this be caused by a problem? I can provide any details of the training process
Owner

The issue I've ran into with naively using Japanese is that there's a problem with the way the default tokenizer will normalize Japanese text (it will convert kana/kanji the wrong way). I honestly don't remember which issue I mentioned this in, and if said issue contained my notes on it again, but it's imperative that you swap over to the provided Japanese tokenizer for training.

The issue I've ran into with naively using Japanese is that there's a problem with the way the default tokenizer will normalize Japanese text (it will convert kana/kanji the wrong way). I honestly don't remember which issue I mentioned this in, and if said issue contained my notes on it again, but it's imperative that you swap over to the provided Japanese tokenizer for training.
Sign in to join this conversation.
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: mrq/ai-voice-cloning#358
No description provided.