The content of the generated sound is not correct #358
Reference: mrq/ai-voice-cloning#358
Is anyone else experiencing the same problem? I've fine-tuned the model on a Japanese dataset, but when I use the trained model to generate audio, what it says doesn't match the text I provided!
What might be causing this?
I can provide any details of the training process.
The issue I've run into with naively using Japanese is that the default tokenizer normalizes Japanese text incorrectly (it will convert kana/kanji the wrong way). I honestly don't remember which issue I mentioned this in, or whether that issue also contained my notes on it, but it's imperative that you swap over to the provided Japanese tokenizer for training.
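As a rough illustration of why this matters, here is a minimal Python sketch for checking whether a transcript line contains kana/kanji before training. Both `contains_japanese` and `naive_ascii_normalize` are hypothetical helpers written for this example; the second is only a stand-in for an ASCII-oriented cleaner and is not the actual normalization code in the repo.

```python
import re

# Hiragana, katakana, and the main CJK ideograph block (covers most kanji).
JAPANESE_CHARS = re.compile(r"[\u3040-\u309f\u30a0-\u30ff\u4e00-\u9fff]")


def contains_japanese(text: str) -> bool:
    """Return True if the text has any kana or kanji characters."""
    return JAPANESE_CHARS.search(text) is not None


def naive_ascii_normalize(text: str) -> str:
    """Hypothetical ASCII-only cleaner (an assumption, NOT the real tokenizer).

    It drops every non-ASCII character, then lowercases and trims the rest.
    """
    return text.encode("ascii", "ignore").decode("ascii").lower().strip()


if __name__ == "__main__":
    for line in ["Hello there", "こんにちは、世界"]:
        # Show the line, whether it has Japanese text, and what survives.
        print(repr(line), contains_japanese(line), repr(naive_ascii_normalize(line)))
```

If a check like this flags lines in your dataset, text run through an English-oriented normalization step won't match what was actually spoken, which is the kind of mismatch a Japanese tokenizer is meant to avoid.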