Training using "ipa" tokenizers #467

Open
opened 2024-01-19 01:20:14 +00:00 by bahm9919 · 1 comment

Guys has anyone tried to use ipa tokenizers for training? I can't, get the error "expected string or bytes-like object".

Guys has anyone tried to use ipa tokenizers for training? I can't, get the error "expected string or bytes-like object".

You would have to provide more details to help you in this specific case, but if you are going into phonemes I'd suggest creating your own phonemizer using kaikki's dictionary and falling back onto the default what this repo uses [espeak NG]. Espeak doesn't really handle some words well in my experience.

You would have to provide more details to help you in this specific case, but if you are going into phonemes I'd suggest creating your own phonemizer using [kaikki's dictionary](https://kaikki.org/dictionary/English/index.html) and falling back onto the default what this repo uses [espeak NG]. Espeak doesn't really handle some words well in my experience.
Sign in to join this conversation.
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: mrq/ai-voice-cloning#467
No description provided.