Training using "ipa" tokenizers
#467
Open
opened
Loading…
Reference in New Issue
There is no content yet.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. It CANNOT be undone. Continue?
Guys has anyone tried to use ipa tokenizers for training? I can't, get the error "expected string or bytes-like object".
You would have to provide more details to help you in this specific case, but if you are going into phonemes I'd suggest creating your own phonemizer using kaikki's dictionary and falling back onto the default what this repo uses [espeak NG]. Espeak doesn't really handle some words well in my experience.