@psammites yes it's correct
So before a vowel is it labiodental? Or is it bilabial and part of a diphthong?
phonemize and the SAMPA included in SOFES (converted to IPA) don't…
Before a vowel, the pronunciation is labiodental, [ʋ].[5] After a vowel, the pronunciation is bilabial [u̯] and forms a diphthong.[5][13][14] At the beginning of a syllable, before…
I saw you started training the Slovenian model. Any results? Could you share some examples?
@mrq I suggest a change. Instead of:
phonemes = phonemizer( text, preserve_punctuation=True, strip=True )
could you try this:
` phonemes = phonemize(text,language=lang,strip=True,…
@nk990 Do you have an IPA-annotated Slovenian dataset? I added the missing symbols to models/tokenizers/ipa.json but SOFES is transcribed in SAMPA and the UCLA Phonetics Lab Slovenian corpus is…
With the default tokenizer, yeah. It just strips out all the accents. That's part of the reason @nk990 is running into issues: to it, there's no difference between [č] and [c] (or [ć], or [ĉ],…
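For anyone who wants to see that collapse concretely, here is a minimal sketch. It assumes the accent stripping behaves like Unicode decomposition followed by dropping the combining marks; that is an assumption for illustration, and the tokenizer's actual code may do it differently. `strip_accents` is a hypothetical helper, not a function from the repo.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Hypothetical stand-in for the tokenizer's accent stripping (assumed behaviour)."""
    # NFD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Dropping the combining marks leaves only the bare base letters.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# All of these end up as the same plain letter once the accents are gone.
for letter in ("\u010d", "\u0107", "\u0109", "c"):  # č, ć, ĉ, c
    print(letter, "->", strip_accents(letter))
```

Under that assumption the model never sees a distinct symbol for the accented consonants, so it cannot learn a separate sound for them.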
Post-nap, post-coffee edit: Replace [č] with [q], [ž] with [x], crank up the Text LR Ratio to the max and let her rip. Should work without replacing the tokenizer or diffusion model.
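A rough sketch of that substitution as a preprocessing step on the transcripts, in case it helps. Only the č→q and ž→x pairs come from the comment above; the uppercase pairs, the NFC normalization, and the name `apply_workaround` are assumptions added for illustration, not anything in the repo.

```python
import unicodedata

# Lowercase pairs are from the comment above; uppercase pairs are an added assumption.
SUBSTITUTIONS = str.maketrans({
    "\u010d": "q",  # č
    "\u017e": "x",  # ž
    "\u010c": "Q",  # Č
    "\u017d": "X",  # Ž
})


def apply_workaround(text: str) -> str:
    """Hypothetical helper: swap accented letters for otherwise unused ASCII letters."""
    # Normalize to NFC first so "c + combining caron" is matched as a single character.
    return unicodedata.normalize("NFC", text).translate(SUBSTITUTIONS)


print(apply_workaround("\u010debela in \u017eaba"))  # prints "qebela in xaba"
```

The idea, as far as I can tell, is that q and x are not used in native Slovenian spelling, so the replaced letters survive accent stripping as distinct tokens.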
this…
At last, I can train it to speak Ubykh (or at least pronounce გვფრცქვნი)!
@psammites Can you also share your experience with us? Like how many clips do you have in your…
Nope, I used the default config settings. I'm going to try increasing the text LR ratio. Could you also share the config file you've been using for Japanese, just to compare the parameters?
Your copy is a couple of days out of date.
Right, after getting the latest version everything works fine. Thanks for your great work!