Set Whisper default to Base-EN #59
Labels
No Label
bug
duplicate
enhancement
help wanted
insufficient info
invalid
news
not a bug
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: mrq/ai-voice-cloning#59
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
https://github.com/openai/whisper#:~:text=The%20.en%20models%20for%20English%2Donly%20applications%20tend%20to%20perform%20better%2C%20especially%20for%20the%20tiny.en%20and%20base.en%20models.%20We%20observed%20that%20the%20difference%20becomes%20less%20significant%20for%20the%20small.en%20and%20medium.en%20models.
The en versions of whisper, such as the tiny-en, outperforms the normal base. I'd suggest because nobody would probably ever look, set the default to base-en. The majority of users are english so it's reasonable and they would probably enjoy better accuracy.
I'll compromise and have it automatically use the
-en
version if a non-en
model is selected. desu, I think it's better to have it default to a more universal model over having a more "accurate" one, as if you're looking for better accuracy, you'd change the model anyways.Yeah that sounds like a good middle ground. It's only the english model that get's this benefit anyway.
Compromise implemented in commit
3e220ed306
.Default language is set to
en
(compatible with both implemented whisper implementations).When a specialized model for a language is detected (so just
${model}.en
), it'll load that whisper model instead.To override this (load the universal model with English), just leave the language blank, as the universal model will automatically deduce the language this way, anyways.