New model to speed up and improve transcriptions #439
Reference: mrq/ai-voice-cloning#439
Suggestion to improve transcription quality
New Whisper model (Distil-Whisper):
https://github.com/huggingface/distil-whisper
The paper:
https://arxiv.org/abs/2311.00430
Colab benchmark:
https://colab.research.google.com/github/sanchit-gandhi/notebooks/blob/main/Distil_Whisper_Benchmark.ipynb#scrollTo=3TKgiw8PnBK0
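For anyone who wants to try it quickly, here is a rough sketch of how Distil-Whisper might be called through the Hugging Face `transformers` pipeline. The checkpoint name `distil-whisper/distil-large-v2` is taken from the Distil-Whisper README; the helper names `build_pipeline_kwargs` and `transcribe`, and the 15-second chunk length, are my own assumptions for illustration and are not part of this repo.

```python
from typing import Any, Dict

# Checkpoint name as listed in the Distil-Whisper README; swap in another if needed.
MODEL_ID = "distil-whisper/distil-large-v2"


def build_pipeline_kwargs(use_gpu: bool, chunk_length_s: int = 15) -> Dict[str, Any]:
    """Collect arguments for transformers.pipeline (hypothetical helper, illustrative values)."""
    return {
        "task": "automatic-speech-recognition",
        "model": MODEL_ID,
        # Long audio is split into chunks of this many seconds.
        "chunk_length_s": chunk_length_s,
        # 0 selects the first CUDA device, -1 selects the CPU.
        "device": 0 if use_gpu else -1,
    }


def transcribe(audio_path: str, use_gpu: bool = False) -> str:
    """Transcribe one audio file (hypothetical helper; needs transformers, torch and ffmpeg)."""
    # Imported lazily so the helper above can be used without transformers installed.
    from transformers import pipeline

    asr = pipeline(**build_pipeline_kwargs(use_gpu))
    return asr(audio_path)["text"]
```

Calling `transcribe("sample.wav")` would download the checkpoint on first use and return the recognized text; how this would be wired into the existing transcription step here is left open.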
Another model that could be explored is whisper-jax:
https://github.com/sanchit-gandhi/whisper-jax