Whisper transcribing isn't doing all the files #265
Labels
No Label
bug
duplicate
enhancement
help wanted
insufficient info
invalid
news
not a bug
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: mrq/ai-voice-cloning#265
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I noticed that I was only able to make a "batch size" of 108, even though I knew I had at least 250 files in the training audio folder.
Went and checked, and the train.txt only has 108 entries.
It seems that even though some of the files got sliced up (eg. 1.wmv got cut up into 1_000001.wmv and 1_000002.wmv etc), the transcribed file seems to only have the 1.wmv version, instead of transcribing the sliced up shorter versions...
Edit: Which makes me realise its keeping the un-sliced files, along with the sliced ones. Which seems wrong too.
Also also, it's not ignoring all sliced files... some of the sliced ones are being transcribed, but a lot of them aren't. (and it's not a validation thing, I checked that file and there's only like 5 things in there, so I'm still missing about 100).
Are you using whisper or whisperx? How long are the files it's skipping?