A very strange Issue: "Exception: Empty dataset Error" #465
Reference: mrq/ai-voice-cloning#465
I'm using this tool to clone my own voice in Italian, so I set the transcription language to "it" in the Whisper config. I prepared my dataset following a well-explained tutorial and successfully transcribed a 12-minute dataset with Whisper, but all the transcriptions ended up only in the validation.txt file. There is nothing in the train.txt file. As far as I know, Italian is well supported by Whisper, so why is train.txt empty? When I go to the next step (the Generate Configuration tab), this is the error that appears (see the screenshots). I don't know how to get rid of it. I hope someone can help me. Thank you.
The problem is that the transcription is being written to the wrong file. You need to copy the transcribed caption data from the validation.txt file over to the train.txt file. It took me forever to notice.
Edit: this worked for an English-based project, but I'm not sure whether it would work for other languages.
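For anyone who would rather script the copy than do it by hand, here is a minimal sketch of the workaround described above. It assumes only that validation.txt and train.txt sit in the same dataset folder; the function name `fill_empty_train_file` and the folder path you pass in are hypothetical, not part of the tool. It copies only when train.txt is missing or empty, so existing training data is never overwritten.

```python
from pathlib import Path
import shutil


def fill_empty_train_file(dataset_dir: str) -> bool:
    """Copy validation.txt over train.txt when train.txt is missing or empty.

    Hypothetical helper, not part of the tool. Returns True if a copy was
    made, False otherwise.
    """
    folder = Path(dataset_dir)
    validation = folder / "validation.txt"
    train = folder / "train.txt"

    # Nothing to copy from: leave everything untouched.
    if not validation.is_file() or validation.stat().st_size == 0:
        return False

    # Only fill train.txt when it has no content, so real data is never lost.
    if train.is_file() and train.read_text(encoding="utf-8").strip():
        return False

    shutil.copyfile(validation, train)
    return True
```

Call it with the path to your own dataset folder, then try generating the configuration again. Note that this leaves the same lines in both files, which is what the manual copy does too.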
I have the identical problem with standard English. Were you ever able to fix it? I raised a similar issue recently. I think it was caused by an update to one of the modules, but I don't know enough about the code to fix it. / G