Extremely large output file size using tortoise #366
Labels
No Label
bug
duplicate
enhancement
help wanted
insufficient info
invalid
news
not a bug
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: mrq/ai-voice-cloning#366
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I've trained multiple models on different voices and Tortoise usually outputs files that are in the 1-5 mb range for a ~10 second clip.
However one of my models generates audio files that are over 100 mb in size for the same prompts. This is consistent across all checkpoints of the model and different prompts. Inference quality and speed are similar to other models. Are there any factors that might cause this to happen?
Verify that you have
Embed Output Metadata
disabled in the settings.