Great job! #72
Labels
No Label
bug
duplicate
enhancement
help wanted
insufficient info
invalid
news
not a bug
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: mrq/ai-voice-cloning#72
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
This gui and the tools here are what make tortoise usable. Any AI project that doesn't have something like this is missing out on a lot of usability. Fantastic work!!!
I'll second the motion on some kind of batching system. The nature of audio work is that you get a long list of sentences to be spoken by a character and you just want the machine to munch on them for awhile.
Maybe point the AI at a text list of separated sentences, then have it output to a folder with file names like Kennedy_01A, Kennedy_01B Kennedy_01C for the first sentence and 3 candidates, then Kennedy_02A for second sentence, first candidate and so on. Then you can check them out later at your leisure.
The other item on my wishlist is voice generation from thin air. Don't know how you would go about it, but it would be nice!
I see a few bugs here and there, I'll see if I can do my bit and report some.
Unless I'm misinterpreting, that should already be covered with the
Line Delimiter
option where it'll split up by lines.It'll be formatted as
./results/{voice}/{voice}_{index}_{line}_{candidate}.wav
, and combined at the end into./results/{voice}/{voice}_{index}_{candidate}_combined.wav
(where index is the generation number,_{line}
is ignored if only 1 line,_{candidate}
is ignored if one candidate).Although desu, maybe lettering would be better for candidates, as the appended numbering can get quite bothersome.
The closest would be the
random
voice option, where it'll generate a new voice without any specific input data. From a cursory glance at the code, I guess it samples from a model of existing latents to generate a new one.