Block a user
generating voice clip is so much slower compared to using original Tortoise TTS
The bottleneck is largely at the sample generation, afaik. Because higher quality outputs necessarily require more inference time, that's the precise trade-off, and cutting corners, other than…
generating voice clip is so much slower compared to using original Tortoise TTS
Yeah, a fresh install with fresh settings will take ages on the initial run. All these things will definitely eat up time:
- download several models (the AR, the diffusion, the CLVP, the…