https://git.ecker.tech/ aims to provide a place to share my efforts while maintaining true ownership of my code, as I do not trust GitHub.
XMR: 4B9TQdkAkBFYrbj5ztvTx89e5LpucPeTSPzemCihdDi9EBnx7btn8RDNZTBz2zihWsjMnDkzn5As1LU6gLv3KQy8BLsZ8SG
- Joined on 2022-10-10
Is this because the input is too long? Should I split it up with the line delimiter?
No stop tokens found in one of the generated voice clips. This typically means the spoken audio is too…
I feel rather silly.
I imagine the lifeiteng/vall-e implementation had the right idea with having an (almost) single model that handles both AR and NAR…
mmm... I guess I'm due for a bit of an update.
- 15% through my dataset's "epoch" (out of ~5640894 samples, I think I should have a better metric that should calculate how many iterations it will…
DeepSpeed isn't available natively on Windows without jumping through a nightmare of hoops to get it compiled and installed. The best bet is to have an environment under WSL2, but you're still under…