1
1
forked from mrq/tortoise-tts
Commit Graph

13 Commits

Author SHA1 Message Date
James Betker
4aab81b074 implement clip-guided generation (and never use it...) 2022-04-14 21:50:57 -06:00
James Betker
cf80d7317c Remove intelligibility refinement
It's not longer a concern. :)
2022-04-13 17:04:19 -06:00
James Betker
2eb5d4b0cb Update sweep & eval_multiple with new voices 2022-04-13 17:03:36 -06:00
James Betker
732deaa212 support latents into the diffusion decoder 2022-04-12 20:53:09 -06:00
James Betker
5988aa34eb Updates 2022-04-12 16:40:42 -06:00
James Betker
31cb602e07 support presets for generation 2022-04-10 23:19:15 -06:00
James Betker
7e29c68336 Clip diffusion inputs 2022-04-10 19:29:32 -06:00
James Betker
57ffdeff78 Updates 2022-04-10 14:41:13 -06:00
James Betker
81f6ea1afa integrate new autoregressive model and fix new diffusion bug 2022-04-04 16:51:35 -06:00
James Betker
4747fae381 Integrate new diffusion network 2022-04-01 14:15:17 -06:00
James Betker
d89c51a71c port do_tts to use the API 2022-04-01 11:55:07 -06:00
James Betker
57c2ce7040 Update API to have more expressive interface for controlling various generation knobs
- Also adds typical decoder support; unfortunately this does not work well with the current model.
2022-03-29 13:59:39 -06:00
James Betker
f1adc12505 Upgrade CLIP model and add eval_multiple 2022-03-28 19:33:31 -06:00