tortoise-tts

Author	SHA1	Message	Date
Johan Nordberg	d8f98c07b4	Remove some assumptions about working directory. This allows cli tool to run when not standing in repository dir	2022-05-29 01:10:19 +00:00
Johan Nordberg	9f6ae0f0b3	Add tortoise_cli.py	2022-05-28 05:25:23 +00:00
James Betker	3f7386d442	Merge pull request #68 from space-pope/fix-default-arg: avoid mutable default in aligner	2022-05-26 15:59:43 -06:00
Josh Ziegler	5b0e50eaa6	avoid mutable default in aligner	2022-05-26 16:20:09 -04:00
James Betker	f56f3d5468	Fix import issue for CVVP	2022-05-26 08:44:20 -06:00
James Betker	3acca1445a	Merge pull request #64 from jnordberg/revive-cvvp: Revive CVVP model	2022-05-25 15:59:09 -06:00
Johan Nordberg	b681fa9d11	Skip CLVP if cvvp_amount is 1. Also fixes formatting bug in log message	2022-05-25 11:12:53 +00:00
Johan Nordberg	a52e3026ba	Revive CVVP model	2022-05-25 10:22:50 +00:00
James Betker	7f9f1dbfc3	Fix bug	2022-05-22 05:50:26 -06:00
James Betker	e118785aaf	Support combining voices in do_tts	2022-05-22 05:28:15 -06:00
James Betker	e882484c4a	Update read.py to support multiple candidates	2022-05-22 05:26:01 -06:00
James Betker	8feb18b03f	Merge remote-tracking branch 'origin/main'	2022-05-22 05:13:50 -06:00
James Betker	12a767c7f5	Commit comparisons with naturalspeech. This is the first TTS engine I've seen come along that has comparable performance to Tortoise, though what has been released is pretty sparse on actual results. Still, it's an interesting comparison.	2022-05-22 05:13:08 -06:00
James Betker	eae0414f94	Merge pull request #58 from kwibjo/main: Update README.md	2022-05-21 10:41:55 -06:00
Jai Mu	5bff5dd819	Update README.md. Useless update but it was bothering me.	2022-05-22 00:56:06 +09:30
James Betker	b98860552a	Merge pull request #57 from wavymulder/main: Updated train_lescault voices	2022-05-19 15:56:47 -06:00
Tristan Drake	cdc138a3df	Updated lescault voices	2022-05-19 17:39:25 -04:00
James Betker	f4bd9c4dd0	Fix faulty merge	2022-05-19 10:37:57 -06:00
James Betker	1a8c9f741a	Merge remote-tracking branch 'origin/main' (conflicts: tortoise/read.py)	2022-05-19 10:34:54 -06:00
James Betker	6d3157ebff	Remove faulty 3rd example for train_mouse	2022-05-19 10:30:02 -06:00
James Betker	550874cbec	Update broken train_empire voice	2022-05-19 10:26:46 -06:00
James Betker	2ba8d5bf97	Update requirements to specify version of transformers	2022-05-19 10:22:04 -06:00
James Betker	4641933d74	Merge pull request #55 from jnordberg/models-dir: Make models dir configurable	2022-05-19 09:51:21 -06:00
Johan Nordberg	e34ffca8fb	Allow passing additional voice directories when loading voices	2022-05-19 21:02:11 +09:00
Johan Nordberg	20220893af	Allow setting models path from environment variable	2022-05-19 21:02:09 +09:00
James Betker	8139afd0e5	Remove CVVP. After training a similar model for a different purpose, I realized that this model is faulty: the contrastive loss it uses only pays attention to high-frequency details which do not contribute meaningfully to output quality. I validated this by comparing a no-CVVP output with a baseline using tts-scores and found no differences.	2022-05-17 12:21:25 -06:00
James Betker	5d5aacc38c	v2.4	2022-05-17 12:15:13 -06:00
James Betker	aef86d21bf	Add a way to get deterministic behavior from tortoise and add debug states for reporting	2022-05-17 12:11:18 -06:00
James Betker	9eac62598a	Merge remote-tracking branch 'origin/main'	2022-05-17 11:22:40 -06:00
James Betker	24612f81c2	Add chapter 1 of GoT for read.py demos	2022-05-17 11:21:57 -06:00
James Betker	160963b105	Add conditioning latent example	2022-05-17 11:21:37 -06:00
James Betker	b5fc8f198b	Merge pull request #49 from faad3/main: Fix bug in load_voices in audio.py	2022-05-17 11:20:44 -06:00
Danila Berezin	dc3d7b1667	Fix bug in load_voices in audio.py. The read.py script did not work with pth latents, so I fix bug in audio.py. It seems that in the elif statement, instead of voice, voices should be clip, clips. And torch stack doesn't work with tuples, so I had to split this operation.	2022-05-17 18:34:54 +03:00
James Betker	11e80b0dae	Merge pull request #42 from jnordberg/main: Improve sentence splitting	2022-05-14 08:52:46 -06:00
James Betker	50690e4465	Automatically pick batch size based on available GPU memory	2022-05-13 10:30:02 -06:00
James Betker	cb7adf16af	Remove samples_generator. Was useful but the page is more detailed now.	2022-05-13 10:28:16 -06:00
Johan Nordberg	5197904660	Improve sentence splitting	2022-05-13 11:02:17 +00:00
James Betker	8c0b3855bf	Release notes for 2.3	2022-05-12 20:26:24 -06:00
James Betker	1a4f0fa350	update model paths (including clvp2!)	2022-05-12 20:18:11 -06:00
James Betker	75b0e03ab3	Add error message	2022-05-12 20:15:40 -06:00
James Betker	ec16c0208c	add eval script for testing	2022-05-12 20:15:22 -06:00
James Betker	7d5e7dbba8	CLVP2!	2022-05-12 13:23:03 -06:00
James Betker	fda5130819	Add support for multiple output candidates in do_tts.	2022-05-12 11:25:35 -06:00
James Betker	6ed77b0ea4	update examples	2022-05-12 11:25:03 -06:00
James Betker	0005d02940	read.py: allow user-specified splits	2022-05-12 11:24:55 -06:00
James Betker	945bd88f21	Merge pull request #36 from e0xextazy/main: Optimizing graphics card memory	2022-05-11 21:46:16 -06:00
Mark Baushenko	cbccc5e953	Optimizing graphics card memory. During inference it does not store gradients, which take up most of the video memory	2022-05-11 16:35:11 +03:00
James Betker	cfd436be00	update requirements to (hopefully) provide better compatibility. man python deps are garbage.....	2022-05-10 23:42:37 -06:00
James Betker	609e3a26c6	whoops	2022-05-10 18:25:14 -06:00
James Betker	eae2b92385	Upload finetuned ljspeech examples	2022-05-10 18:23:24 -06:00

166 Commits
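
Commit 5b0e50eaa6 ("avoid mutable default in aligner") names a common Python pitfall without showing it. The sketch below illustrates the general pattern only; it is not the aligner's actual code, and the function names collect_buggy and collect_fixed and their parameters are hypothetical, chosen for illustration.

```python
from typing import List, Optional


def collect_buggy(token: str, seen: List[str] = []) -> List[str]:
    # Hypothetical example: the default list is created once, when the
    # function is defined, so every call that omits `seen` appends to
    # the same shared object.
    seen.append(token)
    return seen


def collect_fixed(token: str, seen: Optional[List[str]] = None) -> List[str]:
    # Hypothetical example: use None as a sentinel and build a fresh
    # list on each call, so no state leaks between calls.
    if seen is None:
        seen = []
    seen.append(token)
    return seen
```

With the buggy version, two calls that rely on the default return the very same list object; with the fixed version, each such call starts from an empty list.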