Commit Graph

156 Commits

Author SHA1 Message Date
James Betker
e882484c4a Update read.py to support multiple candidates 2022-05-22 05:26:01 -06:00
James Betker
8feb18b03f Merge remote-tracking branch 'origin/main' 2022-05-22 05:13:50 -06:00
James Betker
12a767c7f5 Commit comparisons with naturalspeech
This is the first TTS engine I've seen come along that has comparable performance
to Tortoise, though what has been released is pretty sparse on actual results. Still,
it's an interesting comparison.
2022-05-22 05:13:08 -06:00
James Betker
eae0414f94
Merge pull request #58 from kwibjo/main
Update README.md
2022-05-21 10:41:55 -06:00
Jai Mu
5bff5dd819
Update README.md
Useless update but it was bothering me.
2022-05-22 00:56:06 +09:30
James Betker
b98860552a
Merge pull request #57 from wavymulder/main
Updated train_lescault voices
2022-05-19 15:56:47 -06:00
Tristan Drake
cdc138a3df Updated lescault voices 2022-05-19 17:39:25 -04:00
James Betker
f4bd9c4dd0 Fix faulty merge 2022-05-19 10:37:57 -06:00
James Betker
1a8c9f741a Merge remote-tracking branch 'origin/main'
# Conflicts:
#	tortoise/read.py
2022-05-19 10:34:54 -06:00
James Betker
6d3157ebff Remove faulty 3rd example for train_mouse 2022-05-19 10:30:02 -06:00
James Betker
550874cbec Update broken train_empire voice 2022-05-19 10:26:46 -06:00
James Betker
2ba8d5bf97 Update requirements to specify version of transformers 2022-05-19 10:22:04 -06:00
James Betker
4641933d74
Merge pull request #55 from jnordberg/models-dir
Make models dir configurable
2022-05-19 09:51:21 -06:00
Johan Nordberg
e34ffca8fb Allow passing additional voice directories when loading voices 2022-05-19 21:02:11 +09:00
Johan Nordberg
20220893af Allow setting models path from environment variable 2022-05-19 21:02:09 +09:00
James Betker
8139afd0e5 Remove CVVP
After training a similar model for a different purpose, I realized that
this model is faulty: the contrastive loss it uses only pays attention
to high-frequency details which do not contribute meaningfully to
output quality. I validated this by comparing a no-CVVP output with
a baseline using tts-scores and found no differences.
2022-05-17 12:21:25 -06:00
James Betker
5d5aacc38c v2.4 2022-05-17 12:15:13 -06:00
James Betker
aef86d21bf Add a way to get deterministic behavior from tortoise and add debug states for reporting 2022-05-17 12:11:18 -06:00
James Betker
9eac62598a Merge remote-tracking branch 'origin/main' 2022-05-17 11:22:40 -06:00
James Betker
24612f81c2 Add chapter 1 of GoT for read.py demos 2022-05-17 11:21:57 -06:00
James Betker
160963b105 Add conditioning latent example 2022-05-17 11:21:37 -06:00
James Betker
b5fc8f198b
Merge pull request #49 from faad3/main
Fix bug in load_voices in audio.py
2022-05-17 11:20:44 -06:00
Danila Berezin
dc3d7b1667
Fix bug in load_voices in audio.py
The read.py script did not work with pth latents, so I fix bug in audio.py. It seems that in the elif statement, instead of voice, voices should be clip, clips. And torch stack doesn't work with tuples, so I had to split this operation.
2022-05-17 18:34:54 +03:00
James Betker
11e80b0dae
Merge pull request #42 from jnordberg/main
Improve sentence splitting
2022-05-14 08:52:46 -06:00
James Betker
50690e4465 Automatically pick batch size based on available GPU memory 2022-05-13 10:30:02 -06:00
James Betker
cb7adf16af Remove samples_generator
Was useful but the page is more detailed now.
2022-05-13 10:28:16 -06:00
Johan Nordberg
5197904660 Improve sentence splitting 2022-05-13 11:02:17 +00:00
James Betker
8c0b3855bf Release notes for 2.3 2022-05-12 20:26:24 -06:00
James Betker
1a4f0fa350 update model paths (including clvp2!) 2022-05-12 20:18:11 -06:00
James Betker
75b0e03ab3 Add error message 2022-05-12 20:15:40 -06:00
James Betker
ec16c0208c add eval script for testing 2022-05-12 20:15:22 -06:00
James Betker
7d5e7dbba8 CLVP2! 2022-05-12 13:23:03 -06:00
James Betker
fda5130819 Add support for multiple output candidates in do_tts. 2022-05-12 11:25:35 -06:00
James Betker
6ed77b0ea4 update examples 2022-05-12 11:25:03 -06:00
James Betker
0005d02940 read.py: allow user-specified splits 2022-05-12 11:24:55 -06:00
James Betker
945bd88f21
Merge pull request #36 from e0xextazy/main
Optimizing graphics card memory
2022-05-11 21:46:16 -06:00
Mark Baushenko
cbccc5e953
Optimizing graphics card memory
During inference it does not store gradients, which take up most of the video memory
2022-05-11 16:35:11 +03:00
James Betker
cfd436be00 update requirements to (hopefully) provide better compatibility
man python deps are garbage.....
2022-05-10 23:42:37 -06:00
James Betker
609e3a26c6 whoops 2022-05-10 18:25:14 -06:00
James Betker
eae2b92385 Upload finetuned ljspeech examples 2022-05-10 18:23:24 -06:00
James Betker
099bf8363c Update README with suggestions for windows installation 2022-05-08 20:44:44 -06:00
James Betker
317d55c252 re-enable redaction 2022-05-06 09:36:42 -06:00
James Betker
8672075914 temporarily disable redaction 2022-05-06 09:06:20 -06:00
James Betker
d03b8f3f62 Update rainbow 2022-05-06 08:55:08 -06:00
James Betker
ffd0238a16 v2.2 2022-05-06 00:11:10 -06:00
James Betker
b327be56c6 Update readme with prompt engineering 2022-05-03 21:32:06 -06:00
James Betker
ddb19f6b0f Enable redaction by default 2022-05-03 21:21:52 -06:00
James Betker
c1d004aeb0 change quality presets 2022-05-03 21:01:26 -06:00
James Betker
ee6f9b15ce Use librosa for loading mp3s 2022-05-03 20:44:31 -06:00
James Betker
2ca4ea9948 Update notebook 2022-05-02 21:57:19 -06:00