Johan Nordberg
a52e3026ba
Revive CVVP model
2022-05-25 10:22:50 +00:00
James Betker
7f9f1dbfc3
Fix bug
2022-05-22 05:50:26 -06:00
James Betker
e118785aaf
Support combining voices in do_tts
2022-05-22 05:28:15 -06:00
James Betker
e882484c4a
Update read.py to support multiple candidates
2022-05-22 05:26:01 -06:00
James Betker
8feb18b03f
Merge remote-tracking branch 'origin/main'
2022-05-22 05:13:50 -06:00
James Betker
12a767c7f5
Commit comparisons with naturalspeech
This is the first TTS engine I've seen come along that has comparable performance
to Tortoise, though what has been released is pretty sparse on actual results. Still,
it's an interesting comparison.
2022-05-22 05:13:08 -06:00
James Betker
eae0414f94
Merge pull request #58 from kwibjo/main
Update README.md
2022-05-21 10:41:55 -06:00
Jai Mu
5bff5dd819
Update README.md
Useless update but it was bothering me.
2022-05-22 00:56:06 +09:30
James Betker
b98860552a
Merge pull request #57 from wavymulder/main
Updated train_lescault voices
2022-05-19 15:56:47 -06:00
Tristan Drake
cdc138a3df
Updated lescault voices
2022-05-19 17:39:25 -04:00
James Betker
f4bd9c4dd0
Fix faulty merge
2022-05-19 10:37:57 -06:00
James Betker
1a8c9f741a
Merge remote-tracking branch 'origin/main'
# Conflicts:
# tortoise/read.py
2022-05-19 10:34:54 -06:00
James Betker
6d3157ebff
Remove faulty 3rd example for train_mouse
2022-05-19 10:30:02 -06:00
James Betker
550874cbec
Update broken train_empire voice
2022-05-19 10:26:46 -06:00
James Betker
2ba8d5bf97
Update requirements to specify version of transformers
2022-05-19 10:22:04 -06:00
James Betker
4641933d74
Merge pull request #55 from jnordberg/models-dir
Make models dir configurable
2022-05-19 09:51:21 -06:00
Johan Nordberg
e34ffca8fb
Allow passing additional voice directories when loading voices
2022-05-19 21:02:11 +09:00
Johan Nordberg
20220893af
Allow setting models path from environment variable
2022-05-19 21:02:09 +09:00
James Betker
8139afd0e5
Remove CVVP
After training a similar model for a different purpose, I realized that
this model is faulty: the contrastive loss it uses only pays attention
to high-frequency details which do not contribute meaningfully to
output quality. I validated this by comparing a no-CVVP output with
a baseline using tts-scores and found no differences.
2022-05-17 12:21:25 -06:00
James Betker
5d5aacc38c
v2.4
2022-05-17 12:15:13 -06:00
James Betker
aef86d21bf
Add a way to get deterministic behavior from tortoise and add debug states for reporting
2022-05-17 12:11:18 -06:00
James Betker
9eac62598a
Merge remote-tracking branch 'origin/main'
2022-05-17 11:22:40 -06:00
James Betker
24612f81c2
Add chapter 1 of GoT for read.py demos
2022-05-17 11:21:57 -06:00
James Betker
160963b105
Add conditioning latent example
2022-05-17 11:21:37 -06:00
James Betker
b5fc8f198b
Merge pull request #49 from faad3/main
Fix bug in load_voices in audio.py
2022-05-17 11:20:44 -06:00
Danila Berezin
dc3d7b1667
Fix bug in load_voices in audio.py
The read.py script did not work with .pth latents, so I fixed the bug in audio.py. In the elif statement, voice and voices should have been clip and clips. Also, torch.stack doesn't work with tuples, so I had to split this operation.
2022-05-17 18:34:54 +03:00
James Betker
11e80b0dae
Merge pull request #42 from jnordberg/main
Improve sentence splitting
2022-05-14 08:52:46 -06:00
James Betker
50690e4465
Automatically pick batch size based on available GPU memory
2022-05-13 10:30:02 -06:00
James Betker
cb7adf16af
Remove samples_generator
Was useful but the page is more detailed now.
2022-05-13 10:28:16 -06:00
Johan Nordberg
5197904660
Improve sentence splitting
2022-05-13 11:02:17 +00:00
James Betker
8c0b3855bf
Release notes for 2.3
2022-05-12 20:26:24 -06:00
James Betker
1a4f0fa350
update model paths (including clvp2!)
2022-05-12 20:18:11 -06:00
James Betker
75b0e03ab3
Add error message
2022-05-12 20:15:40 -06:00
James Betker
ec16c0208c
add eval script for testing
2022-05-12 20:15:22 -06:00
James Betker
7d5e7dbba8
CLVP2!
2022-05-12 13:23:03 -06:00
James Betker
fda5130819
Add support for multiple output candidates in do_tts.
2022-05-12 11:25:35 -06:00
James Betker
6ed77b0ea4
update examples
2022-05-12 11:25:03 -06:00
James Betker
0005d02940
read.py: allow user-specified splits
2022-05-12 11:24:55 -06:00
James Betker
945bd88f21
Merge pull request #36 from e0xextazy/main
Optimizing graphics card memory
2022-05-11 21:46:16 -06:00
Mark Baushenko
cbccc5e953
Optimizing graphics card memory
Gradients are no longer stored during inference; they took up most of the video memory.
2022-05-11 16:35:11 +03:00
James Betker
cfd436be00
update requirements to (hopefully) provide better compatibility
Man, Python deps are garbage...
2022-05-10 23:42:37 -06:00
James Betker
609e3a26c6
whoops
2022-05-10 18:25:14 -06:00
James Betker
eae2b92385
Upload finetuned ljspeech examples
2022-05-10 18:23:24 -06:00
James Betker
099bf8363c
Update README with suggestions for windows installation
2022-05-08 20:44:44 -06:00
James Betker
317d55c252
re-enable redaction
2022-05-06 09:36:42 -06:00
James Betker
8672075914
temporarily disable redaction
2022-05-06 09:06:20 -06:00
James Betker
d03b8f3f62
Update rainbow
2022-05-06 08:55:08 -06:00
James Betker
ffd0238a16
v2.2
2022-05-06 00:11:10 -06:00
James Betker
b327be56c6
Update readme with prompt engineering
2022-05-03 21:32:06 -06:00
James Betker
ddb19f6b0f
Enable redaction by default
2022-05-03 21:21:52 -06:00