James Betker
870b2d2fc2
Merge pull request #70 from jnordberg/sentence-split-improve
...
Improve sentence boundary detection
2022-05-28 11:03:43 -06:00
Johan Nordberg
9f6ae0f0b3
Add tortoise_cli.py
2022-05-28 05:25:23 +00:00
Johan Nordberg
561ae9a31e
Typofix
2022-05-28 01:29:34 +00:00
Johan Nordberg
6a71d90316
Improve splitting on text that has many quotes
2022-05-28 01:22:21 +00:00
Johan Nordberg
f199d6b85c
Add riding hood test
...
Also fix a bug discovered by the test that would seek past the end of the text if the text ended in a boundary
2022-05-27 23:08:53 +00:00
Johan Nordberg
b294f0217f
Improve sentence boundary detection
2022-05-27 05:58:09 +00:00
James Betker
3f7386d442
Merge pull request #68 from space-pope/fix-default-arg
...
avoid mutable default in aligner
2022-05-26 15:59:43 -06:00
Josh Ziegler
5b0e50eaa6
avoid mutable default in aligner
2022-05-26 16:20:09 -04:00
James Betker
f56f3d5468
Fix import issue for CVVP
2022-05-26 08:44:20 -06:00
James Betker
3acca1445a
Merge pull request #64 from jnordberg/revive-cvvp
...
Revive CVVP model
2022-05-25 15:59:09 -06:00
Johan Nordberg
b681fa9d11
Skip CLVP if cvvp_amount is 1
...
Also fixes formatting bug in log message
2022-05-25 11:12:53 +00:00
Johan Nordberg
a52e3026ba
Revive CVVP model
2022-05-25 10:22:50 +00:00
James Betker
7f9f1dbfc3
Fix bug
2022-05-22 05:50:26 -06:00
James Betker
e118785aaf
Support combining voices in do_tts
2022-05-22 05:28:15 -06:00
James Betker
e882484c4a
Update read.py to support multiple candidates
2022-05-22 05:26:01 -06:00
James Betker
8feb18b03f
Merge remote-tracking branch 'origin/main'
2022-05-22 05:13:50 -06:00
James Betker
12a767c7f5
Commit comparisons with naturalspeech
...
This is the first TTS engine I've seen come along that has comparable performance
to Tortoise, though what has been released is pretty sparse on actual results. Still,
it's an interesting comparison.
2022-05-22 05:13:08 -06:00
James Betker
eae0414f94
Merge pull request #58 from kwibjo/main
...
Update README.md
2022-05-21 10:41:55 -06:00
Jai Mu
5bff5dd819
Update README.md
...
Useless update but it was bothering me.
2022-05-22 00:56:06 +09:30
James Betker
b98860552a
Merge pull request #57 from wavymulder/main
...
Updated train_lescault voices
2022-05-19 15:56:47 -06:00
Tristan Drake
cdc138a3df
Updated lescault voices
2022-05-19 17:39:25 -04:00
James Betker
f4bd9c4dd0
Fix faulty merge
2022-05-19 10:37:57 -06:00
James Betker
1a8c9f741a
Merge remote-tracking branch 'origin/main'
...
# Conflicts:
# tortoise/read.py
2022-05-19 10:34:54 -06:00
James Betker
6d3157ebff
Remove faulty 3rd example for train_mouse
2022-05-19 10:30:02 -06:00
James Betker
550874cbec
Update broken train_empire voice
2022-05-19 10:26:46 -06:00
James Betker
2ba8d5bf97
Update requirements to specify version of transformers
2022-05-19 10:22:04 -06:00
James Betker
4641933d74
Merge pull request #55 from jnordberg/models-dir
...
Make models dir configurable
2022-05-19 09:51:21 -06:00
Johan Nordberg
e34ffca8fb
Allow passing additional voice directories when loading voices
2022-05-19 21:02:11 +09:00
Johan Nordberg
20220893af
Allow setting models path from environment variable
2022-05-19 21:02:09 +09:00
James Betker
8139afd0e5
Remove CVVP
...
After training a similar model for a different purpose, I realized that
this model is faulty: the contrastive loss it uses only pays attention
to high-frequency details which do not contribute meaningfully to
output quality. I validated this by comparing a no-CVVP output with
a baseline using tts-scores and found no differences.
2022-05-17 12:21:25 -06:00
James Betker
5d5aacc38c
v2.4
2022-05-17 12:15:13 -06:00
James Betker
aef86d21bf
Add a way to get deterministic behavior from tortoise and add debug states for reporting
2022-05-17 12:11:18 -06:00
James Betker
9eac62598a
Merge remote-tracking branch 'origin/main'
2022-05-17 11:22:40 -06:00
James Betker
24612f81c2
Add chapter 1 of GoT for read.py demos
2022-05-17 11:21:57 -06:00
James Betker
160963b105
Add conditioning latent example
2022-05-17 11:21:37 -06:00
James Betker
b5fc8f198b
Merge pull request #49 from faad3/main
...
Fix bug in load_voices in audio.py
2022-05-17 11:20:44 -06:00
Danila Berezin
dc3d7b1667
Fix bug in load_voices in audio.py
...
The read.py script did not work with pth latents, so I fixed the bug in audio.py. It seems that in the elif statement, "voice, voices" should be "clip, clips". Also, torch.stack doesn't work with tuples, so I had to split this operation.
2022-05-17 18:34:54 +03:00
James Betker
11e80b0dae
Merge pull request #42 from jnordberg/main
...
Improve sentence splitting
2022-05-14 08:52:46 -06:00
James Betker
50690e4465
Automatically pick batch size based on available GPU memory
2022-05-13 10:30:02 -06:00
James Betker
cb7adf16af
Remove samples_generator
...
Was useful but the page is more detailed now.
2022-05-13 10:28:16 -06:00
Johan Nordberg
5197904660
Improve sentence splitting
2022-05-13 11:02:17 +00:00
James Betker
8c0b3855bf
Release notes for 2.3
2022-05-12 20:26:24 -06:00
James Betker
1a4f0fa350
update model paths (including clvp2!)
2022-05-12 20:18:11 -06:00
James Betker
75b0e03ab3
Add error message
2022-05-12 20:15:40 -06:00
James Betker
ec16c0208c
add eval script for testing
2022-05-12 20:15:22 -06:00
James Betker
7d5e7dbba8
CLVP2!
2022-05-12 13:23:03 -06:00
James Betker
fda5130819
Add support for multiple output candidates in do_tts.
2022-05-12 11:25:35 -06:00
James Betker
6ed77b0ea4
update examples
2022-05-12 11:25:03 -06:00
James Betker
0005d02940
read.py: allow user-specified splits
2022-05-12 11:24:55 -06:00
James Betker
945bd88f21
Merge pull request #36 from e0xextazy/main
...
Optimizing graphics card memory
2022-05-11 21:46:16 -06:00