James Betker
|
d6a73acaed
|
Allow processing of multiple audio sources at once from nv_tacotron_dataset
|
2021-08-14 16:04:05 -06:00 |
|
James Betker
|
007976082b
|
GPT_asr for inference
|
2021-08-14 14:37:17 -06:00 |
|
James Betker
|
f5a9b88ef6
|
tacotron cleaners: remove quotation marks
these don't really have relevance for tts or asr
|
2021-08-11 16:18:44 -06:00 |
|
James Betker
|
d120e1aa99
|
Add audio augmentation to wavfile_dataset, utility to test audio similary
|
2021-08-05 22:14:49 -06:00 |
|
James Betker
|
398185e109
|
More work on wave-diffusion
|
2021-07-27 05:36:17 -06:00 |
|
James Betker
|
49e3b310ea
|
Allow audio sample rate interpolation for faster training
|
2021-07-26 17:44:06 -06:00 |
|
James Betker
|
96e90e7047
|
Add support for a gaussian-diffusion-based wave tacotron
|
2021-07-26 16:27:31 -06:00 |
|
James Betker
|
5584cfcc7a
|
tacotron2 work
|
2021-07-14 21:41:57 -06:00 |
|
James Betker
|
fe0c699ced
|
Various fixes
|
2021-07-14 00:08:42 -06:00 |
|
James Betker
|
1ff434218e
|
tacotron2, ready for prime time!
|
2021-07-08 22:13:44 -06:00 |
|
James Betker
|
86fd3ad7fd
|
Initial checkin of nvidia tacotron model & dataset
These two are tested, full support for training to come.
|
2021-07-06 11:11:35 -06:00 |
|