James Betker
|
ed6eae407f
|
More scripts for splitting and formatting audio
|
2021-08-30 21:20:52 -06:00 |
|
James Betker
|
1fede41b7b
|
Audio segmentor
|
2021-08-16 22:51:53 -06:00 |
|
James Betker
|
3580c52eac
|
Fix up wavfile_dataset to be able to provide a full clip
|
2021-08-15 20:53:26 -06:00 |
|
James Betker
|
81e91c99de
|
Misc
|
2021-08-13 13:58:59 -06:00 |
|
James Betker
|
a2afb25e42
|
Fix inference, always flow full text tokens through transformer
|
2021-08-07 20:11:10 -06:00 |
|
James Betker
|
a7496b661c
|
combined dvae ftw
|
2021-08-06 22:01:06 -06:00 |
|
James Betker
|
b43683b772
|
Add lucidrains_dvae
|
2021-08-06 12:03:46 -06:00 |
|
James Betker
|
62c7570512
|
Constrain wav_aug a bit more
|
2021-08-06 08:19:38 -06:00 |
|
James Betker
|
c0f61a2e15
|
Rework how DVAE tokens are ordered
It might make more sense to have top tokens, then bottom tokens
with top tokens having different discretized values.
|
2021-08-05 07:07:17 -06:00 |
|
James Betker
|
36c7c1fbdb
|
Fix training flow for NEXT TOKEN prediction instead of same token prediction
doh
|
2021-08-04 10:28:09 -06:00 |
|
James Betker
|
4c98b9703f
|
Get dalle-style TTS to "work"
|
2021-08-03 21:08:27 -06:00 |
|
James Betker
|
0c9e75bc69
|
Improvements to GptTts
|
2021-07-31 15:57:57 -06:00 |
|
James Betker
|
31ee9ae262
|
Checkin
|
2021-07-30 23:07:35 -06:00 |
|
James Betker
|
2325e7a88c
|
Allow inference for vqvae
|
2021-07-20 10:40:05 -06:00 |
|
James Betker
|
be2745f42d
|
Add waveglow & inference capabilities to audio generator
|
2021-07-08 23:07:36 -06:00 |
|