James Betker
|
64c7582bf5
|
full pipeline
|
2022-04-28 22:47:26 -06:00 |
|
James Betker
|
ab8176b217
|
audio prep misc
|
2022-04-28 10:08:38 -06:00 |
|
James Betker
|
084b1c1527
|
file splitter
|
2022-04-20 00:27:49 -06:00 |
|
James Betker
|
6fc4f49e86
|
some dumb stuff
|
2022-04-07 11:32:34 -06:00 |
|
James Betker
|
0070867d0f
|
inference script for diffusion image models
|
2022-03-26 22:48:24 -06:00 |
|
James Betker
|
be5f052255
|
misc
|
2022-03-22 11:40:56 -06:00 |
|
James Betker
|
3692c4cae3
|
map vocoder into cpu
|
2022-03-21 17:10:57 -06:00 |
|
James Betker
|
c5000420f6
|
more arbitrary fixes
|
2022-03-17 17:45:44 -06:00 |
|
James Betker
|
95ea0a592f
|
More cleaning
|
2022-03-16 12:05:56 -06:00 |
|
James Betker
|
d186414566
|
More spring cleaning
|
2022-03-16 12:04:00 -06:00 |
|
James Betker
|
735f6e4640
|
Move gen_similarities and rename
|
2022-03-16 11:59:34 -06:00 |
|
James Betker
|
54202aa099
|
fix mel normalization
|
2022-03-16 09:26:55 -06:00 |
|
James Betker
|
3f244f6a68
|
add mel_norm to std injector
|
2022-03-15 22:16:59 -06:00 |
|
James Betker
|
f563a8dd41
|
fixes
|
2022-03-15 21:43:00 -06:00 |
|
James Betker
|
1e3a8554a1
|
updates to audio_diffusion_fid
|
2022-03-15 11:35:09 -06:00 |
|
James Betker
|
9c6f776980
|
Add univnet vocoder
|
2022-03-15 11:34:51 -06:00 |
|
James Betker
|
7929fd89de
|
Refactor audio-style models into the audio folder
|
2022-03-15 11:06:25 -06:00 |
|
James Betker
|
f95d3d2b82
|
move waveglow to audio/vocoders
|
2022-03-15 11:03:07 -06:00 |
|
James Betker
|
0419a64107
|
misc
|
2022-03-15 10:36:34 -06:00 |
|
James Betker
|
eecbc0e678
|
Use wider spectrogram when asked
|
2022-03-15 10:35:11 -06:00 |
|
James Betker
|
896accb71f
|
data and prep improvements
|
2022-03-12 15:10:11 -07:00 |
|
James Betker
|
7dabc17626
|
phase2 filter initial commit
|
2022-03-08 15:51:55 -07:00 |
|
James Betker
|
b3def182de
|
move processing pipeline to "phase_1"
|
2022-03-08 15:49:51 -07:00 |
|
James Betker
|
2134f06516
|
Implement conditioning-free diffusion at the eval level
|
2022-02-27 15:11:42 -07:00 |
|
James Betker
|
ba155e4e2f
|
script for uploading models to the HF hub
|
2022-02-27 14:48:38 -07:00 |
|
James Betker
|
e6824e398f
|
Load dvae to cpu
|
2022-02-23 21:21:45 -07:00 |
|
James Betker
|
68726eac74
|
.
|
2022-02-23 17:58:07 -07:00 |
|
James Betker
|
58f6c9805b
|
adf
|
2022-02-22 23:12:58 -07:00 |
|
James Betker
|
52b61b9f77
|
Update scripts and attempt to figure out how UnifiedVoice could be used to produce CTC codes
|
2022-02-13 20:48:06 -07:00 |
|
James Betker
|
0c3cc5ebad
|
use script updates to fix output size disparities
|
2022-02-12 20:00:46 -07:00 |
|
James Betker
|
d1d1ae32a1
|
audio diffusion frechet distance measurement!
|
2022-02-10 22:55:46 -07:00 |
|
James Betker
|
93ca619267
|
script updates
|
2022-02-09 14:26:52 -07:00 |
|
James Betker
|
9e9ae328f2
|
mild updates
|
2022-02-08 23:51:17 -07:00 |
|
James Betker
|
f44b064c5e
|
Update scripts
|
2022-02-07 19:43:18 -07:00 |
|
James Betker
|
5ae816bead
|
ctc gen checkin
|
2022-02-05 15:59:53 -07:00 |
|
James Betker
|
bb3d1ab03d
|
More cleanup
|
2022-02-04 11:06:17 -07:00 |
|
James Betker
|
7f4fc55344
|
Update SR model
|
2022-02-03 21:42:53 -07:00 |
|
James Betker
|
687393de59
|
Add a better split_on_silence (processing_pipeline)
Going to extend this a bit more going forwards to support the entire pipeline.
|
2022-02-03 20:00:26 -07:00 |
|
James Betker
|
1d29999648
|
Uupdates to the TTS production scripts
|
2022-02-03 20:00:01 -07:00 |
|
James Betker
|
fbea6e8eac
|
Adjustments to diffusion networks
|
2022-01-30 16:14:06 -07:00 |
|
James Betker
|
e0e36ed98c
|
Update use_diffuse_tts
|
2022-01-27 19:57:28 -07:00 |
|
James Betker
|
7badbf1b4d
|
update usage scripts
|
2022-01-25 17:57:26 -07:00 |
|
James Betker
|
e2ed0adbd8
|
use_diffuse_tts updates
|
2022-01-24 14:31:28 -07:00 |
|
James Betker
|
8f48848f91
|
misc
|
2022-01-22 08:23:29 -07:00 |
|
James Betker
|
ed35cfe393
|
Update inference scripts
|
2022-01-20 11:28:50 -07:00 |
|
James Betker
|
8e2439f50d
|
Decrease resolution requirements to 2048
|
2022-01-20 11:27:49 -07:00 |
|
James Betker
|
ac13bfefe8
|
use_diffuse_tts
|
2022-01-19 00:35:24 -07:00 |
|
James Betker
|
dc9cd8c206
|
Update use_gpt_tts to be usable with unified_voice2
|
2022-01-18 21:14:17 -07:00 |
|
James Betker
|
7b4544b83a
|
Add an experimental unet_diffusion_tts to perform experiments on
|
2022-01-18 08:38:24 -07:00 |
|
James Betker
|
b398ecca01
|
wer fix
|
2022-01-15 17:28:17 -07:00 |
|