James Betker
|
d3a60633a3
|
codes generation script
|
2022-06-03 11:02:28 -06:00 |
|
James Betker
|
6b43915eb8
|
support projecting to vectors
|
2022-05-28 22:27:45 -06:00 |
|
James Betker
|
b6b4f10e1b
|
...
|
2022-05-28 10:59:03 -06:00 |
|
James Betker
|
f691f5faa1
|
f
|
2022-05-27 13:47:05 -06:00 |
|
James Betker
|
31dec016e0
|
adf
|
2022-05-27 12:28:04 -06:00 |
|
James Betker
|
490d39b967
|
some stuff
|
2022-05-27 11:40:31 -06:00 |
|
James Betker
|
52a20f3aa3
|
und10
|
2022-05-25 12:19:21 -06:00 |
|
James Betker
|
48aab2babe
|
ressurect ctc code gen with some cool new ideas
|
2022-05-24 14:02:33 -06:00 |
|
James Betker
|
2da8f8a666
|
fmp
|
2022-05-23 07:06:25 -06:00 |
|
James Betker
|
ea21a8b107
|
Update music_diffusion_fid to support waveform diffusion from codes
|
2022-05-22 05:23:54 -06:00 |
|
James Betker
|
be937d202e
|
new attempt
|
2022-05-20 17:04:22 -06:00 |
|
James Betker
|
e9fb2ead9a
|
m2v stuff
|
2022-05-20 11:01:17 -06:00 |
|
James Betker
|
c9c16e3b01
|
misc updates
|
2022-05-19 13:39:32 -06:00 |
|
James Betker
|
8202b9f39c
|
some stuff
|
2022-05-15 21:50:54 -06:00 |
|
James Betker
|
ab5acead0e
|
add exp loss for diffusion models
|
2022-05-15 21:50:38 -06:00 |
|
James Betker
|
eb64d18075
|
Fix phoneme tokenizer
|
2022-05-13 17:56:26 -06:00 |
|
James Betker
|
9118f58849
|
uncomment music projector..
|
2022-05-09 09:19:26 -06:00 |
|
James Betker
|
1609101a42
|
musical gap filler
|
2022-05-05 16:47:08 -06:00 |
|
James Betker
|
e402089556
|
abstractify
|
2022-05-02 00:11:26 -06:00 |
|
James Betker
|
64c7582bf5
|
full pipeline
|
2022-04-28 22:47:26 -06:00 |
|
James Betker
|
ab8176b217
|
audio prep misc
|
2022-04-28 10:08:38 -06:00 |
|
James Betker
|
084b1c1527
|
file splitter
|
2022-04-20 00:27:49 -06:00 |
|
James Betker
|
6fc4f49e86
|
some dumb stuff
|
2022-04-07 11:32:34 -06:00 |
|
James Betker
|
0070867d0f
|
inference script for diffusion image models
|
2022-03-26 22:48:24 -06:00 |
|
James Betker
|
be5f052255
|
misc
|
2022-03-22 11:40:56 -06:00 |
|
James Betker
|
3692c4cae3
|
map vocoder into cpu
|
2022-03-21 17:10:57 -06:00 |
|
James Betker
|
c5000420f6
|
more arbitrary fixes
|
2022-03-17 17:45:44 -06:00 |
|
James Betker
|
95ea0a592f
|
More cleaning
|
2022-03-16 12:05:56 -06:00 |
|
James Betker
|
d186414566
|
More spring cleaning
|
2022-03-16 12:04:00 -06:00 |
|
James Betker
|
735f6e4640
|
Move gen_similarities and rename
|
2022-03-16 11:59:34 -06:00 |
|
James Betker
|
54202aa099
|
fix mel normalization
|
2022-03-16 09:26:55 -06:00 |
|
James Betker
|
3f244f6a68
|
add mel_norm to std injector
|
2022-03-15 22:16:59 -06:00 |
|
James Betker
|
f563a8dd41
|
fixes
|
2022-03-15 21:43:00 -06:00 |
|
James Betker
|
1e3a8554a1
|
updates to audio_diffusion_fid
|
2022-03-15 11:35:09 -06:00 |
|
James Betker
|
9c6f776980
|
Add univnet vocoder
|
2022-03-15 11:34:51 -06:00 |
|
James Betker
|
7929fd89de
|
Refactor audio-style models into the audio folder
|
2022-03-15 11:06:25 -06:00 |
|
James Betker
|
f95d3d2b82
|
move waveglow to audio/vocoders
|
2022-03-15 11:03:07 -06:00 |
|
James Betker
|
0419a64107
|
misc
|
2022-03-15 10:36:34 -06:00 |
|
James Betker
|
eecbc0e678
|
Use wider spectrogram when asked
|
2022-03-15 10:35:11 -06:00 |
|
James Betker
|
896accb71f
|
data and prep improvements
|
2022-03-12 15:10:11 -07:00 |
|
James Betker
|
7dabc17626
|
phase2 filter initial commit
|
2022-03-08 15:51:55 -07:00 |
|
James Betker
|
b3def182de
|
move processing pipeline to "phase_1"
|
2022-03-08 15:49:51 -07:00 |
|
James Betker
|
2134f06516
|
Implement conditioning-free diffusion at the eval level
|
2022-02-27 15:11:42 -07:00 |
|
James Betker
|
ba155e4e2f
|
script for uploading models to the HF hub
|
2022-02-27 14:48:38 -07:00 |
|
James Betker
|
e6824e398f
|
Load dvae to cpu
|
2022-02-23 21:21:45 -07:00 |
|
James Betker
|
68726eac74
|
.
|
2022-02-23 17:58:07 -07:00 |
|
James Betker
|
58f6c9805b
|
adf
|
2022-02-22 23:12:58 -07:00 |
|
James Betker
|
52b61b9f77
|
Update scripts and attempt to figure out how UnifiedVoice could be used to produce CTC codes
|
2022-02-13 20:48:06 -07:00 |
|
James Betker
|
0c3cc5ebad
|
use script updates to fix output size disparities
|
2022-02-12 20:00:46 -07:00 |
|
James Betker
|
d1d1ae32a1
|
audio diffusion frechet distance measurement!
|
2022-02-10 22:55:46 -07:00 |
|