James Betker
|
79e8f36d30
|
Convert CLIP models into new folder
|
2022-02-15 20:53:07 -07:00 |
|
James Betker
|
8f767b8b4f
|
...
|
2022-02-15 07:08:17 -07:00 |
|
James Betker
|
29e07913a8
|
Fix
|
2022-02-15 06:58:11 -07:00 |
|
James Betker
|
dd585df772
|
LAMB optimizer
|
2022-02-15 06:48:13 -07:00 |
|
James Betker
|
2bdb515068
|
A few mods to make wav2vec2 trainable with DDP on DLAS
|
2022-02-15 06:28:54 -07:00 |
|
James Betker
|
52b61b9f77
|
Update scripts and attempt to figure out how UnifiedVoice could be used to produce CTC codes
|
2022-02-13 20:48:06 -07:00 |
|
James Betker
|
a4f1641eea
|
Add & refine WER evaluator for w2v
|
2022-02-13 20:47:29 -07:00 |
|
James Betker
|
e16af944c0
|
BSO fix
|
2022-02-12 20:01:04 -07:00 |
|
James Betker
|
15fd60aad3
|
Allow EMA training to be disabled
|
2022-02-12 20:00:23 -07:00 |
|
James Betker
|
102142d1eb
|
f
|
2022-02-11 11:05:13 -07:00 |
|
James Betker
|
40b08a52d0
|
dafuk
|
2022-02-11 11:01:31 -07:00 |
|
James Betker
|
f6a7f12cad
|
Remove broken evaluator
|
2022-02-11 11:00:29 -07:00 |
|
James Betker
|
46b97049dc
|
Fix eval
|
2022-02-11 10:59:32 -07:00 |
|
James Betker
|
5175b7d91a
|
training sweeper checkin
|
2022-02-11 10:46:37 -07:00 |
|
James Betker
|
d1d1ae32a1
|
audio diffusion frechet distance measurement!
|
2022-02-10 22:55:46 -07:00 |
|
James Betker
|
23a310b488
|
Fix BSO
|
2022-02-10 20:54:51 -07:00 |
|
James Betker
|
1e28e02f98
|
BSO improvement to make it work with distributed optimizers
|
2022-02-10 09:53:13 -07:00 |
|
James Betker
|
836eb08afb
|
Update BSO to use the proper step size
|
2022-02-10 09:44:15 -07:00 |
|
James Betker
|
3d946356f8
|
batch_size_optimizer works. sweet! no more tuning batch sizes.
|
2022-02-09 14:26:23 -07:00 |
|
James Betker
|
18938248e4
|
Add batch_size_optimizer support
|
2022-02-08 23:51:31 -07:00 |
|
James Betker
|
de1a1d501a
|
Move audio injectors into their own file
|
2022-02-03 21:42:37 -07:00 |
|
James Betker
|
fbea6e8eac
|
Adjustments to diffusion networks
|
2022-01-30 16:14:06 -07:00 |
|
James Betker
|
798ed7730a
|
i like wasting time
|
2022-01-24 18:12:08 -07:00 |
|
James Betker
|
fc09cff4b3
|
angry
|
2022-01-24 18:09:29 -07:00 |
|
James Betker
|
cc0d9f7216
|
Fix
|
2022-01-24 18:05:45 -07:00 |
|
James Betker
|
3a9e3a9db3
|
consolidate state
|
2022-01-24 17:59:31 -07:00 |
|
James Betker
|
dfef34ba39
|
Load ema to cpu memory if specified
|
2022-01-24 15:08:29 -07:00 |
|
James Betker
|
49edffb6ad
|
Revise device mapping
|
2022-01-24 15:08:13 -07:00 |
|
James Betker
|
33511243d5
|
load model state dicts into the correct device
it's not clear to me that this will make a huge difference, but it's a good idea anyways
|
2022-01-24 14:40:09 -07:00 |
|
James Betker
|
3e16c509f6
|
Misc fixes
|
2022-01-24 14:31:43 -07:00 |
|
James Betker
|
e420df479f
|
Allow steps to specify which state keys to carry forward (reducing memory utilization)
|
2022-01-24 11:01:27 -07:00 |
|
James Betker
|
62475005e4
|
Sort data items in descending order, which I suspect will improve performance because we will hit GC less
|
2022-01-23 19:05:32 -07:00 |
|
James Betker
|
8f48848f91
|
misc
|
2022-01-22 08:23:29 -07:00 |
|
James Betker
|
ce929a6b3f
|
Allow grad scaler to be enabled even in fp32 mode
|
2022-01-21 23:13:24 -07:00 |
|
James Betker
|
bcd8cc51e1
|
Enable collated data for diffusion purposes
|
2022-01-19 00:35:08 -07:00 |
|
James Betker
|
894d245062
|
More zero_grad fixes
|
2022-01-08 20:31:19 -07:00 |
|
James Betker
|
2a9a25e6e7
|
Fix likely defective nan grad recovery
|
2022-01-08 18:24:58 -07:00 |
|
James Betker
|
65ffe38fce
|
misc
|
2022-01-06 22:16:17 -07:00 |
|
James Betker
|
f4484fd155
|
Add "dataset_debugger" support
This allows the datasets themselves compile statistics and report them
via tensorboard and wandb.
|
2022-01-06 12:38:20 -07:00 |
|
James Betker
|
b12f47b36d
|
Add some noise to voice_voice_clip
|
2021-12-29 13:56:30 -07:00 |
|
James Betker
|
64cb4a92db
|
Support adamw_zero
|
2021-12-25 21:32:01 -07:00 |
|
James Betker
|
776a7abfcc
|
Support torch DDP _set_static_graph
|
2021-12-25 21:20:06 -07:00 |
|
James Betker
|
62c8ed9a29
|
move speech utils
|
2021-12-16 20:47:37 -07:00 |
|
James Betker
|
e7957e4897
|
Make loss accumulator for logs accumulate better
|
2021-12-12 22:23:17 -07:00 |
|
James Betker
|
76f86c0e47
|
gaussian_diffusion: support fp16
|
2021-12-12 19:52:21 -07:00 |
|
James Betker
|
aa7cfd1edf
|
Add support for mel norms across the channel dim
|
2021-12-12 19:52:08 -07:00 |
|
James Betker
|
63bf135b93
|
Support norms
|
2021-12-11 08:30:49 -07:00 |
|
James Betker
|
5a664aa56e
|
misc
|
2021-12-11 08:17:26 -07:00 |
|
James Betker
|
306274245b
|
Also do dynamic range compression across mel
|
2021-12-10 20:06:24 -07:00 |
|
James Betker
|
faf55684b8
|
Use slaney norm in the mel filterbank computation
|
2021-12-10 20:04:52 -07:00 |
|