James Betker
|
37640e9759
|
Squelch annoying warning
|
2022-05-22 12:12:40 -06:00 |
|
James Betker
|
519151d83f
|
m2v
|
2022-05-17 15:37:59 -06:00 |
|
James Betker
|
eb64d18075
|
Fix phoneme tokenizer
|
2022-05-13 17:56:26 -06:00 |
|
James Betker
|
51f8c1bced
|
phonetic dataset
|
2022-05-12 11:57:28 -06:00 |
|
James Betker
|
c85ab738c5
|
paired fix
|
2022-04-16 23:41:57 -06:00 |
|
James Betker
|
8fe0dff33c
|
support tts typing
|
2022-04-16 23:36:57 -06:00 |
|
James Betker
|
48cb6a5abd
|
misc
|
2022-04-16 20:28:04 -06:00 |
|
James Betker
|
45804177b8
|
more stuff
|
2022-03-25 00:03:18 -06:00 |
|
James Betker
|
f563a8dd41
|
fixes
|
2022-03-15 21:43:00 -06:00 |
|
James Betker
|
7929fd89de
|
Refactor audio-style models into the audio folder
|
2022-03-15 11:06:25 -06:00 |
|
James Betker
|
9bbbe26012
|
update audio_with_noise
|
2022-03-12 20:41:47 -07:00 |
|
James Betker
|
726e30c4f7
|
Update noise augmentation dataset to include voices that are appended at the end of another clip.
|
2022-03-09 09:43:10 -07:00 |
|
James Betker
|
38fd9fc985
|
Improve efficiency of audio_with_noise_dataset
|
2022-03-08 15:50:13 -07:00 |
|
James Betker
|
30ddac69aa
|
lots of bad entries
|
2022-03-05 23:15:59 -07:00 |
|
James Betker
|
dcf98df0c2
|
++
|
2022-03-05 23:12:34 -07:00 |
|
James Betker
|
64d764ccd7
|
fml
|
2022-03-05 23:11:10 -07:00 |
|
James Betker
|
ef63ff84e2
|
pvd2
|
2022-03-05 23:08:39 -07:00 |
|
James Betker
|
1a05712764
|
pvd
|
2022-03-05 23:05:29 -07:00 |
|
James Betker
|
db0c3340ac
|
Implement guidance-free diffusion in eval
And a few other fixes
|
2022-03-01 11:49:36 -07:00 |
|
James Betker
|
03752c1cd6
|
Report NaN
|
2022-02-22 23:09:37 -07:00 |
|
James Betker
|
af50afe222
|
pairedvoice: error out if clip is too short
|
2022-02-21 19:11:10 -07:00 |
|
James Betker
|
79e8f36d30
|
Convert CLIP models into new folder
|
2022-02-15 20:53:07 -07:00 |
|
James Betker
|
c24682c668
|
Record load times in fast_paired_dataset
|
2022-02-07 15:45:38 -07:00 |
|
James Betker
|
5ae816bead
|
ctc gen checkin
|
2022-02-05 15:59:53 -07:00 |
|
James Betker
|
8fb147e8ab
|
add an autoregressive ctc code generator
|
2022-02-04 11:00:15 -07:00 |
|
James Betker
|
4249681c4b
|
Mods to support an autoregressive CTC code generator
|
2022-02-03 19:58:54 -07:00 |
|
James Betker
|
8c255811ad
|
more fixes
|
2022-01-25 17:57:16 -07:00 |
|
James Betker
|
91b4b240ac
|
Don't pickle unique files
|
2022-01-21 00:02:06 -07:00 |
|
James Betker
|
7fef7fb9ff
|
Update fast_paired_dataset to report how many audio files it is actually using
|
2022-01-20 21:49:38 -07:00 |
|
James Betker
|
20312211e0
|
Fix bug in code alignment
|
2022-01-20 11:28:12 -07:00 |
|
James Betker
|
bcd8cc51e1
|
Enable collated data for diffusion purposes
|
2022-01-19 00:35:08 -07:00 |
|
James Betker
|
b6190e96b2
|
fast_paired
|
2022-01-17 15:46:02 -07:00 |
|
James Betker
|
1d30d79e34
|
De-specify fast-paired-dataset
|
2022-01-16 21:20:00 -07:00 |
|
James Betker
|
2b36ca5f8e
|
Revert paired back
|
2022-01-16 21:10:46 -07:00 |
|
James Betker
|
ad3e7df086
|
Split the fast random into its own new dataset
|
2022-01-16 21:10:11 -07:00 |
|
James Betker
|
7331862755
|
Updated paired to randomly index data, offsetting memory costs and speeding up initialization
|
2022-01-16 21:09:22 -07:00 |
|
James Betker
|
37e4e737b5
|
a few fixes
|
2022-01-16 15:17:17 -07:00 |
|
James Betker
|
35db5ebf41
|
paired_voice_audio_dataset - aligned codes support
|
2022-01-15 17:38:26 -07:00 |
|
James Betker
|
6706591d3d
|
Fix dataset
|
2022-01-06 15:24:37 -07:00 |
|
James Betker
|
f4484fd155
|
Add "dataset_debugger" support
This allows the datasets themselves to compile statistics and report them
via tensorboard and wandb.
|
2022-01-06 12:38:20 -07:00 |
|
James Betker
|
f3cab45658
|
Revise audio datasets to include interesting statistics in batch
Stats include:
- How many indices were skipped to retrieve a given index
- Whether or not a conditioning input was actually the file itself
|
2022-01-06 11:15:16 -07:00 |
|
James Betker
|
06c1093090
|
Remove collating from paired_voice_audio_dataset
This will now be done at the model level, which is more efficient
|
2022-01-06 10:29:39 -07:00 |
|
James Betker
|
5e1d1da2e9
|
Clean paired_voice
|
2022-01-06 10:26:53 -07:00 |
|
James Betker
|
0fe34f57d1
|
Use torch resampler
|
2022-01-05 15:47:22 -07:00 |
|
James Betker
|
d5a5111890
|
Fix collating being on by default on grand_conjoined
|
2022-01-01 10:30:15 -07:00 |
|
James Betker
|
4d9ba4a48a
|
can i has fix now
|
2022-01-01 00:48:27 -07:00 |
|
James Betker
|
56752f1dbc
|
Fix collator bug
|
2022-01-01 00:33:31 -07:00 |
|
James Betker
|
c28d8770c7
|
fix tensor lengths
|
2022-01-01 00:23:46 -07:00 |
|
James Betker
|
bbacffb790
|
dataset improvements and fix to unified_voice_Bilevel
|
2022-01-01 00:16:30 -07:00 |
|
James Betker
|
17fb934575
|
wer update
|
2021-12-31 16:21:39 -07:00 |
|