| | 0db8ebc543 | deduce if preprocessing text by checking the JSON itself instead | 2023-03-16 14:41:21 +00:00 |
| | 730f56aa87 | some day I'll get a commit right on the first try | 2023-03-16 04:37:49 +00:00 |
| | 730a04708d | added flag to disable preprocessing (because some IPAs will turn into ASCII, implicitly enable for using the specific ipa.json tokenizer vocab) | 2023-03-16 04:24:32 +00:00 |
| James Betker | 7929fd89de | Refactor audio-style models into the audio folder | 2022-03-15 11:06:25 -06:00 |
| James Betker | 17fb934575 | wer update | 2021-12-31 16:21:39 -07:00 |
| James Betker | f0c4cd6317 | Taking another stab at a BPE tokenizer | 2021-12-30 13:41:24 -07:00 |