vall-e/vall_e/emb
2024-08-06 15:08:37 -05:00
__init__.py Rewrite init 2023-08-02 21:53:35 +00:00
g2p.py some insanity for sanity checks (some phonemes from phonemizing japanese are not in my tokenizer...) 2024-07-22 00:30:40 -05:00
process.py tweaked vall_e.emb.process to process audio one file at a time, instead of all the files for a given speaker, to avoid OOMing on low-memory systems with --low-memory 2024-08-06 14:24:40 -05:00
qnt.py do mixed-precision for AMP inside the compress function itself, because the loudness function gripes when using a float16 (non-power of 2 lengths) or bfloat16 (something about views for bfloat16) 2024-08-06 15:08:37 -05:00
transcribe.py tweaks and things 2024-08-06 08:17:25 -05:00