• https://git.ecker.tech/ aims to provide a place to share my efforts while maintaining true ownership of my code, as I do not trust GitHub.

    XMR: 4B9TQdkAkBFYrbj5ztvTx89e5LpucPeTSPzemCihdDi9EBnx7btn8RDNZTBz2zihWsjMnDkzn5As1LU6gLv3KQy8BLsZ8SG

  • Joined on 2022-10-10
mrq pushed to master at mrq/vall-e 2024-09-26 04:27:37 +00:00
9da630f73a swap order of demo entries, as the model prioritizes adhering to the speaker prompt more (instead of trying to match the ground truth magically)
mrq pushed to master at mrq/vall-e 2024-09-25 01:02:41 +00:00
e84d466261 vall_e.plot tweaks
mrq pushed to master at mrq/resnet-classifier 2024-09-23 03:03:35 +00:00
16ad4fa1c9 fixed not being able to use other resnets as a base
mrq pushed to master at mrq/vall-e 2024-09-21 21:02:40 +00:00
2266d34818 oops
mrq pushed to master at mrq/vall-e 2024-09-21 18:04:06 +00:00
c5e9142863 added option to retokenize phonemes for hdf5 (to save having to remake my hdf5 file)
mrq pushed to master at mrq/vall-e 2024-09-21 17:56:00 +00:00
536c11c4ac actually validated and fixed sampling similar utterances for the prompt (hopefully nothing else is needed)
mrq pushed to master at mrq/vall-e 2024-09-21 17:25:39 +00:00
d31f27119a regex replace out the (lang) markers in espeak, updated tokenizer vocab as lazily as possible to not have unk tokens
mrq pushed to master at mrq/vall-e 2024-09-21 17:15:47 +00:00
769f67dcfe actually fix validation of phonemes in the symmap
mrq pushed to master at mrq/vall-e 2024-09-19 02:37:08 +00:00
mrq pushed to master at mrq/vall-e 2024-09-19 02:30:58 +00:00
fe241f6a99 support for wildcard in training/validation/noise dataset array (to-do: a better way to query between metadata folder and data folder)
mrq pushed to master at mrq/vall-e 2024-09-19 01:15:54 +00:00
b5bec0c9ce oops, turns out these are not split by speaker names already........ (also added sampling the dataset in the webui for easy viewing)
mrq pushed to master at mrq/vall-e 2024-09-19 00:32:09 +00:00
fa9d3f6c06 lang fixes / reworked phoneme symmap validation
mrq pushed to master at mrq/vall-e 2024-09-18 21:40:17 +00:00
84647f588a more tweaks
mrq pushed to master at mrq/vall-e 2024-09-18 03:53:16 +00:00
ebac1db16c maybe final tweaks, I really needed to unify my json read/write and orjson is proven to be fast enough for me to try and rely on it more
mrq pushed to master at mrq/vall-e 2024-09-18 03:40:46 +00:00
6ceed866b5 *faster*
mrq pushed to master at mrq/vall-e 2024-09-18 03:22:41 +00:00
f00283440c faster
mrq pushed to master at mrq/vall-e 2024-09-18 02:54:56 +00:00
be22b65300 solved my problem
mrq pushed to master at mrq/vall-e 2024-09-17 21:22:40 +00:00
8f41d1b324 more tweaks
mrq pushed to master at mrq/vall-e 2024-09-17 20:49:07 +00:00
804ddb5182 optimizations (6 hours to do cosine similarities on a speaker set of just 17k utterances................)
mrq pushed to master at mrq/vall-e 2024-09-17 20:21:33 +00:00
a9fbe81f98 oops