vall-e

mrq/vall-e

History

mrq 2437a86efa ugh		2024-05-12 13:02:15 -05:00
..
cleanup_dataset.py	documentation update	2024-05-04 21:03:46 -05:00
deduplicate_librilight_libritts.py
parse_ppp.py	added sampling by speaker group name (might be better to de-emphasize the LibriVox/Audiobooks that are in large numbers, and emphasize the smaller pools), log cleanup	2023-10-16 19:30:38 -05:00
prepare_librilight.py	dataset preparation script updates, caved and am using HF tokenizer now	2024-04-21 14:49:18 -05:00
prepare_libritts.py
process_dataset.py	ugh	2024-05-12 13:02:15 -05:00
process_libritts.py	actually use the passed-through sample rate from encode for DAC because it does its own resampling I guess	2024-04-18 13:32:41 -05:00
run.sh
setup-training.sh
setup.sh	updated setup script	2023-10-06 20:08:28 -05:00
train_tokenizer.py	documentation update	2024-05-04 21:03:46 -05:00
transcribe_dataset.py	documentation update	2024-05-04 21:03:46 -05:00