3e8365fdecvoicefixed files do not overwrite, as my autism wants to hear the difference between them, incrementing file format fixed for realmrq2023-02-15 05:49:28 +0000
ea1bc770aaadded option: force cpu for conditioning latents, for when you want low chunk counts but your GPU keeps OOMing because fuck fragmentationmrq2023-02-15 05:01:40 +0000
b721e395b5modified conversion scripts to not give a shit about bitrate and formats since torchaudio.load handles all of that anyways, and it all gets resampled anywaysmrq2023-02-15 04:44:14 +0000
2e777e8a67done away with kludgy shit code, just have the user decide how many chunks to slice concat'd samples to (since it actually does improve vocie replicability)mrq2023-02-15 04:39:31 +0000
314feaeea1added reset generation settings to default button, revamped utilities tab to double as plain jane voice importer (and runs through voicefixer despite it not really doing anything if your voice samples are already of decent quality anyways), ditched load_wav_to_torch or whatever it was called because it literally exists as torchaudio.load, sample voice is now a combined waveform of all your samples and will always return even if using a latents filemrq2023-02-14 21:20:04 +0000
0bc2c1f540updates chunk size to the chunked tensor length, just in casemrq2023-02-14 17:13:34 +0000
48275899e8added flag to enable/disable voicefixer using CUDA because I'll OOM on my 2060, changed from naively subdividing eavenly (2,4,8,16 pieces) to just incrementing by 1 (1,2,3,4) when trying to subdivide within constraints of the max chunk size for computing voice latentsmrq2023-02-14 16:47:34 +0000
b648186691history tab doesn't naively reuse the voice dir instead for results, experimental "divide total sound size until it fits under requests max chunk size" doesn't have a +1 to mess things up (need to re-evaluate how I want to calculate sizes of bests fits eventually)mrq2023-02-14 16:23:04 +0000
47f4b5bf81voicefixer uses CUDA if exposedmrq2023-02-13 15:30:49 +0000
8250a79b23Implemented kv_cache "fix" (from 1f3c1b5f4a); guess I should find out why it's crashing DirectML backendmrq2023-02-13 13:48:31 +0000
80eeef01fbMerge pull request 'Download from Gradio' (#31) from Armored1065/tortoise-tts:main into main
mrq
2023-02-13 13:30:09 +0000
446d643d62Merge pull request 'Update 'README.md'' (#1) from armored1065-patch-1 into main
Armored1065
2023-02-13 06:21:37 +0000
8c96aa02c5Merge pull request 'Update 'README.md'' (#1) from armored1065-patch-1 into main
Armored1065
2023-02-13 06:21:37 +0000
37d25573acadded random voice option back because I forgot I accidentally removed itmrq2023-02-13 04:57:06 +0000
f92e432c8dadded random voice option back because I forgot I accidentally removed itmrq2023-02-13 04:57:06 +0000
a84aaa4f96Fixed out of order settings causing other settings to flipflopmrq2023-02-13 03:43:08 +0000
a2bac3fb2cFixed out of order settings causing other settings to flipflopmrq2023-02-13 03:43:08 +0000
4ced0296a2DirectML: fixed redaction/aligner by forcing it to stay on CPUmrq2023-02-12 20:52:04 +0000
5b5e32338cDirectML: fixed redaction/aligner by forcing it to stay on CPUmrq2023-02-12 20:52:04 +0000
409dec98d5fixed voicefixing not working as intended, load TTS before Gradio in the webui due to how long it takes to initialize tortoise (instead of just having a block to preload it)mrq2023-02-12 20:05:59 +0000
824ad38ccafixed voicefixing not working as intended, load TTS before Gradio in the webui due to how long it takes to initialize tortoise (instead of just having a block to preload it)mrq2023-02-12 20:05:59 +0000
b85c9921d7added button to recalculate voice latents, added experimental switch for computing voice latentsmrq2023-02-12 18:11:40 +0000
4d01bbd429added button to recalculate voice latents, added experimental switch for computing voice latentsmrq2023-02-12 18:11:40 +0000
2210b49cb6fixed regression with computing conditional latencies outside of the CPUmrq2023-02-12 17:44:39 +0000
88529fda43fixed regression with computing conditional latencies outside of the CPUmrq2023-02-12 17:44:39 +0000
a2d95fe208fixed silently crashing from enabling kv_cache-ing if using the DirectML backend, throw an error when reading a generated audio file that does not have any embedded metadata in it, cleaned up the blocks of code that would DMA/transfer tensors/models between GPU and CPUmrq2023-02-12 14:46:21 +0000
65f74692a0fixed silently crashing from enabling kv_cache-ing if using the DirectML backend, throw an error when reading a generated audio file that does not have any embedded metadata in it, cleaned up the blocks of code that would DMA/transfer tensors/models between GPU and CPUmrq2023-02-12 14:46:21 +0000
25e70dce1ainstll python3.9, wrapped try/catch when parsing args.listen in case you somehow manage to insert garbage into that field and fuck up your config, removed a very redudnant setup.py install call since that only is required if you're just going to install it for using outside of the tortoise-tts foldermrq2023-02-12 04:35:21 +0000
94757f5b41instll python3.9, wrapped try/catch when parsing args.listen in case you somehow manage to insert garbage into that field and fuck up your config, removed a very redudnant setup.py install call since that only is required if you're just going to install it for using outside of the tortoise-tts foldermrq2023-02-12 04:35:21 +0000
6328466852cleanup loop, save files while generating a batch in the event it crashes midway throughmrq2023-02-12 01:15:22 +0000
ddd0c4ccf8cleanup loop, save files while generating a batch in the event it crashes midway throughmrq2023-02-12 01:15:22 +0000
5f1c032312fixed regression where the auto_conds do not move to the GPU and causes a problem during CVVP compare passmrq2023-02-11 20:34:12 +0000
1b55730e67fixed regression where the auto_conds do not move to the GPU and causes a problem during CVVP compare passmrq2023-02-11 20:34:12 +0000
2f86565969Merge pull request 'Only directories in the voice list' (#20) from lightmare/tortoise-tts:only_dirs_in_voice_list into main
mrq
2023-02-11 20:14:36 +0000
3d69274a46Merge pull request 'Only directories in the voice list' (#20) from lightmare/tortoise-tts:only_dirs_in_voice_list into main
mrq
2023-02-11 20:14:36 +0000
192a510ee1Only directories in the voice list
lightmare
2023-02-11 18:26:51 +0000
13b60db29cOnly directories in the voice list
lightmare
2023-02-11 18:26:51 +0000
84316d8f80Moved experimental settings to main tab, hidden under a check boxmrq2023-02-11 17:21:08 +0000
3a8ce5a110Moved experimental settings to main tab, hidden under a check boxmrq2023-02-11 17:21:08 +0000
4b3b0ead1aAdded candidate selection for outputs, hide output elements (except for the main one) to only show one progress barmrq2023-02-11 16:34:47 +0000
6d06bcce05Added candidate selection for outputs, hide output elements (except for the main one) to only show one progress barmrq2023-02-11 16:34:47 +0000
c5337a6b51Added integration for "voicefixer", fixed issue where candidates>1 and lines>1 only outputs the last combined candidate, numbered step for each generation in progress, output time per generation stepmrq2023-02-11 15:02:11 +0000
a7330164abAdded integration for "voicefixer", fixed issue where candidates>1 and lines>1 only outputs the last combined candidate, numbered step for each generation in progress, output time per generation stepmrq2023-02-11 15:02:11 +0000
fa743e2e9bstore generation time per generation rather than per entire requestmrq2023-02-11 13:00:39 +0000
841754602estore generation time per generation rather than per entire requestmrq2023-02-11 13:00:39 +0000
ffb269e579fixed using old output dir because of my autism with prefixing everything with "./" broke it, fixed incrementing filenamesmrq2023-02-11 12:39:16 +0000
44eba62dc8fixed using old output dir because of my autism with prefixing everything with "./" broke it, fixed incrementing filenamesmrq2023-02-11 12:39:16 +0000
9bf1ea5b0aHistory tab (3/10 it works)mrq2023-02-11 01:45:25 +0000
58e2b22b0eHistory tab (3/10 it works)mrq2023-02-11 01:45:25 +0000
340a89f883Numbering predicates on input_#.json files instead of "number of wavs"mrq2023-02-10 22:51:56 +0000
c924ebd034Numbering predicates on input_#.json files instead of "number of wavs"mrq2023-02-10 22:51:56 +0000
8641cc9906revamped result formatting, added "kludgy" stop buttonmrq2023-02-10 22:12:37 +0000
4f903159eerevamped result formatting, added "kludgy" stop buttonmrq2023-02-10 22:12:37 +0000