tortoise-tts

Author	SHA1	Message	Date
mrq	b721e395b5	modified conversion scripts to not give a shit about bitrate and formats since torchaudio.load handles all of that anyways, and it all gets resampled anyways	2023-02-15 04:44:14 +00:00
mrq	2e777e8a67	done away with kludgy shit code, just have the user decide how many chunks to slice concat'd samples to (since it actually does improve vocie replicability)	2023-02-15 04:39:31 +00:00
mrq	314feaeea1	added reset generation settings to default button, revamped utilities tab to double as plain jane voice importer (and runs through voicefixer despite it not really doing anything if your voice samples are already of decent quality anyways), ditched load_wav_to_torch or whatever it was called because it literally exists as torchaudio.load, sample voice is now a combined waveform of all your samples and will always return even if using a latents file	2023-02-14 21:20:04 +00:00
mrq	0bc2c1f540	updates chunk size to the chunked tensor length, just in case	2023-02-14 17:13:34 +00:00
mrq	48275899e8	added flag to enable/disable voicefixer using CUDA because I'll OOM on my 2060, changed from naively subdividing eavenly (2,4,8,16 pieces) to just incrementing by 1 (1,2,3,4) when trying to subdivide within constraints of the max chunk size for computing voice latents	2023-02-14 16:47:34 +00:00
mrq	b648186691	history tab doesn't naively reuse the voice dir instead for results, experimental "divide total sound size until it fits under requests max chunk size" doesn't have a +1 to mess things up (need to re-evaluate how I want to calculate sizes of bests fits eventually)	2023-02-14 16:23:04 +00:00
mrq	47f4b5bf81	voicefixer uses CUDA if exposed	2023-02-13 15:30:49 +00:00
mrq	8250a79b23	Implemented kv_cache "fix" (from `1f3c1b5f4a`); guess I should find out why it's crashing DirectML backend	2023-02-13 13:48:31 +00:00
mrq	80eeef01fb	Merge pull request 'Download from Gradio' (#31 ) from Armored1065/tortoise-tts:main into main Reviewed-on: mrq/tortoise-tts#31	2023-02-13 13:30:09 +00:00
Armored1065	8c96aa02c5	Merge pull request 'Update 'README.md'' (#1 ) from armored1065-patch-1 into main Reviewed-on: Armored1065/tortoise-tts#1	2023-02-13 06:21:37 +00:00
Armored1065	d458e932be	Update 'README.md' Updated text to reflect the download and playback options	2023-02-13 06:19:42 +00:00
mrq	f92e432c8d	added random voice option back because I forgot I accidentally removed it	2023-02-13 04:57:06 +00:00
mrq	a2bac3fb2c	Fixed out of order settings causing other settings to flipflop	2023-02-13 03:43:08 +00:00
mrq	5b5e32338c	DirectML: fixed redaction/aligner by forcing it to stay on CPU	2023-02-12 20:52:04 +00:00
mrq	824ad38cca	fixed voicefixing not working as intended, load TTS before Gradio in the webui due to how long it takes to initialize tortoise (instead of just having a block to preload it)	2023-02-12 20:05:59 +00:00
mrq	4d01bbd429	added button to recalculate voice latents, added experimental switch for computing voice latents	2023-02-12 18:11:40 +00:00
mrq	88529fda43	fixed regression with computing conditional latencies outside of the CPU	2023-02-12 17:44:39 +00:00
mrq	65f74692a0	fixed silently crashing from enabling kv_cache-ing if using the DirectML backend, throw an error when reading a generated audio file that does not have any embedded metadata in it, cleaned up the blocks of code that would DMA/transfer tensors/models between GPU and CPU	2023-02-12 14:46:21 +00:00
mrq	94757f5b41	instll python3.9, wrapped try/catch when parsing args.listen in case you somehow manage to insert garbage into that field and fuck up your config, removed a very redudnant setup.py install call since that only is required if you're just going to install it for using outside of the tortoise-tts folder	2023-02-12 04:35:21 +00:00
mrq	ddd0c4ccf8	cleanup loop, save files while generating a batch in the event it crashes midway through	2023-02-12 01:15:22 +00:00
mrq	1b55730e67	fixed regression where the auto_conds do not move to the GPU and causes a problem during CVVP compare pass	2023-02-11 20:34:12 +00:00
mrq	3d69274a46	Merge pull request 'Only directories in the voice list' (#20 ) from lightmare/tortoise-tts:only_dirs_in_voice_list into main Reviewed-on: mrq/tortoise-tts#20	2023-02-11 20:14:36 +00:00
lightmare	13b60db29c	Only directories in the voice list	2023-02-11 18:26:51 +00:00
mrq	3a8ce5a110	Moved experimental settings to main tab, hidden under a check box	2023-02-11 17:21:08 +00:00
mrq	126f1a0afe	sloppily guarantee stop/reloading TTS actually works	2023-02-11 17:01:40 +00:00
mrq	6d06bcce05	Added candidate selection for outputs, hide output elements (except for the main one) to only show one progress bar	2023-02-11 16:34:47 +00:00
mrq	a7330164ab	Added integration for "voicefixer", fixed issue where candidates>1 and lines>1 only outputs the last combined candidate, numbered step for each generation in progress, output time per generation step	2023-02-11 15:02:11 +00:00
mrq	841754602e	store generation time per generation rather than per entire request	2023-02-11 13:00:39 +00:00
mrq	44eba62dc8	fixed using old output dir because of my autism with prefixing everything with "./" broke it, fixed incrementing filenames	2023-02-11 12:39:16 +00:00
mrq	58e2b22b0e	History tab (3/10 it works)	2023-02-11 01:45:25 +00:00
mrq	c924ebd034	Numbering predicates on input_#.json files instead of "number of wavs"	2023-02-10 22:51:56 +00:00
mrq	4f903159ee	revamped result formatting, added "kludgy" stop button	2023-02-10 22:12:37 +00:00
mrq	9e0fbff545	Slight notebook adjust	2023-02-10 20:22:12 +00:00
mrq	52a9ed7858	Moved voices out of the tortoise folder because it kept being processed for setup.py	2023-02-10 20:11:56 +00:00
mrq	8b83c9083d	Cleanup	2023-02-10 19:55:33 +00:00
mrq	a09eff5d9c	Added the remaining input settings	2023-02-10 16:47:57 +00:00
mrq	7baf9e3f79	Added a link to the colab notebook	2023-02-10 16:26:13 +00:00
mrq	5b852da720	Colab notebook (part II)	2023-02-10 16:12:11 +00:00
mrq	3d6ac3afaa	Colab notebook (part 1)	2023-02-10 15:58:56 +00:00
mrq	efa556b793	Added new options: "Output Sample Rate", "Output Volume", and documentation	2023-02-10 03:02:09 +00:00
mrq	57af25c6c0	oops	2023-02-09 22:17:57 +00:00
mrq	504db0d1ac	Added 'Only Load Models Locally' setting	2023-02-09 22:06:55 +00:00
mrq	460f5d6e32	Added and documented	2023-02-09 21:07:51 +00:00
mrq	145298b766	Oops	2023-02-09 20:49:22 +00:00
mrq	729be135ef	Added option: listen path	2023-02-09 20:42:38 +00:00
mrq	3f8302a680	I didn't have to suck off a wizard for DirectML support (courtesy of https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/7600 for leading the way)	2023-02-09 05:05:21 +00:00
mrq	50b4e2c458	oops	2023-02-09 02:39:08 +00:00
mrq	b23d6b4b4c	owari da...	2023-02-09 01:53:25 +00:00
mrq	494f3c84a1	beginning to add DirectML support	2023-02-08 23:03:52 +00:00
mrq	81e4d261b7	Added two flags/settings: embed output settings, slimmer computed voice latents	2023-02-08 14:14:28 +00:00

1 2 3 4 5 ...

264 Commits