|
f087b45943
|
Revert "added parameter to specify the autoregressive_model (tho it still loads the default model first, and then loads the target model, which seems to be because TTS loading just loads whatever is set in the settings first)"
This reverts commit d1dbe3e464.
|
2023-08-23 19:11:13 +02:00 |
|
|
d1dbe3e464
|
added parameter to specify the autoregressive_model (tho it still loads the default model first, and then loads the target model, which seems to be because TTS loading just loads whatever is set in the settings first)
removed load_tts() call, the cli tool now relies on generate() to load the tts system, which is less fail prone imho
|
2023-08-22 23:33:02 +02:00 |
|
|
89102347a9
|
favor existing arguments from parameters (kwargs) over global (args)
|
2023-08-22 23:09:42 +02:00 |
|
|
5d73d9e71c
|
small QoL change to the StringNone helper, to allow generated text to be "None", maybe someone wants to generate that, we never know...
|
2023-08-22 21:49:49 +02:00 |
|
|
9abcb0f193
|
websocket server: API change(!), better response format
|
2023-08-22 21:37:19 +02:00 |
|
|
1ec3344999
|
Merge branch 'master' of https://git.ecker.tech/ben_mkiv/ai-voice-cloning
|
2023-08-22 21:00:06 +02:00 |
|
|
a902913780
|
websocket server: workaround for values and None type
|
2023-08-22 20:20:49 +02:00 |
|
|
2060b6f21c
|
fixed issue with sliced audio being the wrong sample rate
|
2023-08-22 14:22:39 +00:00 |
|
|
eeddd4cb6b
|
forgot the important reason I even started working on AIVC again
|
2023-08-21 03:42:12 +00:00 |
|
|
72a38ff2fc
|
made initialization faster if there's a lot of voice files (because glob fucking sucks), committing changes buried on my training rig
|
2023-08-21 03:31:49 +00:00 |
|
mrq
|
91a0c495ff
|
Merge pull request 'added simple websocket server which allows to start tts generation tasks, retrieving autoregressive models and voices list' (#328) from ben_mkiv/ai-voice-cloning:master into master
Reviewed-on: #328
|
2023-08-16 14:01:44 +00:00 |
|
|
2626364c40
|
added simple websocket server which allows to start tts generation tasks, retrieving autoregressive models and voices list
|
2023-08-16 12:51:13 +02:00 |
|
|
ac645e0a20
|
no longer need to install bark under ./modules/
|
2023-07-11 16:20:28 +00:00 |
|
|
e2a6dc1c0a
|
under bark, properly use transcribed audio if the audio wasn't actually sliced (oops)
|
2023-07-11 14:53:32 +00:00 |
|
mrq
|
a325496661
|
Merge pull request 'Freeze pydantic package to 1.10.11' (#301) from Jarod/ai-voice-cloning:master into master
Reviewed-on: #301
|
2023-07-09 15:06:31 +00:00 |
|
Jarod
|
350d2d5a95
|
Freeze pydantic package to 1.10.11
inflect in DLAS requirements.txt installs pydantic 2.0.2, causing issues with the Gradio Webpage
|
2023-07-09 02:36:23 +00:00 |
|
|
6c3f48efba
|
uses gitmylo/bark-voice-cloning-HuBERT-quantizer for creating custom voices (it works slightly better than the base method, but still not very good desu)
|
2023-07-03 02:46:10 +00:00 |
|
|
547e1d1277
|
updated bark support, it'll also query for vocos, it actually works (I don't know what specifically was the issue)
|
2023-07-03 01:22:02 +00:00 |
|
|
76ed34ddd2
|
added CLI script (python ./src/cli.py --text=TEXT --voice=VOICE etc)
|
2023-06-11 04:46:22 +00:00 |
|
|
e227ab8e08
|
updated whisperX integration for use with the latest version (v3) (NOTE: you WILL need to also update whisperx if you pull this commit)
|
2023-06-09 02:41:29 +00:00 |
|
|
805d7d35e8
|
the power of a separate setup for testing
|
2023-05-22 17:36:28 +00:00 |
|
|
2f5486a8d5
|
oops
|
2023-05-21 23:24:13 +00:00 |
|
|
baa6b76b85
|
added gradio API for changing AR model
|
2023-05-21 23:20:39 +00:00 |
|
|
31da215c5f
|
added checkboxes to use the original method for calculating latents (ignores the voice chunk field)
|
2023-05-21 01:47:48 +00:00 |
|
|
9e3eca2261
|
freeze gradio because I forgot to do it last week when it broke
|
2023-05-18 14:45:49 +00:00 |
|
|
cbe21745df
|
I am very smart (need to validate)
|
2023-05-12 17:41:26 +00:00 |
|
|
74bd0f0cdc
|
revert local change that made its way upstream (showing graphs by iteration instead of epoch)
|
2023-05-11 03:30:54 +00:00 |
|
|
149aaca554
|
fixed the whisperx has no attribute named load_model whatever because I guess whisperx has as stable of an API as I do
|
2023-05-06 10:45:17 +00:00 |
|
|
e416b0fe6f
|
oops
|
2023-05-05 12:36:48 +00:00 |
|
|
5003bc89d3
|
cleaned up brain worms with wrapping around gradio progress by instead just using tqdm directly (slight regressions with some messages not getting pushed)
|
2023-05-04 23:40:33 +00:00 |
|
|
09d849a78f
|
quick hotfix if it actually is a problem in the repo itself
|
2023-05-04 23:01:47 +00:00 |
|
|
853c7fdccf
|
forgot to uncomment the block to transcribe and slice when using transcribe all because I was piece-processing a huge batch of LibriTTS and somehow that leaked over to the repo
|
2023-05-03 21:31:37 +00:00 |
|
|
fd306d850d
|
updated setup-directml.bat to not hard require torch version because it's updated to torch2 now
|
2023-04-29 00:50:16 +00:00 |
|
|
eddb8aaa9a
|
indentation fix
|
2023-04-28 15:56:57 +00:00 |
|
|
99387920e1
|
backported caching of phonemizer backend from mrq/vall-e
|
2023-04-28 15:31:45 +00:00 |
|
|
c5e9b407fa
|
boolean oops
|
2023-04-27 14:40:22 +00:00 |
|
|
3978921e71
|
forgot to make the transcription tab visible with the bark backend (god the code is a mess now, I'll suck you off if you clean this up for me (not really))
|
2023-04-26 04:55:10 +00:00 |
|
|
b6440091fb
|
Very, very, VERY, barebones integration with Bark (documentation soon)
|
2023-04-26 04:48:09 +00:00 |
|
|
faa8da12d7
|
modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james)
|
2023-04-13 21:10:38 +00:00 |
|
|
02beb1dd8e
|
should fix #203
|
2023-04-13 03:14:06 +00:00 |
|
|
8f3e9447ba
|
disable diarize button
|
2023-04-12 20:03:54 +00:00 |
|
|
d8b996911c
|
a bunch of shit i had uncommitted over the past while pertaining to VALL-E
|
2023-04-12 20:02:46 +00:00 |
|
mrq
|
b785192dfc
|
Merge pull request 'Make convenient to use with Docker' (#191) from psr/ai-voice-cloning:docker into master
Reviewed-on: #191
|
2023-04-08 14:04:45 +00:00 |
|
psr
|
9afafc69c1
|
docker: add training script
|
2023-04-07 23:15:13 +00:00 |
|
psr
|
c018bfca9c
|
docker: add ffmpeg for whisper and general cleanup
|
2023-04-07 23:14:05 +00:00 |
|
psr
|
d64cba667f
|
docker support
|
2023-04-07 21:52:18 +00:00 |
|
|
0440eac2bc
|
#185
|
2023-03-31 06:55:52 +00:00 |
|
|
9f64153a28
|
fixes #185
|
2023-03-31 06:03:56 +00:00 |
|
|
4744120be2
|
added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI)
|
2023-03-31 03:26:00 +00:00 |
|
|
9b01377667
|
only include auto in the list of models under setting, nothing else
|
2023-03-29 19:53:23 +00:00 |
|