This results in a significant compression of the text domain; I'm curious what the effect on speech quality will be.

| File |
|---|
| audio_with_noise_dataset.py |
| gpt_tts_dataset.py |
| nv_tacotron_dataset.py |
| paired_voice_audio_dataset.py |
| unsupervised_audio_dataset.py |
| wav_aug.py |