vall-e/README.md

42 lines
2.2 KiB
Markdown
Raw Normal View History

2023-08-02 21:53:35 +00:00
<p align="center">
<img src="./vall-e.png" width="500px"></img>
</p>
# VALL'E
2023-08-02 21:53:35 +00:00
2024-12-12 20:38:42 +00:00
An unofficial PyTorch implementation of [VALL-E](https://vall-e-demo.ecker.tech/) (last updated: `2024.12.11`), utilizing the [EnCodec](https://github.com/facebookresearch/encodec) encoder/decoder.
2024-11-06 04:30:49 +00:00
A demo is available on HuggingFace [here](https://huggingface.co/spaces/ecker/vall-e).
## Requirements
2023-08-02 21:53:35 +00:00
2024-06-29 02:02:54 +00:00
Besides a working PyTorch environment, the only hard requirement is [`espeak-ng`](https://github.com/espeak-ng/espeak-ng/) for phonemizing text:
- Linux users can consult their package managers on installing `espeak`/`espeak-ng`.
- Windows users are required to install [`espeak-ng`](https://github.com/espeak-ng/espeak-ng/releases/tag/1.51#Assets).
+ additionally, you may be required to set the `PHONEMIZER_ESPEAK_LIBRARY` environment variable to specify the path to `libespeak-ng.dll`.
- In the future, an internal homebrew to replace this would be fantastic.
2023-08-02 21:53:35 +00:00
## Install
2023-08-02 21:53:35 +00:00
Simply run `pip install git+https://git.ecker.tech/mrq/vall-e` or `pip install git+https://github.com/e-c-k-e-r/vall-e`.
2023-08-02 21:53:35 +00:00
This repo is tested under Python versions `3.10.9`, `3.11.3`, and `3.12.3`.
2023-08-04 01:36:19 +00:00
2024-12-25 05:14:32 +00:00
### Additional Implementations
An "HF"-ified version of the model is available as [`ecker/vall-e@hf`](https://huggingface.co/ecker/vall-e/tree/hf), but it does require some additional efforts (see the `__main__` of [`./vall_e/models/base.py`](./vall_e/models/base.py) for details).
Additionally, [`vall_e.cpp`](./vall_e.cpp/) is available. Consult its README for more details.
2023-08-22 18:09:44 +00:00
## Pre-Trained Model
Pre-trained weights can be acquired from
* [here](https://huggingface.co/ecker/vall-e) or automatically when either inferencing or running the web UI.
* `./scripts/setup.sh`, a script to setup a proper environment and download the weights. This will also automatically create a `venv`.
* when inferencing, either through the web UI or CLI, if no model is passed, the default model will download automatically instead, and should automatically update.
2023-08-22 18:09:44 +00:00
2024-11-05 22:11:01 +00:00
## Documentation
2023-08-02 21:53:35 +00:00
2024-11-05 22:11:01 +00:00
The provided documentation under [./docs/](./docs/) should provide thorough coverage over most, if not all, of this project.
2023-08-02 21:53:35 +00:00
2024-11-05 22:11:01 +00:00
Markdown files should correspond directly to their respective file or folder under `./vall_e/`.