Web UI for WhisperSpeech (https://github.com/collabora/WhisperSpeech)
Note
Versions 2.x, 3.x and 4.x allow voice generation via API.
Note
Version 4.x supports SillyTavern (1.15)
| Name | Info |
|---|---|
| CPU | AMD Ryzen 9 9950X3D |
| GPU | AMD Radeon 7900XTX |
| RAM | 64GB DDR5 6600MHz |
| Motherboard | Gigabyte X870 AORUS ELITE WIFI7 (BIOS F8) |
| OS | Debian 13.4 |
| Kernel | 6.12.74+deb13+1-amd64 |
| ROCm | 7.2.3 |
| Name | Info |
|---|---|
| CPU | IntelCore i5-12500H |
| GPU | NVIDIA GeForce RTX 4050 |
| RAM | 16GB DDR4 3200MHz |
| Motherboard | GIGABYTE G5 MF (BIOS FB10) |
| OS | CachyOS |
| Kernel | 6.19.12-1-cachyos |
| NVIDIA Driver | 595.58.03 |
| CUDA | 13.2 |
1. Install uv and ffmpeg.
2. Clone repository.
3. Mount the repository directory.
3. Create and activate venv using uv:
uv venv --python 3.13
source .venv/bin/activate4. Run webui.py:
CPU (not recommended):
uv run --extra cpu webui.pyROCm 7.2.3:
uv run --extra rocm webui.pyCUDA 13.2:
uv run --extra cuda webui.py1. Copy whisperspeech-webui folder to SillyTavern/public/scripts/extensions/third-party
2. Run app with api support.
# This is example. If you are using SillyTavern locally, you can remove --listen parameter.
uv run --extra cpu webui.py --listen --api3. Launch SillyTavern. In the extensions tab, expand WhisperSpeech web UI and set the IP and port.
| Languages |
|---|
| English |
| Polish |
1. Extract messages.pot:
pybabel extract -F babel.cfg -o ./locale/messages.pot . 2. Generate files:
New language:
pybabel init -i ./locale/messages.pot -d ./locale -l pl_PL
# Replace pl_PL by your languageUpdate current:
pybabel update -i ./locale/messages.pot -d ./locale -l pl_PL
# Replace pl_PL by your language3. Compile:
pybabel compile -d ./locale