v0.1.0
What's Changed
- Potentially Breaking Changes
- Swapped to
uvdependency management from pip - Baked model files and voicepacks directly into gpu + cpu images
latest-slimtags could use some community testing, but will be optimizing and checking on deployability- Location of dockerfiles + docker compose has been moved into the
dockerdirectory. Be sure to check the paths when launching
- Swapped to
-
UI Changes:
- Multi-select and merging of voices has been enabled.
- An environment flag was set to disable local saving/filepath operations. By default it should still be saving locally
- Made the waveform a dynamic blue color
-
API Changes
- Simplified audio normalization, more stable (likely won't notice a difference as the end user)
- Streaming now respects broken connections, will stop processing on the next chunk
- Minor/Moderate GPU memory handling cleanup and safeties added (clearing intermediate tensors, adding pressure warning, etc)
-
CI/CD live on Github Actions
- Pytest will run through all API tests on any pull requests now. You can modify them to align with new functionality, and add as needed but try not to lose any coverage, makes my life a bit easier
- Pytorch mocks mostly removed, run on CPU version for automated testing.
This has been a great model to work with. Looking forward to when the new 0.24 version is released by https://huggingface.co/hexgrad/Kokoro-82M.
Be sure to check their page out out for updates on model development, and keep in mind they're always looking for more data
New Contributors
Full Changelog: v0.0.5...v0.1.0