v0.1.0

github-actions released this 14 Jan 15:25


880fa7a

What's Changed

Potentially Breaking Changes
- Swapped to uv dependency management from pip
- Baked model files and voicepacks directly into gpu + cpu images
- The latest-slim tags could use some community testing; they will continue to be optimized and checked for deployability
- The Dockerfiles and Docker Compose files have been moved into the docker directory. Be sure to check the paths when launching

UI Changes
- Multi-select and merging of voices has been enabled.
- An environment flag can now disable local saving/filepath operations. By default, files should still be saved locally
- Made the waveform a dynamic blue color
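
As a rough illustration of how an environment flag like the one above might be read, here is a minimal sketch. The variable name `DISABLE_LOCAL_SAVING` and the helper `local_saving_disabled` are hypothetical and are not taken from this release; check the project's configuration for the real name and accepted values.

```python
import os
from typing import Mapping, Optional


def local_saving_disabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when the flag is set to a truthy value.

    DISABLE_LOCAL_SAVING is a hypothetical name used only for illustration.
    """
    env = os.environ if env is None else env
    # Default to "false" so local saving stays enabled unless explicitly disabled.
    value = env.get("DISABLE_LOCAL_SAVING", "false")
    return value.strip().lower() in {"1", "true", "yes", "on"}
```

With nothing set, the helper returns `False`, which matches the note that saving locally remains the default.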

API Changes
- Simplified audio normalization for better stability (end users likely won't notice a difference)
- Streaming now respects broken connections and will stop processing at the next chunk
- Minor to moderate GPU memory handling cleanup, with safeties added (clearing intermediate tensors, adding a pressure warning, etc.)
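
The streaming change can be pictured with a small sketch: check whether the client is still connected before starting each chunk, and stop if it is not. This is an assumption about the general pattern, not the project's actual code. `stream_chunks`, `synthesize`, and `is_disconnected` are hypothetical names; in a Starlette-style app, `Request.is_disconnected` has the same shape as the `is_disconnected` callable used here.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable, List


async def stream_chunks(
    text_chunks: List[str],
    synthesize: Callable[[str], Awaitable[bytes]],
    is_disconnected: Callable[[], Awaitable[bool]],
) -> AsyncIterator[bytes]:
    """Yield audio for each text chunk, stopping once the client is gone."""
    for chunk in text_chunks:
        # Check before starting the next chunk, so a dropped connection
        # costs at most the chunk that was already in progress.
        if await is_disconnected():
            break
        yield await synthesize(chunk)
```

The check happens once per chunk rather than continuously, which is why processing stops "at the next chunk" instead of immediately.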

CI/CD live on GitHub Actions
- Pytest now runs all API tests on every pull request. You can modify the tests to match new functionality and add more as needed, but try not to lose any coverage; it makes my life a bit easier
- PyTorch mocks have been mostly removed; automated tests now run against the CPU version

This has been a great model to work with. Looking forward to when the new 0.24 version is released by https://huggingface.co/hexgrad/Kokoro-82M.

Be sure to check out their page for updates on model development, and keep in mind they're always looking for more data.

New Contributors

@Galunid made their first contribution in #32

Full Changelog: v0.0.5...v0.1.0

