Skip to content

v1.0.0

Compare
Choose a tag to compare
@kfswain kfswain released this 09 Sep 00:04
· 124 commits to main since this release
v1.0.0

Inference Gateway v1

This release marks the v1 of Inference Gateway, and with it the promotion of the InferencePool CRD to v1.

We're excited to announce our v1 release of Inference Gateway! A huge thank you to our contributors, gateway implementers, and downstream community for helping to shape IGW into something we are proud of.

If you're new: Please take a look at our guide to get started! Or learn more about IGW here: https://gateway-api-inference-extension.sigs.k8s.io/

There is still much to do and more enhancements to come. Namely:

  • SLO-based predictive scheduling
  • Flow Control for multi-tenancy support
  • An improved pluggable Data Layer system
  • Multi-modal support
  • APIs to support meeting multiple different SLOs in a single InferencePool

We look forward to what's next in the Inference space and looking forward to continuing to grow with it.

Onwards!

Cheers,
The IGW maintainer team

What's Changed

New Contributors

Full Changelog: v0.5.1...v1.0.0