Skip to content

Releases: kubernetes-sigs/gateway-api-inference-extension

v1.0.1

25 Sep 23:43
v1.0.1
Compare
Choose a tag to compare

What's Changed

Bug fixes to helm charts, no changes in EPP image or IGW APIs

Full Changelog: v1.0.0...v1.0.1

v1.0.1-rc.1

23 Sep 02:56
v1.0.1-rc.1
Compare
Choose a tag to compare
v1.0.1-rc.1 Pre-release
Pre-release

This is a small patch release to fix helm issues.

Context: #1616

v1.0.0

09 Sep 00:04
v1.0.0
Compare
Choose a tag to compare

Inference Gateway v1

This release marks the v1 of Inference Gateway, and with it the promotion of the InferencePool CRD to v1.

We're excited to announce our v1 release of Inference Gateway! A huge thank you to our contributors, gateway implementers, and downstream community for helping to shape IGW into something we are proud of.

If you're new: Please take a look at our guide to get started! Or learn more about IGW here: https://gateway-api-inference-extension.sigs.k8s.io/

There is still much to do and more enhancements to come. Namely:

  • SLO-based predictive scheduling
  • Flow Control for multi-tenancy support
  • An improved pluggable Data Layer system
  • Multi-modal support
  • APIs to support meeting multiple different SLOs in a single InferencePool

We look forward to what's next in the Inference space and looking forward to continuing to grow with it.

Onwards!

Cheers,
The IGW maintainer team

What's Changed

Read more

v1.0.0-rc.4

08 Sep 12:10
v1.0.0-rc.4
7ce1f47
Compare
Choose a tag to compare
v1.0.0-rc.4 Pre-release
Pre-release

a list of PRs that are cherry picked into RC4:

CRD updates:

#1521

performance issues fixed in pickers:

#1523
#1514
#1528

helm chart fix:

#1522
#1540
#1542

bug fix in prefix when no request id header is supplied by the gateway:

#1490 (was on the original list but somehow missed, without this prefix cache won't work in bursty workload)

test flake fix, required for llm-d to use formal image of IGW:

#1534

** all the items in this list have been cherry picked successfully into the release branch.

v1.0.0-rc.3

05 Sep 11:57
v1.0.0-rc.3
9c24d20
Compare
Choose a tag to compare
v1.0.0-rc.3 Pre-release
Pre-release

cherry picked PRs:
#1508 - critical bug fix to allow setting custom plugins config through helm chart
#1509 - prefix writing its state to CycleState.
#1412 - new weighted random picker

v1.0.0-rc.2

29 Aug 00:11
v1.0.0-rc.2
Compare
Choose a tag to compare
v1.0.0-rc.2 Pre-release
Pre-release

This release is primarily updating the InferencePool API and Conformance tests after the completion of the API review conducted in this PR: #1173

NOTE: Barring any breaking change after this RC the APIs are considered frozen for the remainder of the v1.0 release cycle

v1.0.0-rc.1

26 Aug 12:59
v1.0.0-rc.1
Compare
Choose a tag to compare
v1.0.0-rc.1 Pre-release
Pre-release

What's Changed

Read more

v0.5.1

23 Jul 20:04
v0.5.1
Compare
Choose a tag to compare

This patch fix is intended to resolve a few bug fixes. Justification & breakdown here: #1215

v0.5.1-rc.1

22 Jul 23:20
v0.5.1-rc.1
Compare
Choose a tag to compare
v0.5.1-rc.1 Pre-release
Pre-release

This patch fix is intended to resolve a few bug fixes. Justification & breakdown here: #1215

v0.5.0

21 Jul 18:20
38577e6
Compare
Choose a tag to compare

Overview

Major Highlights

  • Conformance Tests: Validate your controller’s behavior with end-to-end tests covering InferencePool,
    InferenceModel, HTTPRoute, and more.

  • New Config API: A new Config API which allows the configuration of plugins through a config file without touching core code.

  • Helm Charts: helm chart update to support the reuse of Config API easily.

What's Changed

Read more