This repository was archived by the owner on Oct 15, 2025. It is now read-only.

Commit c9e16e9

Swap from kgateway to istio as the default in sample app (#307)
- Validated on vanilla and shift k8s.

Signed-off-by: Brent Salisbury <[email protected]>
1 parent f750a8d commit c9e16e9

7 files changed: 13 additions, 10 deletions

charts/llm-d/Chart.yaml

Lines changed: 1 addition & 1 deletion

@@ -1,7 +1,7 @@
 apiVersion: v2
 name: llm-d
 type: application
-version: 1.0.17
+version: 1.0.18
 appVersion: "0.1"
 icon: 
 description: llm-d is a Kubernetes-native high-performance distributed LLM inference framework

charts/llm-d/README.md

Lines changed: 2 additions & 2 deletions

@@ -1,7 +1,7 @@
 
 # llm-d Helm Chart
 
-![Version: 1.0.17](https://img.shields.io/badge/Version-1.0.17-informational?style=flat-square)
+![Version: 1.0.18](https://img.shields.io/badge/Version-1.0.18-informational?style=flat-square)
 ![Type: application](https://img.shields.io/badge/Type-application-informational?style=flat-square)
 
 llm-d is a Kubernetes-native high-performance distributed LLM inference framework
@@ -150,7 +150,7 @@ Kubernetes: `>= 1.30.0-0`
 | gateway.annotations | Additional annotations provided to the Gateway resource | object | `{}` |
 | gateway.enabled | Deploy resources related to Gateway | bool | `true` |
 | gateway.fullnameOverride | String to fully override gateway.fullname | string | `""` |
-| gateway.gatewayClassName | Gateway class that determines the backend used Currently supported values: "kgateway" or "istio" | string | `"kgateway"` |
+| gateway.gatewayClassName | Gateway class that determines the backend used Currently supported values: "kgateway" or "istio" | string | `"istio"` |
 | gateway.nameOverride | String to partially override gateway.fullname | string | `""` |
 | gateway.serviceType | Gateway's service type. Ingress is only available if the service type is set to NodePort. Accepted values: ["LoadBalancer", "NodePort"] | string | `"NodePort"` |
 | global | Global parameters Global Docker image parameters Please, note that this will override the image parameters, including dependencies, configured to use the global value Current available global Docker image parameters: imageRegistry, imagePullSecrets and storageClass | object | See below |

charts/llm-d/templates/tests/test-connection.yaml

Lines changed: 6 additions & 3 deletions

@@ -7,6 +7,7 @@ metadata:
     helm.sh/hook: test
     helm.sh/hook-weight: "1" # Deploy after other resources, implicit weight is 0
 spec:
+  restartPolicy: Never
   securityContext:
     seccompProfile:
       type: RuntimeDefault
@@ -36,12 +37,14 @@ spec:
         - |
           echo -e "\e[32m🥷 Waiting for pods to come up\e[0m"
           echo ""
-          curl --connect-timeout 5 --max-time 20 --retry 20 --retry-delay 10 --retry-max-time 60 --retry-all-errors http://{{ include "gateway.fullname" . }}/v1/models
+          curl --connect-timeout 5 --max-time 20 --retry 20 --retry-delay 10 --retry-max-time 60 --retry-all-errors http://{{ include "gateway.fullname" . }}-istio/v1/models
 
           echo ""
           echo ""
           echo -e "\e[32m🥷 Basic chat validation\e[0m"
           echo ""
-          curl --connect-timeout 5 --max-time 20 --retry 20 --retry-delay 10 --retry-max-time 60 --retry-all-errors http://{{ include "gateway.fullname" . }}/v1/chat/completions -H 'accept: application/json' -H 'Content-Type: application/json' -d '{"model":"food-review","messages": [{"content":"Say hi", "role": "user"}],"stream": false}'
-  restartPolicy: Never
+          curl --connect-timeout 5 --max-time 20 --retry 20 --retry-delay 10 --retry-max-time 60 --retry-all-errors http://{{ include "gateway.fullname" . }}-istio/v1/chat/completions \
+            -H 'accept: application/json' \
+            -H 'Content-Type: application/json' \
+            -d '{"model":"food-review","messages":[{"content":"Say hi","role":"user"}],"stream":false}'
 {{- end }}

charts/llm-d/values.schema.json

Lines changed: 1 addition & 1 deletion

@@ -68,7 +68,7 @@
   "title": "fullnameOverride"
 },
 "gatewayClassName": {
-  "default": "kgateway",
+  "default": "istio",
   "description": "Gateway class that determines the backend used Currently supported values: \"kgateway\" or \"istio\"",
   "required": [],
   "title": "gatewayClassName"

charts/llm-d/values.schema.tmpl.json

Lines changed: 1 addition & 1 deletion

@@ -68,7 +68,7 @@
   "title": "fullnameOverride"
 },
 "gatewayClassName": {
-  "default": "kgateway",
+  "default": "istio",
   "description": "Gateway class that determines the backend used Currently supported values: \"kgateway\" or \"istio\"",
   "required": [],
   "title": "gatewayClassName"

charts/llm-d/values.yaml

Lines changed: 1 addition & 1 deletion

@@ -187,7 +187,7 @@ gateway:
 
   # -- Gateway class that determines the backend used
   # Currently supported values: "kgateway" or "istio"
-  gatewayClassName: kgateway
+  gatewayClassName: istio
 
   # @schema
   # additionalProperties: true
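
Because this hunk changes the chart default, deployments that relied on the old default would now get `istio` unless they pin the value. A minimal sketch of pinning it explicitly follows. The function name `install_with_gateway_class`, the release name `llm-d`, the chart path `charts/llm-d`, and the `HELM` override variable are assumptions for illustration and are not part of this commit.

```shell
# Hypothetical helper (not part of the commit): install the chart with an
# explicit gateway class instead of relying on the chart default.
install_with_gateway_class() {
  gateway_class="${1:-istio}"

  # Only the two values documented in values.yaml are accepted here.
  case "${gateway_class}" in
    istio|kgateway) ;;
    *)
      echo "unsupported gatewayClassName: ${gateway_class}" >&2
      return 1
      ;;
  esac

  # HELM can be overridden (for example HELM=echo) to inspect the command
  # without touching a cluster. Release name and chart path are assumed.
  "${HELM:-helm}" upgrade --install llm-d charts/llm-d \
    --set "gateway.gatewayClassName=${gateway_class}"
}
```

Calling `install_with_gateway_class kgateway` would keep the pre-commit behavior under these assumptions. Note that the chart's test hook in this commit targets the `-istio` service name, so the Helm test may not pass as written with `kgateway`.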

quickstart/test-request.sh

Lines changed: 1 addition & 1 deletion

@@ -174,7 +174,7 @@ validation() {
 
 # ── Minikube gateway validation ───────────────────────────────────────────────
 minikube_validation() {
-  SVC_HOST="llm-d-inference-gateway.${NAMESPACE}.svc.cluster.local:80"
+  SVC_HOST="llm-d-inference-gateway-istio.${NAMESPACE}.svc.cluster.local:80"
   echo "Minikube validation: hitting gateway DNS at ${SVC_HOST}"
 
   # 1) GET /v1/models via DNS gateway
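
The hunk above hard-codes the `-istio` suffix, which the diff suggests is how the gateway Service is named under the `istio` class, while the pre-commit `kgateway` name had no suffix. One way the host could be derived from the gateway class instead is sketched below. The function name `gateway_svc_host` and its arguments are hypothetical; the two hostnames are taken from the removed and added lines of this hunk.

```shell
# Hypothetical helper (not part of the commit): build the in-cluster gateway
# host from the namespace and gateway class rather than hard-coding a suffix.
gateway_svc_host() {
  namespace="$1"
  gateway_class="${2:-istio}"   # matches the new chart default
  base="llm-d-inference-gateway"

  case "${gateway_class}" in
    istio)
      # Name used after this commit: "-istio" is appended to the gateway name.
      echo "${base}-istio.${namespace}.svc.cluster.local:80"
      ;;
    kgateway)
      # Name used before this commit: the plain gateway name.
      echo "${base}.${namespace}.svc.cluster.local:80"
      ;;
    *)
      echo "unsupported gateway class: ${gateway_class}" >&2
      return 1
      ;;
  esac
}
```

With such a helper, `SVC_HOST="$(gateway_svc_host "${NAMESPACE}" istio)"` would yield the same value as the added line, and passing `kgateway` would yield the removed one.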
