Commit 7ef57ac

Michael Kalantar authored
add mesh gateway (#83)
* add mesh gateway
* update references from hub to iter8
* remove reference
* add in cluster instructions

Signed-off-by: Michael Kalantar <[email protected]>
1 parent 71ca094 commit 7ef57ac

File tree

11 files changed

+166
-89
lines changed


docs/tutorials/autox/autox.md

Lines changed: 1 addition & 3 deletions

@@ -52,7 +52,7 @@ kubectl label deployment httpbin app.kubernetes.io/version=1.0.0
 Next, we will configure and install the AutoX controller.

 ```bash
-helm install autox autox --repo https://iter8-tools.github.io/iter8/ --version 0.1.6 \
+helm install autox autox --repo https://iter8-tools.github.io/iter8 --version 0.1.6 \
 --set 'groups.httpbin.trigger.name=httpbin' \
 --set 'groups.httpbin.trigger.namespace=default' \
 --set 'groups.httpbin.trigger.group=apps' \

@@ -180,8 +180,6 @@ AutoX is designed to automate a variety of experiments. For example, instead of

 Furthermore, you can add additional tasks that ship out-of-the-box with Iter8, in order to enrich the experiments. For example, you can add a `slack` task so that your experiment results will be posted on Slack. That way, you can automatically have the latest performance statistics after every update. Here is the [documentation](../../user-guide/tasks/slack.md) for the `slack` task as well as a [tutorial](../../tutorials/integrations/slack.md) for using the Slack task.

-You can also automate experiments that are not from Iter8. For example, a [Litmus Chaos chaos experiment](https://github.com/iter8-tools/iter8/tree/v0.14.5/charts/litmuschaos) is available, which can also be configured with AutoX.
-
 Lastly, recall that you can provide multiple groups and experiment specs so AutoX can launch and manage a whole suite of experiments for multiple Kubernetes applications and namespaces.

 ## Clean up
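Alongside the repo move from `hub` to `iter8`, this commit also drops the trailing slash from the Helm `--repo` URL in several files. The normalization itself can be sketched with plain shell parameter expansion (the `repo_url` variable is illustrative, not part of the docs):

```shell
# Illustrative sketch: strip a trailing slash from a chart repo URL
# so it matches the canonical form used after this commit.
repo_url="https://iter8-tools.github.io/iter8/"
repo_url="${repo_url%/}"   # ${var%/} removes one trailing "/" if present
echo "$repo_url"           # https://iter8-tools.github.io/iter8
```

The same expansion is a no-op when the URL already lacks the slash, so it is safe to apply unconditionally in scripts.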

docs/tutorials/chaos/slo-validation-chaos.md

Lines changed: 1 addition & 1 deletion

@@ -57,7 +57,7 @@ Launch the LitmusChaos and Iter8 experiments as described below.
 === "LitmusChaos"
 ```shell
 helm install httpbin litmuschaos \
---repo https://iter8-tools.github.io/iter8/ \
+--repo https://iter8-tools.github.io/iter8 \
 --set applabel='app=httpbin' \
 --set totalChaosDuration=3600 \
 --set chaosInterval=5

docs/tutorials/integrations/kserve-mm/blue-green.md

Lines changed: 37 additions & 18 deletions

@@ -85,24 +85,43 @@ kubectl get virtualservice -o yaml wisdom

 To send inference requests to the model:

-1. In a separate terminal, port-forward the ingress gateway:
-```shell
-kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
-```
-
-2. Download the proto file and a sample input:
-```shell
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
-```
-
-3. Send inference requests:
-```shell
-cat grpc_input.json | \
-grpcurl -plaintext -proto kserve.proto -d @ \
--authority wisdom.modelmesh-serving \
-localhost:8080 inference.GRPCInferenceService.ModelInfer
-```
+=== "From within the cluster"
+1. Create a "sleep" pod in the cluster from which requests can be made:
+```shell
+curl -s https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/sleep.sh | sh -
+```
+
+2. exec into the sleep pod:
+```shell
+kubectl exec --stdin --tty "$(kubectl get pod --sort-by={metadata.creationTimestamp} -l app=sleep -o jsonpath={.items..metadata.name} | rev | cut -d' ' -f 1 | rev)" -c sleep -- /bin/sh
+```
+
+3. Make inference requests:
+```shell
+cd demo
+cat wisdom.sh
+. wisdom.sh
+```
+
+=== "From outside the cluster"
+1. In a separate terminal, port-forward the ingress gateway:
+```shell
+kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
+```
+
+2. Download the proto file and a sample input:
+```shell
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
+```
+
+3. Send inference requests:
+```shell
+cat grpc_input.json | \
+grpcurl -plaintext -proto kserve.proto -d @ \
+-authority wisdom.modelmesh-serving \
+localhost:8080 inference.GRPCInferenceService.ModelInfer
+```

 Note that the model version responding to each inference request can be determined from the `modelName` field of the response.
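The new in-cluster `kubectl exec` step above selects the most recently created sleep pod: pods are sorted by `creationTimestamp`, jsonpath prints the names space-separated, and `rev | cut -d' ' -f 1 | rev` takes the last name. A standalone sketch of that last-token trick, with made-up pod names:

```shell
# Hypothetical jsonpath output: space-separated pod names, oldest first.
pods="sleep-6d5f9 sleep-7b2c1 sleep-9f4e8"
# Reverse the line, take the first field, reverse back:
# net effect is "last space-separated token", i.e. the newest pod.
newest="$(printf '%s\n' "$pods" | rev | cut -d' ' -f 1 | rev)"
echo "$newest"   # sleep-9f4e8
```

The double `rev` avoids needing to know how many names are in the list; `cut -f 1` on the reversed line is always the last original field.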

docs/tutorials/integrations/kserve-mm/canary.md

Lines changed: 51 additions & 28 deletions

@@ -86,34 +86,57 @@ kubectl get virtualservice -o yaml wisdom

 To send inference requests to the model:

-1. In a separate terminal, port-forward the ingress gateway:
-```shell
-kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
-```
-
-2. Download the proto file and a sample input:
-```shell
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
-```
-
-3. Send inference requests:
-```shell
-cat grpc_input.json | \
-grpcurl -plaintext -proto kserve.proto -d @ \
--authority wisdom.modelmesh-serving \
-localhost:8080 inference.GRPCInferenceService.ModelInfer
-```
-
-or, send request with header `traffic: test`:
-
-```shell
-cat grpc_input.json | \
-grpcurl -plaintext -proto kserve.proto -d @ \
--H 'traffic: test' \
--authority wisdom.modelmesh-serving \
-localhost:8080 inference.GRPCInferenceService.ModelInfer
-```
+=== "From within the cluster"
+1. Create a "sleep" pod in the cluster from which requests can be made:
+```shell
+curl -s https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/sleep.sh | sh -
+```
+
+2. exec into the sleep pod:
+```shell
+kubectl exec --stdin --tty "$(kubectl get pod --sort-by={metadata.creationTimestamp} -l app=sleep -o jsonpath={.items..metadata.name} | rev | cut -d' ' -f 1 | rev)" -c sleep -- /bin/sh
+```
+
+3. Make inference requests:
+```shell
+cd demo
+cat wisdom.sh
+. wisdom.sh
+. wisdom-test.sh
+```
+or, to send a request with header `traffic: test`:
+```shell
+cat wisdom-test.sh
+. wisdom-test.sh
+```
+
+=== "From outside the cluster"
+1. In a separate terminal, port-forward the ingress gateway:
+```shell
+kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
+```
+
+2. Download the proto file and a sample input:
+```shell
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
+```
+
+3. Send inference requests:
+```shell
+cat grpc_input.json | \
+grpcurl -plaintext -proto kserve.proto -d @ \
+-authority wisdom.modelmesh-serving \
+localhost:8080 inference.GRPCInferenceService.ModelInfer
+```
+or, to send a request with header `traffic: test`:
+```shell
+cat grpc_input.json | \
+grpcurl -plaintext -proto kserve.proto -d @ \
+-H 'traffic: test' \
+-authority wisdom.modelmesh-serving \
+localhost:8080 inference.GRPCInferenceService.ModelInfer
+```

 Note that the model version responding to each inference request can be determined from the `modelName` field of the response.

docs/tutorials/integrations/kserve-mm/deleteiter8controller.md

Lines changed: 4 additions & 4 deletions

@@ -8,12 +8,12 @@
 === "Kustomize"
 Delete the Iter8 controller using `kustomize` as follows.

-=== "cluster scoped"
+=== "namespace scoped"
 ```shell
-kubectl delete -k 'https://github.com/iter8-tools/hub.git/kustomize/traffic/clusterScoped?ref=traffic-templates-0.1.1'
+kubectl delete -k 'https://github.com/iter8-tools/iter8.git/kustomize/traffic/namespaceScoped?ref=v0.14.4'
 ```

-=== "namespace scoped"
+=== "cluster scoped"
 ```shell
-kubectl delete -k 'https://github.com/iter8-tools/hub.git/kustomize/traffic/namespaceScoped?ref=traffic-templates-0.1.1'
+kubectl delete -k 'https://github.com/iter8-tools/iter8.git/kustomize/traffic/clusterScoped?ref=v0.14.4'
 ```

docs/tutorials/integrations/kserve-mm/installiter8controller.md

Lines changed: 4 additions & 4 deletions

@@ -8,12 +8,12 @@
 === "Kustomize"
 Install the Iter8 controller using `kustomize` as follows.

-=== "cluster scoped"
+=== "namespace scoped"
 ```shell
-kubectl apply -k 'https://github.com/iter8-tools/hub.git/kustomize/traffic/clusterScoped?ref=traffic-templates-0.1.1'
+kubectl apply -k 'https://github.com/iter8-tools/iter8.git/kustomize/traffic/namespaceScoped?ref=v0.14.4'
 ```

-=== "namespace scoped"
+=== "cluster scoped"
 ```shell
-kubectl apply -k 'https://github.com/iter8-tools/hub.git/kustomize/traffic/namespaceScoped?ref=traffic-templates-0.1.1'
+kubectl apply -k 'https://github.com/iter8-tools/iter8.git/kustomize/traffic/clusterScoped?ref=v0.14.4'
 ```
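The kustomize sources above are remote URLs pinned to a git ref via the `?ref=` query parameter; the commit swaps the repository from `hub` to `iter8` and the ref from `traffic-templates-0.1.1` to `v0.14.4`. A small sketch of composing such a URL from variables (the variable names are illustrative, not part of the docs):

```shell
# Illustrative: assemble a pinned kustomize remote URL.
ref="v0.14.4"
scope="namespaceScoped"   # or "clusterScoped"
url="https://github.com/iter8-tools/iter8.git/kustomize/traffic/${scope}?ref=${ref}"
echo "$url"
# The resulting URL is the form passed to: kubectl apply -k "$url"
```

Pinning `?ref=` to a release tag keeps installs reproducible even as the default branch moves.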

docs/tutorials/integrations/kserve-mm/mirror.md

Lines changed: 37 additions & 18 deletions

@@ -86,24 +86,43 @@ kubectl get virtualservice -o yaml wisdom

 To send inference requests to the model:

-1. In a separate terminal, port-forward the ingress gateway:
-```shell
-kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
-```
-
-2. Download the proto file and a sample input:
-```shell
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
-curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
-```
-
-3. Send inference requests:
-```shell
-cat grpc_input.json | \
-grpcurl -plaintext -proto kserve.proto -d @ \
--authority wisdom.modelmesh-serving \
-localhost:8080 inference.GRPCInferenceService.ModelInfer
-```
+=== "From within the cluster"
+1. Create a "sleep" pod in the cluster from which requests can be made:
+```shell
+curl -s https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/sleep.sh | sh -
+```
+
+2. exec into the sleep pod:
+```shell
+kubectl exec --stdin --tty "$(kubectl get pod --sort-by={metadata.creationTimestamp} -l app=sleep -o jsonpath={.items..metadata.name} | rev | cut -d' ' -f 1 | rev)" -c sleep -- /bin/sh
+```
+
+3. Make inference requests:
+```shell
+cd demo
+cat wisdom.sh
+. wisdom.sh
+```
+
+=== "From outside the cluster"
+1. In a separate terminal, port-forward the ingress gateway:
+```shell
+kubectl -n istio-system port-forward svc/istio-ingressgateway 8080:80
+```
+
+2. Download the proto file and a sample input:
+```shell
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/kserve.proto
+curl -sO https://raw.githubusercontent.com/iter8-tools/docs/v0.13.18/samples/modelmesh-serving/grpc_input.json
+```
+
+3. Send inference requests:
+```shell
+cat grpc_input.json | \
+grpcurl -plaintext -proto kserve.proto -d @ \
+-authority wisdom.modelmesh-serving \
+localhost:8080 inference.GRPCInferenceService.ModelInfer
+```

 Note that the model version responding to each inference request can be determined from the `modelName` field of the response.

docs/user-guide/tasks/github.md

Lines changed: 1 addition & 1 deletion

@@ -31,7 +31,7 @@ See [here](../../tutorials/integrations/ghactions.md#use-iter8-to-trigger-a-gith
 | owner | string | Yes | N/A | Owner of the GitHub repository |
 | repo | string | Yes | N/A | GitHub repository |
 | token | string | Yes | N/A | Authorization token |
-| payloadTemplateURL | string | No | [https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/charts/iter8/templates/_payload-github.tpl](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/charts/iter8/templates/_payload-github.tpl) | URL to a payload template |
+| payloadTemplateURL | string | No | [https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/templates/notify/_payload-github.tpl](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/templates/notify/_payload-github.tpl) | URL to a payload template |
 | softFailure | bool | No | true | Indicates the task and experiment should not fail if the task cannot successfully send the request |
 | if | string | No | N/A | An if condition that can be control when the task is run in a [multi-looped experiment](../../getting-started/concepts.md#runner). To learn more, see [here](#if-parameter). |

docs/user-guide/tasks/slack.md

Lines changed: 2 additions & 2 deletions

@@ -26,13 +26,13 @@ See [here](../../tutorials/integrations/slack.md#use-iter8-to-send-a-message-to-
 | Name | Type | Required | Default value | Description |
 | ---- | ---- | -------- | ------------- | ----------- |
 | url | string | Yes | N/A | URL to the Slack webhook |
-| payloadTemplateURL | string | No | [https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/charts/iter8/templates/_payload-slack.tpl](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/charts/iter8/templates/_payload-slack.tpl) | URL to a payload template |
+| payloadTemplateURL | string | No | [https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/templates/notify/_payload-slack.tpl](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/templates/notify/_payload-slack.tpl) | URL to a payload template |
 | softFailure | bool | No | true | Indicates the task and experiment should not fail if the task cannot successfully send the request |
 | if | string | No | N/A | An if condition that can be control when the task is run in a [multi-looped experiment](../../getting-started/concepts.md#runner). To learn more, see [here](#if-parameter). |

 ## Default payload

-The payload will determine what will be contained in the Slack message. The [default payload template](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/charts/iter8/templates/_payload-slack.tpl) of the `slack` task is to send the experiment report in text form.
+The payload will determine what will be contained in the Slack message. The [default payload template](https://raw.githubusercontent.com/iter8-tools/iter8/v0.14.5/templates/notify/_payload-slack.tpl) of the `slack` task is to send the experiment report in text form.

 However, if you would like to use a different payload template, simply set a `payloadTemplateURL` and Iter8 will not use the default.

docs/user-guide/topics/autox.md

Lines changed: 2 additions & 2 deletions

@@ -13,7 +13,7 @@ The trigger object is specified by providing the name, namespace, and the group-
 See the following example:

 ```bash
-helm install autox autox --repo https://iter8-tools.github.io/iter8/ --version 0.1.6 \
+helm install autox autox --repo https://iter8-tools.github.io/iter8 --version 0.1.6 \
 --set 'groups.myApp.trigger.name=myApp' \
 --set 'groups.myApp.trigger.namespace=default' \
 --set 'groups.myApp.trigger.group=apps' \

@@ -36,7 +36,7 @@ In this example, there is only one experiment group named `myApp` (`groups.myApp
 In this next example, we have augmented the previous example with an additional experiment spec.

 ```bash
-helm install autox autox --repo https://iter8-tools.github.io/iter8/ --version 0.1.6 \
+helm install autox autox --repo https://iter8-tools.github.io/iter8 --version 0.1.6 \
 --set 'groups.myApp.trigger.name=myApp' \
 --set 'groups.myApp.trigger.namespace=default' \
 --set 'groups.myApp.trigger.group=apps' \
