From 0c555cec66d03a2efadcab92c399ef3c029ffadc Mon Sep 17 00:00:00 2001
From: Nir Rozenbaum
Date: Mon, 21 Jul 2025 21:57:25 +0300
Subject: [PATCH] fixed try it out section in quickstart

Signed-off-by: Nir Rozenbaum
---
 site-src/guides/index.md | 38 +++++++++++---------------------------
 1 file changed, 11 insertions(+), 27 deletions(-)

diff --git a/site-src/guides/index.md b/site-src/guides/index.md
index 67681c002..f6b1419a8 100644
--- a/site-src/guides/index.md
+++ b/site-src/guides/index.md
@@ -248,33 +248,17 @@ This quickstart guide is intended for engineers familiar with k8s and model serv
 
    Wait until the gateway is ready.
 
-=== "GPU-Based Model Server"
-
-    ```bash
-    IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
-    PORT=80
-
-    curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
-    "model": "food-review",
-    "prompt": "Write as if you were a critic: San Francisco",
-    "max_tokens": 100,
-    "temperature": 0
-    }'
-    ```
-
-=== "CPU-Based Model Server"
-
-    ```bash
-    IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
-    PORT=80
-
-    curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
-    "model": "Qwen/Qwen2.5-1.5B-Instruct",
-    "prompt": "Write as if you were a critic: San Francisco",
-    "max_tokens": 100,
-    "temperature": 0
-    }'
-    ```
+   ```bash
+   IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
+   PORT=80
+
+   curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
+   "model": "food-review",
+   "prompt": "Write as if you were a critic: San Francisco",
+   "max_tokens": 100,
+   "temperature": 0
+   }'
+   ```
 
 ### Cleanup
 