Update _posts/2025-04-11-transformers-backend.md

ariG23498 · hmellor · web-flow · commit ddef28ded877 · 2025-07-23T07:16:38.000+05:30
Co-authored-by: Harry Mellor &lt;19981378+hmellor@users.noreply.github.com&gt;
diff --git a/_posts/2025-04-11-transformers-backend.md b/_posts/2025-04-11-transformers-backend.md
@@ -33,9 +33,6 @@ Here is how one can serve a multimodal model using the transformers backend.
 ```bash
 vllm serve llava-hf/llava-onevision-qwen2-0.5b-ov-hf \
 --model_impl transformers \
---disable-mm-preprocessor-cache \
---no-enable-prefix-caching \
---no-enable-chunked-prefill
 ```
 
 To consume the model one can use the `openai` API like so: