Skip to content

Commit ddef28d

Browse files
ariG23498hmellor
andauthored
Update _posts/2025-04-11-transformers-backend.md
Co-authored-by: Harry Mellor <[email protected]>
1 parent 5eedfb4 commit ddef28d

File tree

1 file changed

+0
-3
lines changed

1 file changed

+0
-3
lines changed

_posts/2025-04-11-transformers-backend.md

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -33,9 +33,6 @@ Here is how one can serve a multimodal model using the transformers backend.
3333
```bash
3434
vllm serve llava-hf/llava-onevision-qwen2-0.5b-ov-hf \
3535
--model_impl transformers \
36-
--disable-mm-preprocessor-cache \
37-
--no-enable-prefix-caching \
38-
--no-enable-chunked-prefill
3936
```
4037

4138
To consume the model one can use the `openai` API like so:

0 commit comments

Comments
 (0)