docs/source/models/supported_models.rst
8 additions & 2 deletions
@@ -345,6 +345,9 @@ Text Embedding
 Some model architectures support both generation and embedding tasks.
 In this case, you have to pass :code:`--task embedding` to run the model in embedding mode.
 
+.. tip::
+    You can override the model's pooling method by passing :code:`--override-pooler-config`.
+
 Reward Modeling
 ---------------
 
@@ -364,7 +367,7 @@ Reward Modeling
     - ✅︎
 
 .. note::
-    As an interim measure, these models are supported via Embeddings API. See `this RFC <https://github.com/vllm-project/vllm/issues/8967>`_ for upcoming changes.
+    As an interim measure, these models are supported in both offline and online inference via Embeddings API.
 
 Classification
 ---------------
@@ -385,7 +388,7 @@ Classification
     - ✅︎
 
 .. note::
-    As an interim measure, these models are supported via Embeddings API. It will be supported via Classification API in the future (no reference APIs exist now).
+    As an interim measure, these models are supported in both offline and online inference via Embeddings API.
 
 
 Multimodal Language Models
@@ -600,6 +603,9 @@ Multimodal Embedding
 Some model architectures support both generation and embedding tasks.
 In this case, you have to pass :code:`--task embedding` to run the model in embedding mode.
 
+.. tip::
+    You can override the model's pooling method by passing :code:`--override-pooler-config`.
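
For readers of this change, the two flags mentioned in the added tips can be combined on one command line. The following is only a rough sketch of how such an invocation might be assembled. Several things here are assumptions not confirmed by the diff: that the server is started with `vllm serve`, that `--override-pooler-config` takes a JSON object, and that the object has a `pooling_type` key. The model name and the `build_serve_args` helper are hypothetical.

```python
import json
import shlex


def build_serve_args(model, pooling_type=None):
    """Build an argv list that runs `model` in embedding mode.

    Hypothetical helper for illustration only; it is not part of vLLM.
    """
    # `--task embedding` is the flag quoted in the documentation text.
    args = ["vllm", "serve", model, "--task", "embedding"]
    if pooling_type is not None:
        # Assumption: the override is passed as a JSON object with a
        # "pooling_type" field. Check the CLI help before relying on this.
        pooler_config = json.dumps({"pooling_type": pooling_type})
        args += ["--override-pooler-config", pooler_config]
    return args


# Hypothetical model name, used only to show the resulting command line.
args = build_serve_args("my-org/my-embedding-model", pooling_type="MEAN")
print(shlex.join(args))
```

Building the argument list first and quoting it with `shlex.join` keeps the JSON value intact when the command is pasted into a shell, which is the usual stumbling block with JSON-valued flags.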