[Bug]: vllm serve for Qwen3-Reranker-8B, report warning and return no data.

### Your current environment

vLLM API server version 0.10.1.dev73+g7728dd77b


### 🐛 Describe the bug


Running cmd:
VLLM_CPU_OMP_THREADS_BIND="36-47" VLLM_USE_MODELSCOPE=1 VLLM_USE_V1=1 VLLM_CPU_KVCACHE_SPACE=32 vllm serve ~/Qwen3-Reranker-8B/ --served-model-name Qwen3-Reranker-8B --port 8080 --trust-remote-code

```
INFO:     Started server process [5675]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     10.112.41.17:10649 - "GET /v1/models HTTP/1.1" 200 OK
WARNING 07-28 04:05:12 [logger.py:71] To indicate that the rerank API is not part of the standard OpenAI API, we have located it at `/rerank`. Please update your client accordingly. (Note: Conforms to JinaAI rerank API)
INFO:     10.112.41.17:10900 - "POST /v1/rerank HTTP/1.1" 200 OK
INFO:     10.112.41.17:10941 - "POST /v1/rerank HTTP/1.1" 200 OK
INFO:     10.112.41.17:10950 - "POST /v1/rerank HTTP/1.1" 200 OK

```

VLLM reports the warning, and returns no data of reranked result.


### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: vllm serve for Qwen3-Reranker-8B, report warning and return no data. #21706

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: vllm serve for Qwen3-Reranker-8B, report warning and return no data. #21706

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions