[Bug]: Unable to run Qwen3.5 on RTX5090

### Your current environment

<details>
<summary>The output of <code>python collect_env.py</code></summary>

```text
CUDA:12.8
Torch: 2.10
VLLM: 0.17.0
```

</details>


### 🐛 Describe the bug

Running log:
`(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] torch.AcceleratorError: CUDA error: the provided PTX was compiled with an unsupported toolchain.
(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] Search for cudaErrorUnsupportedPtxVersion in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] For debugging consider passing CUDA_LAUNCH_BLOCKING=1
(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
(EngineCore_DP0 pid=384010) ERROR 03-09 13:22:08 [core.py:1100] `



### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bug]: Unable to run Qwen3.5 on RTX5090 #36455

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: Unable to run Qwen3.5 on RTX5090 #36455

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions