fix: replace hardcoded .cuda() with dynamic device inference by Mr-Neutr0n · Pull Request #1921 · haotian-liu/LLaVA

Mr-Neutr0n · 2026-02-11T18:29:47Z

Bug

In llava/eval/run_llava.py, input_ids is placed on device via a hardcoded .cuda() call (line 111). This breaks inference on non-CUDA devices (MPS, CPU) and can cause device mismatch errors in multi-GPU setups.

Notably, images_tensor on the preceding line already correctly uses .to(model.device, ...), making the hardcoded .cuda() on input_ids inconsistent.

Fix

Replaced .cuda() with .to(model.device) so input_ids is placed on the same device as the model, matching the pattern already used for images_tensor.

fix: replace hardcoded .cuda() with dynamic device in eval

ca92acb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: replace hardcoded .cuda() with dynamic device inference#1921

fix: replace hardcoded .cuda() with dynamic device inference#1921
Mr-Neutr0n wants to merge 1 commit intohaotian-liu:mainfrom
Mr-Neutr0n:fix/hardcoded-cuda-device

Mr-Neutr0n commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Mr-Neutr0n commented Feb 11, 2026

Bug

Fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant