[Bug]: torch.opcheck fails for `_C.rms_norm_per_block_quant`

### Your current environment

<details>
<summary>The output of <code>python collect_env.py</code></summary>

```text
Your output of `python collect_env.py` here
```

</details>


### 🐛 Describe the bug

In the unit test for the `torch.ops._C.rms_norm_per_block_quant` custom kernel, `opcheck` fails because it reports that the weight tensor was mutated. A closer look shows something odd: the cloned weight argument is the one that gets modified, while the original weight argument stays intact. I could not find a memory issue. I manually confirmed that the original weight stays intact when opcheck is not used, and E2E evals look good.

```
torch.testing._internal.optests.generate_tests.OpCheckError: opcheck(op, ...): test_schema failed with Argument weight is not defined as mutable but was mutated (scroll up for stack trace)
```

https://github.com/vllm-project/vllm/blob/0ebf4e969b43d99c240fd085703ea1ed97897499/tests/kernels/core/test_fused_quant_layernorm.py#L291-L304
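The manual check mentioned above (comparing each argument before and after calling the op directly, without opcheck) can be sketched roughly as follows. This is only an illustration of that kind of check, not how opcheck itself is implemented. `find_mutated_args`, `Buf`, and `toy_op` are hypothetical names introduced here; `Buf` is a stand-in so the demo runs without torch or a GPU.

```python
from typing import Any, Callable, Sequence


def find_mutated_args(
    op: Callable[..., Any],
    args: Sequence[Any],
    equal: Callable[[Any, Any], bool],
) -> list[int]:
    """Return the positions of arguments whose contents changed after op(*args).

    Every argument that exposes a .clone() method (for example a torch.Tensor)
    is snapshotted before the call and compared with the original afterwards.
    Scalars and other arguments without .clone() are skipped.
    """
    snapshots = {i: a.clone() for i, a in enumerate(args) if hasattr(a, "clone")}
    op(*args)
    return [i for i, before in snapshots.items() if not equal(before, args[i])]


class Buf(list):
    """Hypothetical stand-in for a tensor, used only for this demo."""

    def clone(self) -> "Buf":
        return Buf(self)


def toy_op(out: Buf, x: Buf, weight: Buf) -> None:
    """Hypothetical op: fills `out` as intended, but also clobbers `weight`."""
    out[:] = [a * w for a, w in zip(x, weight)]
    weight[0] = 0.0  # the unintended write this check is meant to catch


out, x, weight = Buf([0.0, 0.0]), Buf([1.0, 2.0]), Buf([3.0, 4.0])
mutated = find_mutated_args(toy_op, (out, x, weight), equal=lambda a, b: a == b)
print(mutated)  # positions of `out` and `weight`
```

With real tensors, the same helper could be called with the argument tuple from the linked test and `equal=torch.equal`. Arguments the kernel is meant to write (presumably `output` and `scales`, possibly `residual`) would be expected in the result; position 2 (`layer.weight`) should not be. Note that `torch.equal` treats NaN as unequal, so NaN-containing tensors would show up as false positives.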

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Code referenced by the permalink above:

	opcheck(
	    torch.ops._C.rms_norm_per_block_quant,
	    (
	        output,
	        x,
	        layer.weight,
	        scales,
	        1e-5,
	        scale_ub,
	        residual,
	        group_size[1],
	        True,  # is_scale_transposed
	    ),
	)
