Conversation

grilloandrea6
I noticed that some operations in the current code fail when running without a CUDA-enabled GPU.

  • Timing: The implementation used torch.cuda.Event, which is only available on CUDA devices. It now falls back to the standard Python time module when CUDA is not available, so timing works correctly on both CPU and GPU.

  • jit_forward method: The previous version used torch.matmul(W, input, out=input). On CPU this produces incorrect results because the out tensor aliases an input operand, so elements can be overwritten before they are read. The fix assigns the result to a new tensor instead, avoiding the overlapping write.

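A minimal sketch of the device-aware timing described above (the helper name `timed` and its signature are illustrative, not from the PR):

```python
import time

import torch


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed milliseconds).

    Uses CUDA events when a GPU is available, and falls back to
    time.perf_counter on CPU, as the PR description suggests.
    """
    if torch.cuda.is_available():
        start = torch.cuda.Event(enable_timing=True)
        end = torch.cuda.Event(enable_timing=True)
        start.record()
        result = fn(*args)
        end.record()
        # CUDA launches are asynchronous; synchronize before reading the timer.
        torch.cuda.synchronize()
        elapsed_ms = start.elapsed_time(end)
    else:
        t0 = time.perf_counter()
        result = fn(*args)
        elapsed_ms = (time.perf_counter() - t0) * 1000.0
    return result, elapsed_ms
```

Note that `torch.cuda.synchronize()` is required before `elapsed_time`, since kernel launches return before the GPU finishes.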
The in-place torch.matmul (with out= aliasing an input operand) does not give correct results on CPU.
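A small sketch contrasting the two patterns (the tensor names `W` and `x` are illustrative; the buggy call is left commented out so the snippet runs cleanly):

```python
import torch

W = torch.randn(3, 3)
x = torch.randn(3, 3)

expected = W @ x  # reference result computed without aliasing

# Buggy pattern from the old code: `out` aliases an input operand,
# so on CPU elements of x may be overwritten before they are read.
# torch.matmul(W, x, out=x)

# Fixed pattern per the PR description: write into a fresh tensor
# and rebind the name, avoiding the overlapping write.
x = torch.matmul(W, x)

assert torch.allclose(x, expected)
```

Rebinding `x` allocates a new output tensor, so the multiply reads the original operand intact.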