-
Notifications
You must be signed in to change notification settings - Fork 3
Description
I'm running this command (REPO: https://github.com/ROCm/tritoninferenceserver-vllm)
python3 ~/tritoninferenceserver-vllm/build-vllm-docker.py --no-container-pull --enable-rocm --endpoint=grpc --image gpu-base,rocm/pytorch:rocm6.4_ubuntu22.04_py3.10_pytorch_release_2.6.0 --endpoint=http --backend=python --backend=vllm
I've been making changes to the "build-vllm-docker.py" file. I managed to compile the default "tritonserver" by making some changes, but it's quite outdated and doesn't allow vLLM to deploy any models.
I was trying to compile Triton Inference Server with the image "rocm/pytorch:rocm6.4_ubuntu24.04_py3.12_pytorch_release_2.6.0" but couldn't make progress. I'm currently testing with the image "rocm/pytorch:rocm6.4_ubuntu22.04_py3.10_pytorch_release_2.6.0", which is progressing further. As errors arise, I continue to resolve them...
The error I have now is:
95.46 /var/lib/jenkins/libs/flash-attention/csrc/flash_attn_rocm/composable_kernel/include/ck/tensor_operation/gpu/block/blockwise_gemm_xdlops_hip.hpp:870:32: error: no member named 'a_origin' in 'BlockwiseGemmXdlops_v2<BlockSize, FloatAB, FloatAcc, ATileDesc, BTileDesc, AMmaTileDesc, BMmaTileDesc, MPerBlock, NPerBlock, KPerBlock, MPerXDL, NPerXDL, MRepeat, NRepeat, KPack, TransposeC, AMmaKStride, BMmaKStride>'
95.46 870 | : a_thread_copy_(other.a_origin), b_thread_copy_(other.b_origin)
95.46 | ~~~~~ ^
95.46 /var/lib/jenkins/libs/flash-attention/csrc/flash_attn_rocm/composable_kernel/include/ck/tensor_operation/gpu/block/blockwise_gemm_xdlops_hip.hpp:870:64: error: no member named 'b_origin' in 'BlockwiseGemmXdlops_v2<BlockSize, FloatAB, FloatAcc, ATileDesc, BTileDesc, AMmaTileDesc, BMmaTileDesc, MPerBlock, NPerBlock, KPerBlock, MPerXDL, NPerXDL, MRepeat, NRepeat, KPack, TransposeC, AMmaKStride, BMmaKStride>'
95.46 870 | : a_thread_copy_(other.a_origin), b_thread_copy_(other.b_origin)
95.46 | ~~~~~ ^
NOTE: This would be the ideal image to compile
"rocm/pytorch:rocm6.4_ubuntu24.04_py3.12_pytorch_release_2.6.0"