Describe the issue
As in the title: is there a method for 2-bit quantization on Intel iGPUs?
Put differently, can this hardware support 2-bit quantization? Are there any other open-source frameworks (OpenVINO, PyTorch) that have implemented 2-bit quantized inference on an iGPU?
My iGPU is:
Intel(R) Arc(TM) 140V GPU (16GB)
or:
Intel(R) Arc(TM) 140T GPU (16GB)
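
To make the question concrete, here is a minimal sketch of what I mean by 2-bit weight quantization at the storage level. It is plain Python, not an OpenVINO or PyTorch API. The function names (`quantize_2bit`, `pack_2bit`, `unpack_2bit`, `dequantize_2bit`) and the simple per-group min/max scheme are assumptions chosen for illustration, and the sketch says nothing about whether an iGPU kernel exists for this format.

```python
def quantize_2bit(weights):
    """Map floats to integer codes 0..3 using one scale and offset per group."""
    lo, hi = min(weights), max(weights)
    # 2 bits give 4 levels, i.e. 3 steps; fall back to 1.0 if all weights are equal.
    scale = (hi - lo) / 3 or 1.0
    codes = [min(3, max(0, round((w - lo) / scale))) for w in weights]
    return codes, scale, lo


def pack_2bit(codes):
    """Pack four 2-bit codes into each byte, lowest bits first."""
    padded = codes + [0] * (-len(codes) % 4)  # pad to a multiple of 4
    out = bytearray()
    for i in range(0, len(padded), 4):
        byte = 0
        for j, code in enumerate(padded[i:i + 4]):
            byte |= (code & 0b11) << (2 * j)
        out.append(byte)
    return bytes(out)


def unpack_2bit(packed, count):
    """Recover the first `count` 2-bit codes from packed bytes."""
    codes = [(byte >> (2 * j)) & 0b11 for byte in packed for j in range(4)]
    return codes[:count]


def dequantize_2bit(codes, scale, lo):
    """Approximate the original floats from codes, scale, and offset."""
    return [lo + code * scale for code in codes]


weights = [-1.0, -0.4, 0.3, 1.0, 0.1]
codes, scale, lo = quantize_2bit(weights)
packed = pack_2bit(codes)          # 5 weights fit in 2 bytes
restored = dequantize_2bit(unpack_2bit(packed, len(weights)), scale, lo)
```

What I am asking is whether any runtime can run inference directly on weights stored like `packed` on the iGPUs listed above.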