We need more capable CPU only LLMs for universal compatibility and speed. Please try to add BitNet as a supported model. https://github.com/microsoft/BitNet