Skip to content

Conversation

kaixuanliu
Copy link

For Intel XPU case, use MatMul8bitFp is faster than MatMul8bitLt. And it can avoid the datatype overflow issue in L105

@yao-matrix
Copy link

@matthewdouglas , could u pls help review, it will make int8 LoRA finetuning loss overflow to nan on XPU, thx very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants