-
Notifications
You must be signed in to change notification settings - Fork 49
Update quick-tune lists for Strix and Navi48 #2073
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: quick_tune_code_review
Are you sure you want to change the base?
Conversation
6c69571 to
313c50f
Compare
9418d5a to
19028bb
Compare
313c50f to
8a091e6
Compare
19028bb to
9091cef
Compare
8a091e6 to
c51b8db
Compare
424cb38 to
f78e931
Compare
c51b8db to
f517713
Compare
f78e931 to
bb22001
Compare
f517713 to
2dd7ba7
Compare
bb22001 to
c6eec8e
Compare
2dd7ba7 to
d3a9398
Compare
8fb43c1 to
b249779
Compare
d3a9398 to
c9365e3
Compare
b249779 to
c6eec8e
Compare
c9365e3 to
d3a9398
Compare
c6eec8e to
1e55ac8
Compare
d3a9398 to
e0cda50
Compare
| {"gfx950_conv_i8", {PopulateParamsXDL::initParametersForwardI8ConvGfx950, PopulateParamsXDL::nInitParametersForwardI8ConvGfx950}}, | ||
| {"gfx1000_gemm_f16", {PopulateParamsWmma::initParametersFp16GemmGfx1000, PopulateParamsWmma::nInitParametersFp16GemmGfx1000}}, | ||
| {"gfx1100_gemm_f16", {PopulateParamsWmma::initParametersFp16GemmGfx1100, PopulateParamsWmma::nInitParametersFp16GemmGfx1100}}, | ||
| {"gfx1200_gemm_f16", {PopulateParamsWmma::initParametersFp16GemmGfx1200, PopulateParamsWmma::nInitParametersFp16GemmGfx1200}}, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does the gfx1200 fall back to gfx1201? The gfx1200 does exist and should be supported.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it will fallback to gfx1201. The fallback logic is implemented in #1994, and it basically looks for the closest arch if it can't find an exact match. The old list was also based on gfx1201, it was just generically named gfx1200 because it would be used for any gfx12**, that's why it is redundant now.
We can tune for gfx1200 as well, if needed.
1e55ac8 to
ba5e67f
Compare
097b46c to
9f5cc09
Compare
ba5e67f to
1d11bb4
Compare
9f5cc09 to
500fe5f
Compare
500fe5f to
a56a4d1
Compare
a56a4d1 to
86b2118
Compare
Motivation
Update gemm and conv quick-tune lists for:
Technical Details
Test Plan
Test Result
Submission Checklist