[RFC][LV] Add support for speculative loads in loops that may fault #151300

arcbbb · 2025-07-30T09:54:24Z

This patch enables vectorization of early-exit loops containing a single potentially faulting load, by supporting unit-stride first-faulting loads via the vp.load.ff intrinsic (as introduced in #128593 and cherry-picked here)

Key changes

Relax legality check: Allow a single non-dereferenceable load in the loop header for early-exit loops.
Tail-folding support for early-exit loop
First-faulting load support
Add VPWidenFFLoadRecipe, which defines two results: the loaded data and the updated vector length.
Lower VPWidenFFLoad to VPWidenFFLoadEVL in the EVL transform.
EVL propagation
The data flow of the header mask does not reflect control flow order. To handle vector length changes introduced by WidenFFLoadEVL, control flow is used to adjust the EVL update logic. Recipes and basic blocks following WidenFFLoadEVL must now consume its returned vector length.

Note: While some changes might be better split into separate PRs for review clarity, presenting them together provides a complete overview and helps identify potential blockers.

And it depends on the following conditions being met:

The speculative load must reside in the loop header (non-predicated), avoiding the need for EVL phis across multiple predecessors.
Tail-folding kind must be DataWithEVL, and the predicate kind must be PredicateOrDontVectorize.

arcbbb · 2025-07-30T09:54:44Z

Regarding EVL propagation, I just noticed that header mask handling is fixed in #150202. This fix will need to be incorporated accordingly.

github-actions · 2025-07-30T09:57:31Z

⚠️ C/C++ code formatter, clang-format found issues in your code. ⚠️

You can test this locally with the following command:

git-clang-format --diff HEAD~1 HEAD --extensions h,cpp -- llvm/include/llvm/Analysis/TargetTransformInfo.h llvm/include/llvm/Analysis/TargetTransformInfoImpl.h llvm/include/llvm/CodeGen/SelectionDAG.h llvm/include/llvm/CodeGen/SelectionDAGNodes.h llvm/include/llvm/Transforms/Vectorize/LoopVectorizationLegality.h llvm/lib/Analysis/TargetTransformInfo.cpp llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h llvm/lib/IR/IntrinsicInst.cpp llvm/lib/Target/RISCV/RISCVISelLowering.cpp llvm/lib/Target/RISCV/RISCVISelLowering.h llvm/lib/Target/RISCV/RISCVTargetTransformInfo.h llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp llvm/lib/Transforms/Vectorize/LoopVectorize.cpp llvm/lib/Transforms/Vectorize/VPlan.h llvm/lib/Transforms/Vectorize/VPlanAnalysis.cpp llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp llvm/lib/Transforms/Vectorize/VPlanValue.h llvm/lib/Transforms/Vectorize/VPlanVerifier.cpp llvm/unittests/IR/VPIntrinsicTest.cpp

View the diff from clang-format here.

diff --git a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
index 557bd838d..32bb13644 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
@@ -2282,8 +2282,8 @@ static VPValue *transformRecipestoEVLRecipes(VPlan &Plan) {
         continue;
       }
       // TODO: Split optimizeMaskToEVL out and move into
-      // VPlanTransforms::optimize. transformRecipestoEVLRecipes should be run in
-      // tryToBuildVPlanWithVPRecipes beforehand.
+      // VPlanTransforms::optimize. transformRecipestoEVLRecipes should be run
+      // in tryToBuildVPlanWithVPRecipes beforehand.
       VPRecipeBase *EVLRecipe =
           optimizeMaskToEVL(Plan, CurRecipe, TypeInfo, *AllOneMask, *LastEVL);
       if (!EVLRecipe)

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

llvm/include/llvm/CodeGen/SelectionDAG.h

llvm/lib/Target/RISCV/RISCVISelLowering.cpp

llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

alexey-bataev · 2025-07-30T13:01:29Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

+    if (!EnableSpeculativeLoads) {
+      reportVectorizationFailure("Auto-vectorization of loops with speculative "
+                                 "load is not enabled",
+                                 "SpeculativeLoadsDisabled", ORE, L);
+      return false;
+    }


Check this first?

Updated. Thanks!

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp

david-arm · 2025-07-30T13:13:43Z

I think it would be good for reviewers if this patch was split up into several parts, probably roughly in this order:

Codegen specific parts related to the RISCV intrinsic,
A standalone loop vectorisation legality patch similar to [LV] Add initial legality checks for ee loops with stores #145663,
A loop vectoriser patch that starts vectorising the loops you're interested in.

arcbbb · 2025-08-01T13:05:52Z

I think it would be good for reviewers if this patch was split up into several parts, probably roughly in this order:

Codegen specific parts related to the RISCV intrinsic,

A standalone loop vectorisation legality patch similar to [LV] Add initial legality checks for ee loops with stores #145663,

A loop vectoriser patch that starts vectorising the loops you're interested in.

Thanks for outlining the steps. That’s very helpful! I’ll follow this order for splitting up the patch.

arcbbb added 4 commits July 30, 2025 01:56

Cherry-pick vp.load.ff intrinsic support

52fa48f

Enable earlyexit loop with EVL

5550dda

Add WidenFFLoad

51d5a9e

Update tests after rebase

16ea10e

arcbbb requested review from fhahn, alexey-bataev and david-arm July 30, 2025 09:54

Meinersbur mentioned this pull request Jul 30, 2025

[MLIR][OpenMP] Add canonical loop LLVM-IR lowering #147069

Merged

arcbbb requested review from lukel97 and Mel-Chen July 30, 2025 10:04

david-arm requested a review from huntergr-arm July 30, 2025 10:08

arcbbb mentioned this pull request Jul 30, 2025

[VP][RISCV] Add a vp.load.ff intrinsic for fault only first load. #128593

Merged

alexey-bataev reviewed Jul 30, 2025

View reviewed changes

arcbbb changed the title ~~[LV] Add support for speculative loads in loops that may fault~~ [RFC][LV] Add support for speculative loads in loops that may fault Jul 31, 2025

arcbbb added 3 commits August 1, 2025 00:16

clang-formatted

b1f0b92

Refine codegen

855874d

Address comments

34b8c6e

arcbbb added 2 commits August 4, 2025 01:36

Expand the description on DataWithEVL dependency

71c04cb

Check EnableSpeculativeLoads first

d044e7a

arcbbb mentioned this pull request Aug 7, 2025

[LV] Add initial legality checks for loops with unbound loads. #152422

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC][LV] Add support for speculative loads in loops that may fault #151300

[RFC][LV] Add support for speculative loads in loops that may fault #151300

Uh oh!

arcbbb commented Jul 30, 2025

Uh oh!

arcbbb commented Jul 30, 2025

Uh oh!

github-actions bot commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexey-bataev Jul 30, 2025

Uh oh!

arcbbb Aug 4, 2025

Uh oh!

Uh oh!

Uh oh!

david-arm commented Jul 30, 2025

Uh oh!

arcbbb commented Aug 1, 2025

Uh oh!

Uh oh!

[RFC][LV] Add support for speculative loads in loops that may fault #151300

Are you sure you want to change the base?

[RFC][LV] Add support for speculative loads in loops that may fault #151300

Uh oh!

Conversation

arcbbb commented Jul 30, 2025

Uh oh!

arcbbb commented Jul 30, 2025

Uh oh!

github-actions bot commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexey-bataev Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

arcbbb Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

david-arm commented Jul 30, 2025

Uh oh!

arcbbb commented Aug 1, 2025

Uh oh!

Uh oh!