Remove header mask from EVL tail folded VPInterleaveRecipes #150162

@lukel97

After #149981/#150074, we generate tail folded loops with interleaved accesses.

This should work with EVL tail folding, but we still emit a header mask to control the tail. In optimizeMaskToEVL in VPlanTransforms, we should try to remove the mask where possible so that the wide loads/stores can be converted to VP intrinsics.

The wide masked loads and stores are generated inside VPInterleaveRecipe. We could create a VPEVLInterleaveRecipe to generate VP intrinsics instead, or we could look at splitting up VPInterleaveRecipe so that it takes the wide loads/stores as a separate operand, which would avoid creating a new recipe.
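
To illustrate why dropping the header mask should be safe here, the following is a toy scalar model in plain C++, not LLVM code. It assumes a fixed VF of 4, an interleave factor of 2, a header mask of the form `IV + lane < TripCount`, and an EVL of `min(VF, TripCount - IV)`. Every function name is hypothetical. Under those assumptions, a wide masked load whose mask replicates each header-mask bit `Factor` times touches the same elements as an unmasked load limited to `EVL * Factor` elements.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model only: none of these names are LLVM APIs.
constexpr std::size_t VF = 4;     // lanes per vector iteration (assumed)
constexpr std::size_t Factor = 2; // interleave factor (assumed)

// Header mask for tail folding: lane L is active iff IV + L < TripCount.
std::vector<bool> headerMask(std::size_t IV, std::size_t TripCount) {
  std::vector<bool> Mask(VF);
  for (std::size_t L = 0; L < VF; ++L)
    Mask[L] = IV + L < TripCount;
  return Mask;
}

// Explicit vector length: the number of leading active lanes.
std::size_t computeEVL(std::size_t IV, std::size_t TripCount) {
  assert(IV < TripCount && "the loop body only runs while elements remain");
  return std::min(VF, TripCount - IV);
}

// Wide masked load for an interleave group: each header-mask bit is
// replicated Factor times. Inactive elements are modelled as 0 here
// (a simplification chosen for this sketch).
std::vector<int> maskedWideLoad(const std::vector<int> &Mem, std::size_t IV,
                                const std::vector<bool> &Mask) {
  std::vector<int> Wide(VF * Factor, 0);
  for (std::size_t I = 0; I < VF * Factor; ++I)
    if (Mask[I / Factor])
      Wide[I] = Mem[IV * Factor + I];
  return Wide;
}

// VP-style wide load: no mask, only the first EVL * Factor elements are read.
std::vector<int> evlWideLoad(const std::vector<int> &Mem, std::size_t IV,
                             std::size_t EVL) {
  std::vector<int> Wide(VF * Factor, 0);
  for (std::size_t I = 0; I < EVL * Factor; ++I)
    Wide[I] = Mem[IV * Factor + I];
  return Wide;
}

// Runs the tail-folded loop both ways and reports whether the loads agree.
bool maskAndEVLAgree(const std::vector<int> &Mem, std::size_t TripCount) {
  assert(Mem.size() >= TripCount * Factor && "memory must cover all groups");
  for (std::size_t IV = 0; IV < TripCount; IV += VF) {
    std::vector<int> Masked = maskedWideLoad(Mem, IV, headerMask(IV, TripCount));
    std::vector<int> WithEVL = evlWideLoad(Mem, IV, computeEVL(IV, TripCount));
    if (Masked != WithEVL)
      return false;
  }
  return true;
}
```

For example, with a trip count of 7 the second iteration has three active lanes, so both variants read six elements and `maskAndEVLAgree` returns true. The model advances the induction variable by VF each iteration; a real EVL loop would advance by the EVL itself, which this sketch does not try to capture.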
