[tmva][sofie] Optimize ROperator_Tile with a direct-mapping algorithm #19603

olia110 · 2025-08-11T11:19:58Z

This Pull request:

This PR improves the performance of the ROperator_Tile by replacing its code generation logic.
The previous implementation used an iterative method with multiple loops and std::copy operations.

The new implementation uses a faster direct-mapping algorithm. It pre-calculates memory strides and then uses a single loop to compute the source index for each destination element.

Checklist:

tested changes locally
updated the docs (if necessary)

sanjibansg

A good approach to optimizing the Tile operator, thanks, some comments:

tmva/sofie/inc/TMVA/ROperator_Tile.hxx

sanjibansg · 2025-08-19T12:54:37Z

tmva/sofie/inc/TMVA/ROperator_Tile.hxx

+
+      const int rank = fShapeInput.size();
+
+      out << SP << "const int input_shape[" << rank << "] = " << ConvertShapeToString(fShapeInput) << ";\n";


maybe better to use size_t here instead of just int.

cc: @lmoneta

sanjibansg · 2025-08-19T12:58:22Z

tmva/sofie/inc/TMVA/ROperator_Tile.hxx

+
+      // For each output element, calculating the corresponding input element's index.
+      out << SP << SP << "for (int i = 0; i < " << rank << "; ++i) {\n";
+      out << SP << SP << SP << "const int out_coord = current_idx / output_strides[i];\n";


could we avoid these repetitive division steps since they are more expensive.

Updated and faster ROperator_Tile

9409d78

olia110 requested a review from lmoneta as a code owner August 11, 2025 11:19

couet assigned lmoneta Aug 11, 2025

sanjibansg requested changes Aug 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[tmva][sofie] Optimize ROperator_Tile with a direct-mapping algorithm #19603

[tmva][sofie] Optimize ROperator_Tile with a direct-mapping algorithm #19603

Uh oh!

olia110 commented Aug 11, 2025 •

edited

Loading

Uh oh!

sanjibansg left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sanjibansg Aug 19, 2025

Uh oh!

sanjibansg Aug 19, 2025

Uh oh!

Uh oh!


		const int rank = fShapeInput.size();

		out << SP << "const int input_shape[" << rank << "] = " << ConvertShapeToString(fShapeInput) << ";\n";

[tmva][sofie] Optimize ROperator_Tile with a direct-mapping algorithm #19603

Are you sure you want to change the base?

[tmva][sofie] Optimize ROperator_Tile with a direct-mapping algorithm #19603

Uh oh!

Conversation

olia110 commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This Pull request:

Checklist:

Uh oh!

sanjibansg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sanjibansg Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

sanjibansg Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

olia110 commented Aug 11, 2025 •

edited

Loading