You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I’m building an inference pipeline in C# using ONNX Runtime with an instance segmentation model exported to ONNX.
My dataset contains many small, densely packed objects, so running inference on the full image often increases FN (missed instances), and pushing higher resolution becomes expensive in terms of latency/memory. That’s why I’d like to implement SAHI-like sliced/tiling inference in C# (split image into overlapping tiles → run inference per tile → merge results back to the original image).
However, SAHI is Python-centric and I couldn’t find a solid C# reference, so I’m considering building a minimal SAHI-style C# implementation. Before I start, I’d appreciate guidance on best practices and implementation details:
Goals
Split the input image into overlapping tiles
Run instance segmentation inference per tile with ONNX Runtime
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I’m building an inference pipeline in C# using ONNX Runtime with an instance segmentation model exported to ONNX.
My dataset contains many small, densely packed objects, so running inference on the full image often increases FN (missed instances), and pushing higher resolution becomes expensive in terms of latency/memory. That’s why I’d like to implement SAHI-like sliced/tiling inference in C# (split image into overlapping tiles → run inference per tile → merge results back to the original image).
However, SAHI is Python-centric and I couldn’t find a solid C# reference, so I’m considering building a minimal SAHI-style C# implementation. Before I start, I’d appreciate guidance on best practices and implementation details:
Goals
Split the input image into overlapping tiles
Run instance segmentation inference per tile with ONNX Runtime
Map detections/masks back to original coordinates
Merge tile outputs (NMS / weighted merging / mask-based merging)
I’m not necessarily aiming for a full SAHI port at first—just a practical minimum viable sliced inference + merge that works reliably in production.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions