Skip to content

Conversation

allnes
Copy link
Contributor

@allnes allnes commented Oct 16, 2025

Summary

  • Introduces JIT executors for 3D Convolution and 3D Deconvolution on AArch64 targeting FP16 (5D tensors).
  • Registers impl_desc_type::jit_asimd and prefers JIT on ARM64 for 5D FP16, decoupling from ACL availability.
  • Adds factory-based path selection in deconv.cpp under OPENVINO_ARCH_ARM64.

@github-actions github-actions bot added the category: CPU OpenVINO CPU plugin label Oct 16, 2025
allnes added 15 commits October 16, 2025 15:25
…e `parallel_for3d` for better clarity and efficiency. Adjust FP32 to FP16 conversions for consistency.
…ty by reformatting code, removing unused helpers, and ensuring consistent formatting.
…mproved readability by adopting consistent naming, reformatting, and splitting large code blocks into helper functions.
…64 JIT 3D Deconvolution Executors to optimize initialization and grouped workloads.
…ndex_eo_idx` helper to replace redundant lambda functions, improving code reuse and readability.
…te logic in AArch64 JIT 3D Deconvolution Executor for improved maintainability and clarity.
…paths in AArch64 JIT 3D Deconvolution Executor for cleaner and more maintainable code.
…Deconvolution Executors, remove redundant FP32 paths, and streamline packed weight logic for reduced initialization latency and improved maintainability.
… AArch64 JIT 3D Deconvolution Executor to improve maintainability, clarity, and consistency.
…ecutors by removing redundant comments, obsolete logic, and unused code to improve clarity and maintainability.
…nsolidating repetitive kernel invocation logic into reusable lambda functions, improving code clarity, maintainability, and reducing duplication.
…ive kernel invocation logic into reusable lambda functions, reducing code duplication and improving maintainability.
…ns, and redundant includes to simplify and clean up AArch64 JIT 3D Convolution code.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: CPU OpenVINO CPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant