[TRT EP] Fix `trt_load_user_initializer` for large models where weight are not correctly excluded #25502

gedoensmax · 2025-07-23T06:24:27Z

Description

This change respects initializers that are external but already loaded in memory. This is required due to an optimization that leaves it to the backend to read a mapped memory area.
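To make the distinction concrete, here is a minimal sketch of the three places an initializer's bytes can live. `FakeInitializer`, `WeightSource`, `Classify` and `CanPassAsUserWeight` are hypothetical names invented for this illustration; they are not the ONNX Runtime types or the code in this PR.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for an ONNX initializer (NOT the real TensorProto).
struct FakeInitializer {
  std::string name;
  std::vector<std::uint8_t> raw_data;  // inline bytes, empty if none
  bool has_external_data = false;      // payload lives outside the proto
  bool external_in_memory = false;     // external, but already mapped/loaded
};

enum class WeightSource { kInline, kExternalInMemory, kExternalOnDisk };

// Decide where the bytes of an initializer currently live.
inline WeightSource Classify(const FakeInitializer& init) {
  if (init.has_external_data) {
    return init.external_in_memory ? WeightSource::kExternalInMemory
                                   : WeightSource::kExternalOnDisk;
  }
  return WeightSource::kInline;
}

// Assumption for this sketch: a weight can be handed to the backend as a
// user-provided weight whenever its bytes are already addressable in memory.
inline bool CanPassAsUserWeight(const FakeInitializer& init) {
  return Classify(init) != WeightSource::kExternalOnDisk;
}
```

In this model, the case the description refers to is `kExternalInMemory`: the initializer is marked external, yet its bytes are already addressable, so it should be treated like an inline weight.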

@chilo-ms can you help run the CI and merge this change?

gedoensmax · 2025-07-23T06:32:59Z

Can we get this into 1.23, since it is a fix on top of #25409?

Copilot

Pull Request Overview

This PR fixes an issue in the TensorRT execution provider where external initializers (weights) that are already loaded in memory were not being handled correctly for large models. The fix ensures that initializers with external data in memory are properly recognized and processed alongside those with raw data.

- Converts `TensorrtUserWeights` from struct to class with proper accessors to improve encapsulation
- Adds handling for external initializers that are already loaded in memory using `HasExternalDataInMemory` checks
- Updates serialization logic to exclude external data from graph proto when `include_initializer_data` is false
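As a rough illustration of the struct-to-class conversion, the sketch below wraps a weight's name, data pointer and byte size behind read-only accessors. Only `Size()` appears in this conversation (in the build log); the class name `UserWeightsSketch`, the other accessors, the members and the non-owning pointer are assumptions made for this example, not the PR's actual definition.

```cpp
#include <cstdint>
#include <string>
#include <utility>

// Hypothetical sketch of a weights holder with accessors instead of
// public fields. Names other than Size() are assumptions.
class UserWeightsSketch {
 public:
  UserWeightsSketch(std::string name, const void* data, std::int64_t size)
      : name_(std::move(name)), data_(data), size_(size) {}

  const std::string& Name() const { return name_; }
  const void* Data() const { return data_; }
  std::int64_t Size() const { return size_; }

 private:
  std::string name_;
  const void* data_;   // non-owning view of the weight bytes (assumption)
  std::int64_t size_;  // size in bytes
};
```

Keeping the members private means callers cannot change the pointer or the size independently of each other after construction.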

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| tensorrt_execution_provider.h | Converts TensorrtUserWeights to class with proper encapsulation and accessor methods |
| tensorrt_execution_provider.cc | Adds external data handling logic and updates weight processing to use new class interface |
| graph_proto_serializer.cc | Updates serialization condition to handle external data in memory alongside raw data |
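The serialization change can be pictured roughly as follows. This is a sketch under assumptions: `FakeTensor` and `SerializeInitializerSketch` are hypothetical, and the real condition in `graph_proto_serializer.cc` may differ in detail. It only shows the idea that, when initializer data is not requested, both raw data and in-memory external data are left out while the metadata is kept.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a serialized initializer (NOT the real TensorProto).
struct FakeTensor {
  std::string name;
  std::vector<std::int64_t> dims;
  std::string raw_data;             // inline payload, may be empty
  bool external_in_memory = false;  // payload referenced via an in-memory address
};

// Copy an initializer into the output graph. When include_initializer_data
// is false, any payload (raw or in-memory external) is dropped and only
// the name and shape survive.
inline FakeTensor SerializeInitializerSketch(const FakeTensor& src,
                                             bool include_initializer_data) {
  const bool carries_payload = !src.raw_data.empty() || src.external_in_memory;
  if (include_initializer_data || !carries_payload) {
    return src;  // nothing to strip
  }
  FakeTensor out;
  out.name = src.name;
  out.dims = src.dims;
  return out;
}
```

Before a check of this kind, an initializer carrying only an in-memory external reference would have been copied in full, which matches the "not correctly excluded" wording in the PR title.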

onnxruntime/core/providers/tensorrt/tensorrt_execution_provider.cc

jywu-msft · 2025-07-23T07:20:34Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-07-23T07:20:55Z

Azure Pipelines successfully started running 5 pipeline(s).

Co-authored-by: Copilot <[email protected]>

jywu-msft · 2025-07-23T22:51:19Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-07-23T22:51:39Z

Azure Pipelines successfully started running 5 pipeline(s).

jywu-msft · 2025-07-23T22:52:27Z

There are build errors (warnings are treated as errors):
/onnxruntime_src/onnxruntime/core/providers/tensorrt/tensorrt_execution_provider.h:173:3: error: type qualifiers ignored on function return type [-Werror=ignored-qualifiers]
173 | const int64_t Size() const {
| ^~~~~
cc1plus: all warnings being treated as errors
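For context, GCC emits `-Wignored-qualifiers` when a function returns a scalar by value with a top-level `const`, because the qualifier has no effect on the returned value. The fix commit itself is not shown in this conversation; a typical resolution, sketched here with a hypothetical class `SizeHolder`, is simply to drop the `const` from the return type.

```cpp
#include <cstdint>

// Hypothetical minimal class reproducing the pattern from the build log.
class SizeHolder {
 public:
  explicit SizeHolder(std::int64_t size) : size_(size) {}

  // Rejected under -Werror=ignored-qualifiers:
  //   const std::int64_t Size() const { return size_; }
  // The leading const applies to a by-value return and is ignored.

  // Accepted: the trailing const (member function does not modify the
  // object) is kept, the meaningless return qualifier is removed.
  std::int64_t Size() const { return size_; }

 private:
  std::int64_t size_;
};
```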

gedoensmax · 2025-07-24T10:00:08Z

Thanks, should be fixed now. I also fixed some test usage and improved the logging.

jywu-msft · 2025-07-24T15:06:02Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-07-24T15:06:24Z

Azure Pipelines successfully started running 5 pipeline(s).

gedoensmax · 2025-07-25T10:28:40Z

The failing CI seems to be unrelated to my changes; it looks like it was caused by network outages or builder crashes.

snnn · 2025-07-25T16:33:15Z

Hi there! We haven't cut the release branch for this version yet, so I'm removing the release:1.23.0 label for now to keep things tidy. Thanks so much for your contribution! We'll make sure this gets included when the release is prepared. 🤖

respect initializers that are external and already in memory

99bb4de

jywu-msft requested a review from Copilot July 23, 2025 07:19

jywu-msft added the release:1.23.0 label Jul 23, 2025

Copilot AI reviewed Jul 23, 2025

onnxruntime/core/providers/tensorrt/tensorrt_execution_provider.cc (resolved)

onnxruntime/core/providers/tensorrt/tensorrt_execution_provider.cc (outdated, resolved)

make logging message consistent with others

344544e

Co-authored-by: Copilot <[email protected]>

gedoensmax added 2 commits July 24, 2025 11:59

fix linux compilation error

fb6f184

improve user facing logging

f7b2109

jywu-msft approved these changes Jul 25, 2025

jywu-msft merged commit 0905c56 into microsoft:main Jul 25, 2025
90 of 100 checks passed

snnn removed the release:1.23.0 label Jul 25, 2025
