Skip to content

Commit 8a14e60

Browse files
authored
Add tensor and pipeline parallelism to run config
1 parent a3436f5 commit 8a14e60

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

script/app-mlperf-inference-nvidia/customize.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -703,6 +703,8 @@ def preprocess(i):
703703
else:
704704
run_config += f" --tensor_parallelism={tmp_tp_size}"
705705
run_config += f" --pipeline_parallelism={tmp_pp_size}"
706+
if is_true(env.get('MLC_CUSTOM_CONFIG', '')):
707+
run_config += f" --llm_gen_config_path={env['MLC_MLPERF_INFERENCE_NVIDIA_CODE_PATH']}/code/llama2-70b/tensorrt/generation_config.json"
706708

707709
enable_sort = env.get('MLC_MLPERF_NVIDIA_HARNESS_ENABLE_SORT')
708710
if enable_sort and not is_false(enable_sort):

0 commit comments

Comments
 (0)