Commit 77a1c5a

Merge pull request #1249 from rohithreddy0087/patch-1
Remove draft_neuron_config.sequence_parallel_enabled flag in 405b eagle speculation example to avoid compilation error
2 parents 1514f01 + 98a7a85 commit 77a1c5a

1 file changed, +1 -2 lines changed

libraries/nxd-inference/tutorials/trn2-llama3.1-405b-tutorial.rst

Lines changed: 1 addition & 2 deletions
@@ -303,7 +303,6 @@ This example uses the following configuration options:
 draft_neuron_config.trace_tokengen_model = True
 draft_neuron_config.enable_fused_speculation = False
 draft_neuron_config.is_eagle_draft = True
-draft_neuron_config.sequence_parallel_enabled = False
 draft_config = LlamaInferenceConfig(
     draft_neuron_config,
     load_config=load_pretrained_config(draft_model_path)
@@ -358,4 +357,4 @@ This example uses the following configuration options:
 
 
 if __name__ == "__main__":
-    run_llama_generate()
+    run_llama_generate()
