We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
2 parents 1514f01 + 98a7a85 commit 77a1c5aCopy full SHA for 77a1c5a
libraries/nxd-inference/tutorials/trn2-llama3.1-405b-tutorial.rst
@@ -303,7 +303,6 @@ This example uses the following configuration options:
303
draft_neuron_config.trace_tokengen_model = True
304
draft_neuron_config.enable_fused_speculation = False
305
draft_neuron_config.is_eagle_draft = True
306
- draft_neuron_config.sequence_parallel_enabled = False
307
draft_config = LlamaInferenceConfig(
308
draft_neuron_config,
309
load_config=load_pretrained_config(draft_model_path)
@@ -358,4 +357,4 @@ This example uses the following configuration options:
358
357
359
360
if __name__ == "__main__":
361
- run_llama_generate()
+ run_llama_generate()
0 commit comments