Commit a259eff

Adding ccl_enabled flag during model loading and passing CCL lists during compilation process
Signed-off-by: Vahid Janfaza <[email protected]>
1 parent 6dc9d41 commit a259eff

2 files changed: +2 -2 lines changed

examples/performance/compute_context_length/gemma3.py

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@
     model_id,
     config=config,
     attn_implementation="eager",
-    kv_offload=False,
+    kv_offload=True,
     qaic_config={
         "ccl_enabled":True,
     },

examples/performance/compute_context_length/molmo.py

Lines changed: 1 addition & 1 deletion
@@ -33,7 +33,7 @@
 
 qeff_model = QEFFAutoModelForCausalLM.from_pretrained(
     model_id,
-    kv_offload=False,
+    kv_offload=True,
     trust_remote_code=True,
     config=config,
     qaic_config={
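
For reference, the loading arguments after this change can be sketched in isolation. This is a minimal, library-free illustration: `build_load_kwargs` is a hypothetical helper that does not exist in the repository, `config=None` is a stand-in for the real model config, and the `qaic_config` contents for molmo.py are left empty because the hunk above is cut off before they appear.

```python
# Hypothetical helper (not part of the repository): gathers the keyword
# arguments that the example scripts pass to from_pretrained after this commit.
def build_load_kwargs(config, qaic_config, **extra):
    kwargs = {
        "config": config,
        "kv_offload": True,          # changed from False by this commit
        "qaic_config": dict(qaic_config),
    }
    kwargs.update(extra)             # script-specific arguments
    return kwargs


# gemma3.py: values taken from the hunk shown above.
gemma3_kwargs = build_load_kwargs(
    config=None,                     # placeholder for the real config object
    qaic_config={"ccl_enabled": True},
    attn_implementation="eager",
)

# molmo.py: the qaic_config body is not visible in the diff, so it stays empty here.
molmo_kwargs = build_load_kwargs(
    config=None,
    qaic_config={},
    trust_remote_code=True,
)
```

In the actual scripts these arguments are passed directly to `from_pretrained(model_id, ...)`; the helper only makes the before/after difference easy to inspect.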
