
Conversation


@sunxiaoxia2022 sunxiaoxia2022 commented Oct 10, 2025

Description

Add eagle3 pipeline

Ticket: CVS-170888

Checklist:

  • Tests have been updated or added to cover the new code
  • This patch fully addresses the ticket.
  • I have made corresponding changes to the documentation

@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

This PR adds eagle3 pipeline support for speculative decoding in the who_what_benchmark tool. The changes enable users to configure and use draft models for speculative decoding with various configuration options.

  • Added command-line arguments for speculative decoding configuration including draft model path, device, and eagle3 mode
  • Modified text generation functions to use a unified generation config object instead of individual parameters
  • Updated model loader to support draft model configuration and speculative decoding setup

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| wwb.py | Added CLI arguments for speculative decoding and eagle3 mode; updated generation config handling |
| text_evaluator.py | Modified generation function signatures to use a generation config object |
| model_loaders.py | Added draft model loading and configuration support for speculative decoding |


```python
tokenizer is not None and tokenizer.chat_template is not None and not args.omit_chat_template
)

gen_config = openvino_genai.GenerationConfig()
```
Contributor

please, import openvino_genai and create GenerationConfig only if --genai option is set
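A minimal sketch of the pattern the reviewer is suggesting (the helper name `get_generation_config` and the `args.genai` flag are assumptions for illustration, not the actual wwb code): defer the `openvino_genai` import so runs without `--genai` never require the package.

```python
def get_generation_config(args):
    """Build a GenerationConfig only for GenAI runs (hypothetical helper).

    Returns None when --genai is not set, so openvino_genai is never
    imported on the Optimum/Transformers path.
    """
    if not getattr(args, "genai", False):
        return None
    # Deferred import: only needed when the GenAI pipeline is used.
    import openvino_genai
    gen_config = openvino_genai.GenerationConfig()
    gen_config.max_new_tokens = args.max_new_tokens
    return gen_config
```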

Contributor

you can create and set generation config once when you create the GenAI pipeline in model_loaders.py

Author

OK, updated.

@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.



@MaximProshin
Collaborator

@sunxiaoxia2022 , please share wwb Similarity numbers for eagle3 models from #2740

@sunxiaoxia2022
Author

sunxiaoxia2022 commented Oct 21, 2025

> @sunxiaoxia2022 , please share wwb Similarity numbers for eagle3 models from #2740

Hi @MaximProshin
Test platform: LNL (Intel Core Ultra 7 258V), Windows
Models:

  1. llama-3.1-8b-instruct:
    target model: meta-llama/Llama-3.1-8B-Instruct
    Eagle3 draft model: yuhuili/EAGLE3-LLaMA3.1-Instruct-8B
  2. qwen3-8b:
    target model: Qwen/Qwen3-8B
    Eagle3 draft model: Tengyunw/qwen3_8b_eagle3

The similarity numbers are as follows:

| model | precision | prompt | base similarity | eagle3 pipeline similarity |
| --- | --- | --- | --- | --- |
| llama-3.1-8b-instruct | INT4 | short | 0.935972 | 0.935842 |
| llama-3.1-8b-instruct | INT4 | long | 0.923789 | 0.918256 |
| qwen3-8b | INT4 | short | 0.935486 | 0.935193 |
| qwen3-8b | INT4 | long | 0.913537 | 0.914053 |

@MaximProshin MaximProshin self-requested a review October 21, 2025 06:20
@moslex moslex added this to the 2025.4 milestone Oct 21, 2025
@moslex moslex added the priority: high (High priority) label Oct 21, 2025
@apaniukov
Contributor

> [quotes @sunxiaoxia2022's full reply and similarity table from the comment above]

What are num-assistant-tokens and assistant-confidence-threshold?
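For readers following the thread: in speculative (assisted) generation, `num_assistant_tokens` is a fixed number of candidate tokens the draft model proposes per step, while `assistant_confidence_threshold` instead lets the draft stop proposing early once its own probability for the next token falls below the threshold. A toy, library-free sketch of that drafting loop, under those assumptions (`draft_next` is a hypothetical stand-in for one draft-model step; this is not the openvino_genai implementation):

```python
def draft_candidates(draft_next, prefix, num_assistant_tokens=None,
                     assistant_confidence_threshold=None):
    """Propose candidate tokens like a speculative-decoding draft step.

    draft_next(tokens) -> (token, prob): stand-in for one draft-model step.
    Exactly one of the two knobs is normally set, mirroring how the
    static and confidence-based modes are alternatives.
    """
    candidates = []
    limit = num_assistant_tokens or 16  # safety cap in threshold mode
    while len(candidates) < limit:
        token, prob = draft_next(prefix + candidates)
        if (assistant_confidence_threshold is not None
                and prob < assistant_confidence_threshold):
            break  # draft is no longer confident; hand back to the target model
        candidates.append(token)
    return candidates
```

The target model then verifies the candidates in a single forward pass and keeps the longest accepted prefix, which is where the speed-up comes from.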

@Wovchena
Collaborator

Wovchena commented Oct 21, 2025

Why doesn't the similarity match exactly between base and eagle pipelines? I thought the expert model should be exactly the same

@sbalandi
Contributor

sbalandi commented Oct 21, 2025

LGTM for the speculative decoding part in wwb, but the Similarity numbers still need clarification.

9 participants