
Conversation

@brian-dellabetta (Collaborator) commented Sep 3, 2025

SUMMARY:
Resolves #1795

Currently, we initialize a processor in the entrypoint pre_process even when one isn't provided, although a processor isn't needed for data-free recipes like FP8_DYNAMIC or W4A16. This causes downstream user issues like #1795. This PR updates pre-processing to (see the sketch below):

  • wrap processor initialization in a try/except
  • error out if initialization fails and a processor is required (i.e., if a dataset is needed for training/calibration)
  • otherwise, log a warning if an output_dir is provided, because the processor will not be saved alongside the trained/compressed model.
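A minimal sketch of the resulting flow, assuming dataclass-style args objects. The helper `initialize_processor_from_path` is a stand-in mirroring this repo's naming; the exact signatures are illustrative, not copied from the diff:

```python
import warnings


def initialize_processor_from_path(model_args, model_path):
    # Stand-in for the repo helper: load a processor/tokenizer for the model.
    from transformers import AutoProcessor

    return AutoProcessor.from_pretrained(model_path)


def pre_process(model_args, dataset_args=None, output_dir=None):
    # ... model loading elided ...
    if model_args.processor is None:
        try:
            model_args.processor = initialize_processor_from_path(
                model_args, model_args.model
            )
        except Exception as exc:
            if dataset_args is not None and dataset_args.dataset is not None:
                # A dataset implies training/calibration, which needs a
                # processor: fail loudly instead of continuing.
                raise RuntimeError(
                    "Processor initialization failed but a dataset was "
                    "provided; please pass a processor explicitly."
                ) from exc
            if output_dir is not None:
                # Data-free recipe: continue, but warn that no processor
                # will be saved alongside the compressed model.
                warnings.warn(
                    f"Could not initialize processor ({exc}); it will not "
                    f"be saved to {output_dir}."
                )
```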

TEST PLAN:
The example script in #1795 succeeds on this branch. Confirmed that a warning is logged when output_dir is set and an error is raised when a dataset is set (see the reproduction sketch below).
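For reference, a data-free run in the spirit of the #1795 reproduction looks roughly like this; the model id and ignore list are placeholders, not taken from the issue:

```python
from llmcompressor import oneshot
from llmcompressor.modifiers.quantization import QuantizationModifier

# Data-free FP8_DYNAMIC recipe: no dataset, so no processor should be needed.
recipe = QuantizationModifier(
    targets="Linear", scheme="FP8_DYNAMIC", ignore=["lm_head"]
)
oneshot(
    model="mistralai/Mistral-7B-Instruct-v0.3",  # placeholder model id
    recipe=recipe,
    output_dir="./model-FP8-dynamic",
)
```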

@gemini-code-assist bot (Contributor) left a comment

Summary of Changes

Hello @brian-dellabetta, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses an issue where the system would attempt to initialize a processor even when no dataset was provided, causing errors for data-free operations. The changes ensure that processor initialization only occurs when a dataset is present, streamlining the pre-processing flow and preventing unnecessary resource allocation.

Highlights

  • Conditional Processor Initialization: The system will now only initialize a processor during pre-processing if a dataset is explicitly provided. This resolves an issue where processors were unnecessarily initialized for data-free recipes, leading to user issues.
  • pre_process Function Signature Update: The pre_process function in train.py and utils.py has been updated to accept dataset_args, enabling the new conditional logic (sketched below).
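In rough terms, the conditional the bot describes looks like the following. This sketches the PR's initial revision (it later evolved into the try/except error handling above) and reuses the stand-in helper from the earlier sketch:

```python
def pre_process(model_args, dataset_args=None):
    # Only training/calibration runs need a processor, and those require a
    # dataset; skip initialization entirely for data-free recipes.
    if dataset_args is not None and dataset_args.dataset is not None:
        model_args.processor = initialize_processor_from_path(
            model_args, model_args.model
        )
```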

@brian-dellabetta brian-dellabetta added the ready label Sep 3, 2025
@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request aims to prevent the initialization of a processor when no dataset is provided, which is useful for data-free recipes. The changes involve passing dataset_args to the pre_process function and adding a condition to only initialize the processor if a dataset is specified. The overall approach is correct, but I've identified a potential issue where specifying a dataset via dataset_path might not be handled correctly by the new condition. I've left a comment with a suggestion to make the check more robust. Please also note that get_processed_dataset in src/llmcompressor/datasets/utils.py seems to have a related issue where it only checks for dataset_args.dataset and might need to be updated to fully support dataset_path.
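The suggested hardening amounts to treating either dataset or dataset_path as "a dataset was provided" — roughly as follows (attribute names taken from the review comment; sketch only):

```python
def dataset_provided(dataset_args) -> bool:
    # A dataset may arrive as a named/loaded dataset or as a path on disk;
    # checking only `dataset` would miss the `dataset_path` case.
    return dataset_args is not None and (
        dataset_args.dataset is not None
        or getattr(dataset_args, "dataset_path", None) is not None
    )
```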

@brian-dellabetta brian-dellabetta changed the title only init processor if dataset provided [Entrypoints] only init processor if dataset provided Sep 3, 2025
fynnsu previously approved these changes Sep 3, 2025

@fynnsu (Collaborator) left a comment

LGTM (although fix the gemini issue)


github-actions bot commented Sep 3, 2025

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

shanjiaz previously approved these changes Sep 3, 2025

@shanjiaz (Collaborator) left a comment

Makes a lot of sense to me! Could you add the fix Gemini proposed? Thanks!

@kylesayrs (Collaborator) left a comment

Can we check that not saving the processor doesn't break loading in vllm?
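A quick way to check, using vLLM's standard loading path (the output path is a placeholder for the PR's output_dir):

```python
from vllm import LLM, SamplingParams

# Load the checkpoint saved without a processor and confirm that tokenizer
# resolution and generation still work end to end.
llm = LLM(model="./model-FP8-dynamic")  # placeholder output_dir
outputs = llm.generate(["Hello, world"], SamplingParams(max_tokens=8))
print(outputs[0].outputs[0].text)
```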

@brian-dellabetta brian-dellabetta dismissed stale reviews from shanjiaz and fynnsu via 91eb9de September 3, 2025 16:37
@vllm-project vllm-project deleted a comment from gemini-code-assist bot Sep 4, 2025
rahul-tuli previously approved these changes Sep 4, 2025
@brian-dellabetta brian-dellabetta changed the title [Entrypoints] only init processor if dataset provided [Entrypoints] init processor error handling Sep 4, 2025
@brian-dellabetta brian-dellabetta changed the title [Entrypoints] init processor error handling [Entrypoints] initialize processor error handling Sep 4, 2025
kylesayrs previously approved these changes Sep 4, 2025
shanjiaz previously approved these changes Sep 4, 2025
@shanjiaz (Collaborator) left a comment

Thanks for adding the warnings!

Signed-off-by: Brian Dellabetta <[email protected]>
@brian-dellabetta brian-dellabetta enabled auto-merge (squash) September 4, 2025 21:45
Successfully merging this pull request may close these issues.

[Bug]: MistralCommonTokenizer not supported