Conversation

@KeitaW KeitaW commented Sep 27, 2025

This PR aims to address #482 by adding a DummyOptimizerWithStateDict class that inherits from torch.optim.Optimizer, resolving the AttributeError that occurs when DeepSpeed tries to save checkpoints during weight conversion.

The built-in DummyOptim created when passing optimizer=None to deepspeed.initialize() lacks a state_dict() method. Additionally, DeepSpeed validates that the optimizer is an instance of expected types (Optimizer, None, or Callable).

This fix provides a custom optimizer that:

  • Inherits from torch.optim.Optimizer to pass DeepSpeed's type check
  • Implements required methods: step(), state_dict(), and load_state_dict()
  • Returns empty state since optimizer state is not needed during conversion
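A minimal sketch of what such a class could look like, based only on the description above (the actual implementation in the PR may differ in details such as constructor arguments and the exact state_dict contents):

```python
import torch

class DummyOptimizerWithStateDict(torch.optim.Optimizer):
    """Placeholder optimizer that passes DeepSpeed's isinstance check
    and supports checkpoint saving during weight conversion.
    Sketch only; not the PR's actual code."""

    def __init__(self, params, lr=0.0):
        # torch.optim.Optimizer requires a defaults dict; lr is unused.
        super().__init__(params, defaults={"lr": lr})

    def step(self, closure=None):
        # No-op: no parameter updates happen during weight conversion.
        if closure is not None:
            return closure()
        return None

    def state_dict(self):
        # Empty state: optimizer state is not needed for conversion.
        return {"state": {}, "param_groups": []}

    def load_state_dict(self, state_dict):
        # Accept and discard any provided state.
        pass
```

Because the class is a real torch.optim.Optimizer subclass, `isinstance(opt, torch.optim.Optimizer)` holds, and DeepSpeed's checkpoint path can call `state_dict()` without raising AttributeError.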

Co-authored-by: aravneelaws <[email protected]>
