Adding StateDictAdapter #1601

HosseinKaviani-H · 2025-08-19T18:14:58Z

In this PR, I'm adding the StateDictAdapter for Qwen3 to enable loading HF checkpoints. We can use this script to adapt the checkpoint from HF to the format that we can load into the torchtitan model and vice versa. This can enable us to do a parity test with the HF implementation and make sure that our results are aligned with the HF implementation.

torchtitan/experiments/qwen3/model/state_dict_adapter.py

wwwjn

Update: Please fix __init__.py first

wwwjn

Almost forgot to mention, you need to plug in the stateDictAdapter in __init__.py

…_local

HosseinKaviani-H · 2025-08-20T21:32:20Z

Update: Please fix __init__.py first

Fixed

facebook-github-bot · 2025-08-20T22:37:15Z

@HosseinKaviani-H has imported this pull request. If you are a Meta employee, you can view this in D80660953.

facebook-github-bot · 2025-08-21T21:28:23Z

@HosseinKaviani-H has imported this pull request. If you are a Meta employee, you can view this in D80660953.

Fix config file path in run_train.sh

a09953b

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 19, 2025

Adding state_dict_adapter

8dbceeb

wwwjn reviewed Aug 19, 2025

View reviewed changes

torchtitan/experiments/qwen3/model/state_dict_adapter.py Outdated Show resolved Hide resolved

Adding state_dict_adapter

a755f53

tianyu-l reviewed Aug 19, 2025

View reviewed changes

torchtitan/experiments/qwen3/model/state_dict_adapter.py Show resolved Hide resolved

Hossein Kavianihamedani and others added 2 commits August 19, 2025 16:19

Resolve README conflict

0a04bde

Merge branch 'main' into main

ee4485f

wwwjn approved these changes Aug 20, 2025

View reviewed changes

wwwjn requested changes Aug 20, 2025

View reviewed changes

Hossein Kavianihamedani added 3 commits August 20, 2025 14:25

Resolve README conflict and add StateDictAdapter changes

41f6589

Merge branch 'main' of https://github.com/HosseinKaviani-H/torchtitan…

8c93715

…_local

Update __init__.py file

4c790fb

wwwjn approved these changes Aug 20, 2025

View reviewed changes

Merge branch 'pytorch:main' into main

43c5565

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding StateDictAdapter #1601

Adding StateDictAdapter #1601

HosseinKaviani-H commented Aug 19, 2025

Uh oh!

Uh oh!

Uh oh!

wwwjn left a comment •

edited

Loading

Uh oh!

wwwjn left a comment •

edited

Loading

Uh oh!

HosseinKaviani-H commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 21, 2025

Uh oh!

Uh oh!

Adding StateDictAdapter #1601

Are you sure you want to change the base?

Adding StateDictAdapter #1601

Conversation

HosseinKaviani-H commented Aug 19, 2025

Uh oh!

Uh oh!

Uh oh!

wwwjn left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wwwjn left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HosseinKaviani-H commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 21, 2025

Uh oh!

Uh oh!

wwwjn left a comment •

edited

Loading

wwwjn left a comment •

edited

Loading