Adds gradient cap for teacher student distillation #91
Conversation
Thanks a lot for the MR!
Hm, not sure why there are formatting issues in the runner file. Could we undo those please?
Also, is there a value of max grad norm that works decently? It might make sense to keep it as the default.
Yeah, I was wondering the same.
You are right, making the default 0.2 might or might not work for all use cases, so it's fine to keep it None. Yes, we need to modify the IsaacLab config to support this parameter as well. Merging this MR then. Not sure why the formatting came out differently, but CI didn't complain, so it might just be pre-commit version differences.
@ClemensSchwarke after merging #84, #85, #87 -- it might make sense to do a patch release of rsl-rl as well.
This PR adds a gradient cap to the teacher-student distillation setup. The goal is to prevent excessively large gradients from destabilizing training.

📌 Changes

- Introduced a clipping mechanism to cap the gradients during backpropagation in the distillation process.
- Helps improve training stability, especially in early iterations.

---------
Co-authored-by: alessandro.assirelli <[email protected]>
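For readers unfamiliar with the technique, the gradient cap described above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the network, loss, and the `max_grad_norm` value of 0.2 are stand-ins (the thread leaves the default as `None`, which disables clipping).

```python
# Sketch of gradient-norm clipping in a distillation-style update step.
# All names here are illustrative, not taken from rsl_rl.
import torch
import torch.nn as nn

torch.manual_seed(0)

student = nn.Linear(8, 4)  # stand-in for the student policy network
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)
max_grad_norm = 0.2        # hypothetical cap; None would disable clipping

obs = torch.randn(32, 8)              # stand-in observations
teacher_actions = torch.randn(32, 4)  # stand-in teacher targets

loss = nn.functional.mse_loss(student(obs), teacher_actions)
optimizer.zero_grad()
loss.backward()
if max_grad_norm is not None:
    # Rescales all gradients in place so their total norm does not
    # exceed the cap; returns the (pre-clipping) total norm.
    nn.utils.clip_grad_norm_(student.parameters(), max_grad_norm)
optimizer.step()
```

Clipping by total norm preserves the direction of the gradient while bounding its magnitude, which is why it helps most in early iterations when the student's outputs are far from the teacher's.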
# Description

Added `max_grad_norm` field to `RslRlDistillationAlgorithmCfg` in order to be compatible with leggedrobotics/rsl_rl#91

## Type of change

- New feature (non-breaking change which adds functionality)

## Checklist

- [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format`
- [ ] I have made corresponding changes to the documentation
- [x] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my feature works
- [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file
- [x] I have added my name to the `CONTRIBUTORS.md` or my name already exists there

---------
Signed-off-by: Mayank Mittal <[email protected]>
Co-authored-by: alessandro.assirelli <[email protected]>
Co-authored-by: Mayank Mittal <[email protected]>
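The shape of such a config field can be sketched with a plain dataclass. This is an assumption-laden illustration only: apart from the `max_grad_norm` name and its `None`-disables-clipping semantics discussed in the thread, the class and field names below are hypothetical and do not reproduce the real `RslRlDistillationAlgorithmCfg`.

```python
# Illustrative config sketch: an optional max_grad_norm that defaults
# to None (clipping disabled). Other fields are hypothetical.
from dataclasses import dataclass
from typing import Optional


@dataclass
class DistillationAlgorithmCfg:
    learning_rate: float = 1e-3            # hypothetical example field
    max_grad_norm: Optional[float] = None  # None disables gradient clipping
```

Keeping the default at `None` makes the feature opt-in, so existing training configs behave exactly as before the change.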