Replies: 1 comment
CPU offloading is not on our roadmap right now.
Hello, is there any plan to support offloading optimizer states, or even parameters, as ZeRO-Offload does? DeepSpeed supports offloading, but without mcore's parallelism or TE. I hope it will be possible to use offloading together with TP/PP/CP and TE to achieve high performance, especially when H2D/D2H bandwidth is higher, like those mentioned in https://openreview.net/pdf?id=rqn2v1Ltgn0.
Looking forward to your reply and thanks!