Add custom communicator for trtllm_mnnvl_ar #2056
base: main
    group_rank: int,
    device_idx: int,
    is_multi_node: bool = True,
    comm: Optional[CommBackend] = None,
How about calling it comm_backend or communicator?
cc @nvmbreughe in case you have any preference.
comm_backend sounds good. Or, to be even more explicit, comm_backend_for_handle_transfer. Besides the name, I would also list some options: MpiComm()
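To make the naming suggestion concrete, here is a minimal sketch of how an optional comm_backend argument could fall back to a default when the caller passes None. Everything in it is an assumption for illustration: the CommBackend interface shown (rank, size, allgather), SingleProcessComm, and resolve_comm_backend are hypothetical and are not taken from the PR; the real default would presumably be the MPI-based MpiComm() mentioned above.

```python
from abc import ABC, abstractmethod
from typing import Any, List, Optional


class CommBackend(ABC):
    """Hypothetical minimal interface; the CommBackend in the PR may differ."""

    @abstractmethod
    def rank(self) -> int: ...

    @abstractmethod
    def size(self) -> int: ...

    @abstractmethod
    def allgather(self, obj: Any) -> List[Any]: ...


class SingleProcessComm(CommBackend):
    """Illustrative backend for a world of exactly one process."""

    def rank(self) -> int:
        return 0

    def size(self) -> int:
        return 1

    def allgather(self, obj: Any) -> List[Any]:
        # With a single rank, the gathered list holds only our own object.
        return [obj]


def resolve_comm_backend(comm_backend: Optional[CommBackend] = None) -> CommBackend:
    """Return the caller's backend, or a default when none is given.

    Assumption: a single-process stand-in replaces the MPI default so that
    this sketch runs without mpi4py installed.
    """
    return comm_backend if comm_backend is not None else SingleProcessComm()
```

A caller that wants a different transport would pass its own CommBackend subclass; passing nothing keeps the default behaviour.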
    )


@torch.inference_mode()
def row_linear_residual_norm_forward(
Can we make this a helper function as tests/comm/test_trtllm_mnnvl_allreduce.py also uses it?
Description
Related Issues
Pull Request Checklist
Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.
Pre-commit Checks
- I have installed pre-commit by running pip install pre-commit (or used your preferred method).
- I have installed the hooks with pre-commit install.
- I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.
Tests
- All tests are passing (unittest, etc.).
Reviewer Notes