Remove mamba-ssm package #22409

Closed
wants to merge 2 commits into from

Conversation

@tlrmchlsmth (Collaborator) commented Aug 6, 2025

Purpose

The packages mamba-ssm and causal-conv1d are not used by vLLM itself; they are only used in tests to compare against the Hugging Face Transformers baselines.

In particular, those packages are needed in order to run PLaMo2 at all, since the model definition for PLaMo2 lives at https://huggingface.co/pfnet/plamo-2-1b/blob/main/modeling_plamo.py rather than in Hugging Face Transformers.

This PR removes the packages to avoid pain points when upgrading PyTorch or CUDA.

Test Plan

Test Result

Signed-off-by: Tyler Michael Smith <[email protected]>
@tlrmchlsmth added the ready label Aug 6, 2025
@mergify bot added the documentation and ci/build labels Aug 6, 2025
@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request removes the mamba-ssm package and its related causal-conv1d dependency, which were only used for testing the PLaMo2 model against its Hugging Face implementation. The changes correctly remove the dependencies from requirements/test.in, the Dockerfile, and documentation.

However, I've identified a critical issue in tests/models/language/generation/test_hybrid.py where the PLaMo2 model (pfnet/plamo-2-1b) is completely removed from the test suite, instead of just skipping the comparison with the Hugging Face baseline. I've provided a suggestion to fix this to ensure the model remains tested by vLLM.
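
To make that suggestion concrete, below is a minimal sketch of how the test could keep pfnet/plamo-2-1b exercised by vLLM while only skipping the Hugging Face comparison when the optional packages are missing. The vllm_runner / hf_runner fixtures, example_prompts, and generate_greedy helper are assumed to follow vLLM's existing test conventions; this is illustrative only, not the actual diff proposed in this PR.

import importlib.util
import pytest

# The HF baseline needs these optional packages; vLLM itself does not.
HF_BASELINE_DEPS_AVAILABLE = all(
    importlib.util.find_spec(pkg) is not None
    for pkg in ("mamba_ssm", "causal_conv1d")
)


@pytest.mark.parametrize("model", ["pfnet/plamo-2-1b"])
def test_plamo2_generation(vllm_runner, hf_runner, example_prompts, model):
    # Always run the model through vLLM so it stays covered by CI.
    with vllm_runner(model, max_model_len=1024) as vllm_model:
        vllm_outputs = vllm_model.generate_greedy(example_prompts, max_tokens=32)

    if HF_BASELINE_DEPS_AVAILABLE:
        # Compare against the Hugging Face implementation only when its
        # optional dependencies (mamba-ssm, causal-conv1d) are installed.
        with hf_runner(model, trust_remote_code=True) as hf_model:
            hf_outputs = hf_model.generate_greedy(example_prompts, max_tokens=32)
        assert hf_outputs == vllm_outputs

Keeping the vLLM run unconditional means the model remains tested even when the HF-baseline dependencies are absent from the test image.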

github-actions bot commented Aug 6, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Tyler Michael Smith <[email protected]>
@tlrmchlsmth mentioned this pull request Aug 6, 2025
@DarkLight1337 (Member) commented Aug 7, 2025

Some other tests still require mamba-ssm. It's just that causal-conv1d is not required.

@tlrmchlsmth (Collaborator, Author) replied:

> Some other tests still require mamba-ssm. It's just that causal-conv1d is not required.

I see, you're right - there are both numeric issues and I'm also seeing this:

[2025-08-07T01:50:59Z]         if self.use_fast_kernels:
[2025-08-07T01:50:59Z]             if not is_fast_path_available or "cuda" not in self.x_proj.weight.device.type:
[2025-08-07T01:50:59Z] >               raise ValueError(
[2025-08-07T01:50:59Z]                     "Fast Mamba kernels are not available. Make sure to they are installed and that the mamba module is on a CUDA device"
[2025-08-07T01:50:59Z]                 )
[2025-08-07T01:50:59Z] E               ValueError: Fast Mamba kernels are not available. Make sure to they are installed and that the mamba module is on a CUDA device
[2025-08-07T01:50:59Z] /usr/local/lib/python3.12/dist-packages/transformers/models/jamba/modeling_jamba.py:826: ValueError

@Alnusjaponica (Contributor) commented:

As mentioned earlier in #20047 (comment), some tests run pfnet/plamo-2-1b inference with the transformers implementation, which implicitly depends on both mamba-ssm and causal-conv1d. Sorry for the inconvenience.
Maybe we can avoid this dependency by hardcoding the inference results. Would that be acceptable for the unit tests?
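
For illustration, a hardcoded-baseline version of such a test might look roughly like the sketch below. The expected strings are placeholders to be recorded once from the transformers implementation of pfnet/plamo-2-1b, and the vllm_runner fixture and generate_greedy helper are assumed from vLLM's existing test conventions rather than taken from this PR.

import pytest

# Placeholder golden outputs, to be recorded offline from the HF implementation
# of pfnet/plamo-2-1b (the values below are stand-ins, not real completions).
EXPECTED_GREEDY_OUTPUTS = {
    "Hello, my name is": "<recorded HF completion goes here>",
}


@pytest.mark.parametrize("model", ["pfnet/plamo-2-1b"])
def test_plamo2_matches_recorded_hf_outputs(vllm_runner, model):
    prompts = list(EXPECTED_GREEDY_OUTPUTS)
    with vllm_runner(model, max_model_len=1024) as vllm_model:
        outputs = vllm_model.generate_greedy(prompts, max_tokens=16)

    for prompt, (_, generated_text) in zip(prompts, outputs):
        # Greedy decoding should be deterministic, so the generated text can be
        # compared directly against the recorded baseline string.
        assert generated_text == EXPECTED_GREEDY_OUTPUTS[prompt]

This removes the runtime dependency on mamba-ssm and causal-conv1d from the test image, at the cost of having to re-record the baselines whenever the model or tokenizer changes.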

@tdoublep (Member) commented Aug 8, 2025

Can't we just remove these as vLLM dependencies but install them inside the test image? This is what we are doing now for the causal_conv1d package.

Related topic: the hybrid tests are completely messed up right now due to a CUDA version mismatch issue. This would also be solved if we installed mamba_ssm inside the test container in the "right way" (--no-build-isolation).

@tdoublep (Member) commented Aug 8, 2025

This is what I propose as a different solution to this problem:
#22541

@DarkLight1337 (Member) commented:

Closing in favor of #22541

Labels
ci/build, documentation, ready
Projects
None yet
4 participants