add changes for lora adapter support and /v1/models endpoint #121
pandyamarut merged 2 commits into runpod-workers:main
Conversation
Thanks for the PR, @sven-knoblauch. Can you please also describe how you tested it?

I built a Docker container with the given Dockerfile (on Docker Hub: svenknob/runpod-vllm-worker) and tested it on RunPod serverless. It worked with a custom-trained LoRA adapter (added in the RunPod GUI as the ENV variable LORA_MODULES) on an AWQ Mistral model. The LoRA adapter is also visible in the /v1/models endpoint.

Hi, is there documentation for this ENV variable usage, perhaps in the markdown?

Is a Docker image publicly available that contains this PR?

I added a pull request that updates the README: #130. For now you can use my Docker image svenknob/runpod-vllm-worker until it has been published.
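The verification step described above (checking that the LoRA adapter shows up in /v1/models) can be sketched as a small check. This is an assumption-laden illustration, not code from the PR: the endpoint path comes from the thread, but the response shape is assumed to follow the OpenAI-compatible list-of-models schema, and the URL and model IDs are placeholders.

```python
import json
from urllib.request import urlopen  # only needed for the live call


def model_ids(models_response: dict) -> list:
    """Extract model IDs from an OpenAI-style /v1/models payload."""
    return [m["id"] for m in models_response.get("data", [])]


# Live check against a running worker (URL and adapter name are placeholders):
# with urlopen("http://localhost:8000/v1/models") as resp:
#     payload = json.load(resp)
#     assert "my-lora-adapter" in model_ids(payload)

# Offline illustration with the assumed response shape:
example = {
    "object": "list",
    "data": [
        {"id": "base-awq-mistral", "object": "model"},
        {"id": "my-lora-adapter", "object": "model"},
    ],
}
print(model_ids(example))
```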
Small changes that add the LORA_MODULES ENV variable (currently supporting one LoRA adapter) as a solution for #119, with the format:
{"name": "xxx", "path": "xxx/xxxxx", "base_model_name": "xxx/xxxx"}
Also changes the /v1/models endpoint to return all models, including loaded LoRA adapters.
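A minimal sketch of how a worker might parse LORA_MODULES, assuming the single-JSON-object format from the description above. The exact parsing logic in the PR is not shown here; the helper name and error handling are illustrative, and only the key names are taken from the thread.

```python
import json
import os

# Keys taken from the format given in the PR description.
REQUIRED_KEYS = {"name", "path", "base_model_name"}


def parse_lora_modules(raw: str) -> dict:
    """Parse the LORA_MODULES ENV variable (a single JSON object)."""
    spec = json.loads(raw)
    missing = REQUIRED_KEYS - spec.keys()
    if missing:
        raise ValueError(f"LORA_MODULES missing keys: {sorted(missing)}")
    return spec


# Example values are placeholders, matching the xxx pattern above:
raw = os.environ.get(
    "LORA_MODULES",
    '{"name": "my-adapter", "path": "user/my-adapter", '
    '"base_model_name": "user/base-model"}',
)
spec = parse_lora_modules(raw)
print(spec["name"])
```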