feat: Add enable_model_discovery to enable/disable model discovery on startup #3677

akram · 2025-10-03T21:50:18Z

Add enable_model_discovery configuration flag to vLLM provider to control model listing behavior
Implement enable_model_discovery() method across all providers with default implementations in base classes
Prevent HTTP requests to /v1/models endpoint when enable_model_discovery=false to prevent crash on startup

What does this PR do?

Allows to disable model listing on startup. Useful for models that are declared in vllm but not reachable (not configured or behind authentication wall)
participate in fixing #3151

Test Plan

providers:
  inference:
  - provider_id: ${env.VLLM_URL:+vllm}
    provider_type: remote::vllm
    config:
      url: ${env.VLLM_URL:=}
      max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
      api_token: ${env.VLLM_API_TOKEN:=fake}
      tls_verify: ${env.VLLM_TLS_VERIFY:=true}
      refresh_models: ${env.VLLM_REFRESH_MODELS:=false}
      enable_model_discovery: ${env.ENABLE_MODEL_DISCOVERY:=false}

models:
- metadata:
    display_name: vllm
  model_id: vllm
  provider_id: vllm
  model_type: llm

run

LLAMA_STACK_LOGGING="all=debug" VLLM_URL=https://my-vllm-server:8443/v1  MILVUS_DB_PATH=./milvus.db INFERENCE_MODEL=vllm uv run --with llama-stack llama stack build --distro starter --image-type venv --run

you should see:

DEBUG    2025-10-06 17:06:43,560 llama_stack.providers.remote.inference.vllm.vllm:286 inference::vllm: VLLM list_models called, enable_model_discovery=False
DEBUG    2025-10-06 17:06:43,561 llama_stack.providers.remote.inference.vllm.vllm:288 inference::vllm: VLLM list_models returning None due to enable_model_discovery=False```

• Add allow_listing_models configuration flag to VLLM provider to control model listing behavior • Implement allow_listing_models() method across all providers with default implementations in base classes • Prevent HTTP requests to /v1/models endpoint when allow_listing_models=false for improved security and performance • Fix unit tests to include allow_listing_models method in test classes and mock objects

akram · 2025-10-04T21:52:39Z

@ashwinb can you PTAL ?

docs/docs/providers/inference/remote_vllm.mdx

• Add comprehensive error handling in check_model_availability method • Provide helpful error messages with actionable solutions for 404 errors • Warn when API token is set but model discovery is disabled

llama_stack/core/routing_tables/models.py

llama_stack/providers/remote/inference/vllm/vllm.py

mattf

the logic around model management is already complex with multiple knobs.

can you use allowed_models instead?

akram requested review from ashwinb, yanxi0830, hardikjshah, raghotham, ehhuang, terrytangyuan, leseb, bbrowning, reluctantfuturist, mattf and slekkala1 as code owners October 3, 2025 21:50

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 3, 2025

akram force-pushed the add_allow_listing_models branch 3 times, most recently from 3ad3f20 to d301389 Compare October 3, 2025 22:12

akram force-pushed the add_allow_listing_models branch from d301389 to e9214f9 Compare October 3, 2025 22:18

leseb mentioned this pull request Oct 6, 2025

feat: implement graceful model discovery for vLLM provider #3673

Closed

leseb reviewed Oct 6, 2025

View reviewed changes

docs/docs/providers/inference/remote_vllm.mdx Outdated Show resolved Hide resolved

akram changed the title ~~feat: Add allow_listing_models~~ feat: Add enable_model_discovery to enable/disable model discovery on startup Oct 6, 2025

Improve VLLM model discovery error handling

e28bc93

• Add comprehensive error handling in check_model_availability method • Provide helpful error messages with actionable solutions for 404 errors • Warn when API token is set but model discovery is disabled

akram requested a review from franciscojavierarceo as a code owner October 6, 2025 10:57

leseb requested changes Oct 6, 2025

View reviewed changes

Review changes

055179a

mattf requested changes Oct 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add enable_model_discovery to enable/disable model discovery on startup #3677

feat: Add enable_model_discovery to enable/disable model discovery on startup #3677

akram commented Oct 3, 2025 •

edited

Loading

Uh oh!

akram commented Oct 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattf left a comment

Uh oh!

Uh oh!

feat: Add enable_model_discovery to enable/disable model discovery on startup #3677

Are you sure you want to change the base?

feat: Add enable_model_discovery to enable/disable model discovery on startup #3677

Conversation

akram commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

akram commented Oct 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattf left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

akram commented Oct 3, 2025 •

edited

Loading