feat: add prompt_cache_key for cross-call cache routing #653
Summary
Adds `prompt_cache_key=agent_id` to all OpenAI and Azure LLM API call sites so that calls for the same agent route to the same GPU server, enabling cross-call cache hits on the system prompt. Previously, each new call landed on a random server and started with a cold cache on the first turn. Now the first turn reuses the cached system prompt from previous calls for the same agent.
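For illustration, a minimal sketch of how a call site can attach the key through the openai-python v1 SDK's `extra_body` passthrough; the helper name and argument shapes here are assumptions, not the exact bolna call sites:

```python
# Minimal sketch (assumed helper, not the exact bolna code): forward the
# caller-supplied agent_id as prompt_cache_key via the SDK's extra_body.
from openai import AsyncOpenAI

client = AsyncOpenAI()

async def chat_with_cache_routing(messages, model, agent_id=None):
    kwargs = {"model": model, "messages": messages, "stream": True}
    if agent_id:
        # Same key for the same agent -> requests route to the same server,
        # so the identical system prompt can hit the prompt cache across calls.
        kwargs["extra_body"] = {"prompt_cache_key": str(agent_id)}
    return await client.chat.completions.create(**kwargs)
```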
Caller-controlled opt-in: bolna sets `prompt_cache_key` only when `agent_id` is explicitly passed in kwargs. See https://github.com/bolna-ai/dashboard-backend/pull/2041 for the allowlist consumer.
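A hypothetical sketch of that opt-in from the caller's side (`build_llm` and its arguments are illustrative placeholders, not bolna APIs): `agent_id` is forwarded only when the caller supplied it, so existing callers see no behavior change.

```python
# Hypothetical caller-side sketch: forward agent_id only when it was
# explicitly provided, keeping prompt_cache_key strictly opt-in.
def build_llm(llm_class, config: dict, **kwargs):
    llm_kwargs = dict(config)
    if "agent_id" in kwargs:           # explicit opt-in by the caller
        llm_kwargs["agent_id"] = kwargs["agent_id"]
    return llm_class(**llm_kwargs)     # no agent_id -> no prompt_cache_key set
```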
What changed

- `task_manager.py`: passes `agent_id` to the LLM when the caller passes it in kwargs
- `openai_llm.py`: adds `extra_body` with `prompt_cache_key` in streaming and non-streaming chat completions; pops and merges it for the WebSocket path
- `azure_llm.py`: accepts `agent_id`; adds `extra_body` in streaming and non-streaming chat completions
- `openai_base.py`: adds `extra_body` in `_build_responses_create_kwargs` and `_generate_responses`
- `graph_agent.py`, `knowledgebase_agent.py`: pass `agent_id` through to LLM initialization

All changes are guarded by `if self.agent_id`; there is no effect when `agent_id` is absent.
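A sketch of the guard-and-merge pattern described in the list above (class and method names assumed, not the exact bolna implementation): the key is added only when `self.agent_id` is set, and any `extra_body` the caller already supplied is preserved rather than overwritten.

```python
# Illustrative sketch of the guard: merge prompt_cache_key into any existing
# extra_body, and do nothing at all when agent_id was not provided.
class OpenAiLLMSketch:
    def __init__(self, agent_id=None, **kwargs):
        self.agent_id = agent_id

    def _apply_cache_key(self, request_kwargs: dict) -> dict:
        if self.agent_id:
            extra_body = request_kwargs.pop("extra_body", None) or {}
            extra_body.setdefault("prompt_cache_key", str(self.agent_id))
            request_kwargs["extra_body"] = extra_body
        return request_kwargs
```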