
Add stable prompt cache boundary#577

Open
vsumner wants to merge 1 commit into main from codex/prompt-cache-boundary

Conversation

@vsumner (Collaborator) commented Apr 19, 2026

Summary

  • Adds an explicit system prompt cache boundary after stable channel prompt content.
  • Splits Anthropic system prompt blocks so only the stable prefix receives cache control.
  • Strips the internal boundary marker for non-Anthropic providers and documents the prompt cache behavior.
  • Sorts MCP worker capability names so the cached prefix stays deterministic.

Test Plan

  • cargo fmt --all
  • cargo test --lib get_tool_names_returns_deterministic_sorted_names
  • cargo test --lib prompt_cache
  • git diff --check
  • just preflight
  • just gate-pr

Review Finding Closure

  • P2 MCP capability ordering: fixed in McpManager::get_tool_names by sorting rendered capability lines before prompt rendering. Verified with cargo test --lib get_tool_names_returns_deterministic_sorted_names.

Note

Implements prompt cache boundary infrastructure for Anthropic API integration. Adds new llm/anthropic module with auth path detection (API key vs OAuth vs proxy bearer), cache retention policies (None/Short/Long with TTL), and parameter building. Updates MCP manager to sort tool names deterministically for stable cache prefixes. System prompts now include explicit boundary markers that are preserved for Anthropic but stripped for other providers, enabling cache hits on stable channel/branch/worker content while keeping dynamic elements below the boundary. Includes comprehensive tests for cache control resolution, tool name ordering, and auth path detection.
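The split/strip behavior described above can be sketched in a few lines of Rust. This is a hypothetical reconstruction from the PR description — the marker value, helper signatures, and return types are assumptions, not the repository's actual code:

```rust
// Illustrative marker value; the real constant lives in src/prompts/engine.rs.
const SYSTEM_PROMPT_CACHE_BOUNDARY: &str = "<!-- SYSTEM_PROMPT_CACHE_BOUNDARY -->";

/// Split a rendered system prompt into (stable_prefix, volatile_suffix)
/// at the first boundary marker, if one is present.
fn split_system_prompt_cache_boundary(prompt: &str) -> Option<(&str, &str)> {
    prompt.split_once(SYSTEM_PROMPT_CACHE_BOUNDARY)
}

/// Remove every occurrence of the marker, for providers that do not
/// support prompt caching and must never see the internal sentinel.
fn strip_system_prompt_cache_boundary(prompt: &str) -> String {
    prompt.replace(SYSTEM_PROMPT_CACHE_BOUNDARY, "")
}

fn main() {
    let prompt = format!(
        "stable channel instructions{}volatile conversation context",
        SYSTEM_PROMPT_CACHE_BOUNDARY
    );
    // Anthropic path: cache the stable prefix, leave the suffix uncached.
    let (stable, volatile) = split_system_prompt_cache_boundary(&prompt).unwrap();
    assert_eq!(stable, "stable channel instructions");
    assert_eq!(volatile, "volatile conversation context");
    // Non-Anthropic path: the sentinel never reaches the provider.
    assert!(!strip_system_prompt_cache_boundary(&prompt)
        .contains(SYSTEM_PROMPT_CACHE_BOUNDARY));
}
```

`split_once` returns `None` when no marker is present, which maps cleanly onto the fallback of caching the whole preamble as a single block.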

Written by Tembo for commit 4ec775f

@coderabbitai Bot (Contributor) commented Apr 19, 2026

Walkthrough

Adds a system-prompt cache boundary marker, helpers to split/strip it, threads the marker into channel prompt rendering and templates, applies selective cache_control when building Anthropic system blocks, strips the marker for non‑Anthropic providers, updates tests, docs, and sorts MCP tool-name output.

Changes

| Cohort / File(s) | Summary |
|---|---|
| **Documentation & Templates**<br>`docs/content/docs/(core)/prompts.mdx`, `prompts/en/channel.md.j2` | Introduce "Prompt Cache Boundary" docs and inject `system_prompt_cache_boundary` into the channel prompt template after `worker_capabilities`. |
| **Prompt Engine Public API & Logic**<br>`src/prompts/engine.rs`, `src/prompts.rs` | Add the `SYSTEM_PROMPT_CACHE_BOUNDARY` constant plus `split_system_prompt_cache_boundary()` and `strip_system_prompt_cache_boundary()` helpers; pass the boundary token into the `render_channel_prompt_with_links` context; add unit tests. |
| **Provider Integration (Anthropic & Others)**<br>`src/llm/anthropic/params.rs`, `src/llm/model.rs` | Anthropic: split `request.preamble` at the boundary and emit stable/volatile system blocks with conditional `cache_control` via `push_system_text_block`. Model layer: normalize/strip the boundary for non-Anthropic payloads. Tests added for boundary behavior. |
| **MCP Determinism & Tests**<br>`src/mcp.rs` | Sort tool-name aggregation in `McpManager::get_tool_names()` and add tests/helpers to assert deterministic, lexicographically ordered results across connections. |
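The MCP determinism change amounts to sorting the aggregated names before they reach the prompt renderer, so the cached prefix is byte-identical across runs regardless of connection iteration order. A minimal sketch — the flatten-over-connections shape is assumed, not taken from `McpManager`:

```rust
// Hypothetical stand-in for McpManager::get_tool_names(): each inner Vec
// models the tool names reported by one MCP connection.
fn get_tool_names(connections: &[Vec<String>]) -> Vec<String> {
    let mut names: Vec<String> = connections.iter().flatten().cloned().collect();
    // Lexicographic sort makes the rendered capability list — and hence the
    // stable cached prompt prefix — deterministic across runs.
    names.sort();
    names
}

fn main() {
    let conn_a = vec!["search".to_string(), "fetch".to_string()];
    let conn_b = vec!["browse".to_string()];
    // Same result no matter which connection is enumerated first.
    assert_eq!(get_tool_names(&[conn_a.clone(), conn_b.clone()]), ["browse", "fetch", "search"]);
    assert_eq!(get_tool_names(&[conn_b, conn_a]), ["browse", "fetch", "search"]);
}
```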

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

Suggested reviewers

  • jamiepine
🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 73.08%, below the required threshold of 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Title check | ✅ Passed | The title "Add stable prompt cache boundary" accurately and concisely summarizes the main change: introducing an explicit cache boundary for prompt caching. |
| Description check | ✅ Passed | The description is comprehensive and directly related to the changeset, covering the summary, implementation details, test plan, and review findings. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |


@vsumner changed the base branch from codex/cortex-synthesis-reliability to main on April 20, 2026 02:18
@vsumner force-pushed the codex/prompt-cache-boundary branch from 4ec775f to f3163da on April 20, 2026 02:18
@vsumner marked this pull request as ready for review on April 20, 2026 02:18
@coderabbitai Bot (Contributor) left a comment

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
docs/content/docs/(core)/prompts.mdx (1)

113-142: ⚠️ Potential issue | 🟡 Minor

Keep memory context below the documented cache boundary.

The example still shows memory_bulletin above {{ system_prompt_cache_boundary }}, which implies it is part of the cached stable prefix. The actual channel template renders fallback memory context below the boundary, so this doc should move that block under the boundary and list it as volatile.

📝 Suggested docs adjustment

```diff
-{%- if memory_bulletin %}
-## Memory Context
-
-{{ memory_bulletin }}
-{%- endif %}
-
 [Base channel instructions...]

@@
 {%- if knowledge_synthesis %}
 ## Knowledge Context

 {{ knowledge_synthesis }}
 {%- endif %}
+
+{%- if memory_bulletin and not knowledge_synthesis %}
+## Memory Context
+
+{{ memory_bulletin }}
+{%- endif %}
@@
 - participant context
 - knowledge context
+- memory context fallback
 - conversation context
```

Also applies to: 170-183

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/content/docs/`(core)/prompts.mdx around lines 113 - 142, The doc
incorrectly places the fallback memory block (memory_bulletin) above the cached
stable prefix (system_prompt_cache_boundary); move the memory_bulletin block so
it appears below {{ system_prompt_cache_boundary }} and classify it as volatile
(i.e., describe it as non-cached/volatile context), and do the same
relocation/description for the other occurrence of the
memory_bulletin/working_memory blocks later in the file (the second instance
that mirrors lines ~170-183).
src/llm/anthropic/params.rs (1)

10-11: 🛠️ Refactor suggestion | 🟠 Major

Move the Claude Code preamble out of Rust.

CLAUDE_CODE_SYSTEM_PREAMBLE is sent as Anthropic system prompt text, so it should live in prompts/ and be loaded through the prompt/text registry instead of being stored as a Rust string constant.

As per coding guidelines, "Don't store prompts as string constants in Rust. System prompts live in prompts/ as markdown files. Load at startup or on demand."
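A minimal sketch of the suggested pattern — loading the preamble from a markdown file at startup instead of keeping it as a Rust constant. The file path and function names here are illustrative, not the project's actual registry API:

```rust
use std::fs;

// Hypothetical loader: reads a prompt file (e.g.
// prompts/claude/code_system_preamble.md) once at startup.
fn load_prompt(path: &std::path::Path) -> std::io::Result<String> {
    Ok(fs::read_to_string(path)?.trim().to_string())
}

fn main() -> std::io::Result<()> {
    // Stand-in for the real prompts/ file so the sketch is self-contained.
    let path = std::env::temp_dir().join("code_system_preamble.md");
    fs::write(&path, "You are a coding assistant.\n")?;

    let preamble = load_prompt(&path)?;
    assert_eq!(preamble, "You are a coding assistant.");
    Ok(())
}
```

Loading at startup keeps prompt text editable without recompiling, which is the point of the "prompts live in prompts/" guideline.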

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/llm/anthropic/params.rs` around lines 10 - 11, Remove the
CLAUDE_CODE_SYSTEM_PREAMBLE Rust constant and move its text into a markdown file
under prompts/ (e.g., prompts/claude/code_system_preamble.md), then change all
usages of CLAUDE_CODE_SYSTEM_PREAMBLE to load the prompt via the project's
prompt/text registry API (load at startup or on demand via the registry lookup)
so the system prompt is managed by the prompt registry instead of being a Rust
string constant.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: e63b23aa-a8f0-4ab2-a706-d1aff052545d

📥 Commits

Reviewing files that changed from the base of the PR and between 7b13a18 and f3163da.

📒 Files selected for processing (7)
  • docs/content/docs/(core)/prompts.mdx
  • prompts/en/channel.md.j2
  • src/llm/anthropic/params.rs
  • src/llm/model.rs
  • src/mcp.rs
  • src/prompts.rs
  • src/prompts/engine.rs

@vsumner force-pushed the codex/prompt-cache-boundary branch from f3163da to 720df56 on April 23, 2026 19:06
@coderabbitai Bot (Contributor) left a comment

Actionable comments posted: 1

🧹 Nitpick comments (1)
src/prompts/engine.rs (1)

8-8: Marker token value — consider uniqueness guarantees.

The HTML-comment marker is fine for the current markdown templates, but since strip_system_prompt_cache_boundary does a plain replace and split_once splits on the first occurrence, any user-authored content that happens to contain this exact comment (e.g., pasted transcript in backfill_transcript or working_memory) would collide. Unlikely in practice, but worth keeping in mind — a more obscure sentinel (e.g., including a UUID) would eliminate the risk entirely.
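The reviewer's suggestion can be illustrated directly: appending a fixed random suffix to the sentinel makes an accidental collision with pasted user content practically impossible. The suffix value below is made up for the example, not taken from the codebase:

```rust
// Hypothetical collision-resistant sentinel: the UUID-like suffix is
// illustrative. Any stable, high-entropy token works — it only has to be
// something no pasted transcript would plausibly contain verbatim.
const SYSTEM_PROMPT_CACHE_BOUNDARY: &str =
    "<!-- SYSTEM_PROMPT_CACHE_BOUNDARY:3f6c1a0e-7b2d-4c9e-9a41-d5e8f0b2c7aa -->";

fn main() {
    // A user pasting a plain HTML comment with the same name no longer
    // triggers split_once/replace, because the suffix does not match.
    let pasted = "transcript containing <!-- SYSTEM_PROMPT_CACHE_BOUNDARY -->";
    assert!(!pasted.contains(SYSTEM_PROMPT_CACHE_BOUNDARY));

    // The real marker still round-trips through split_once as before.
    let prompt = format!("stable{}volatile", SYSTEM_PROMPT_CACHE_BOUNDARY);
    assert_eq!(
        prompt.split_once(SYSTEM_PROMPT_CACHE_BOUNDARY),
        Some(("stable", "volatile"))
    );
}
```

The trade-off is that the marker must stay byte-identical everywhere it is embedded (templates, tests, helpers), which is why the suggestion routes all references through the single constant.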

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/prompts/engine.rs` at line 8, The current sentinel string
SYSTEM_PROMPT_CACHE_BOUNDARY is too generic and could collide with user content;
change its value to a far-more-obscure, globally-unique marker (e.g., append a
stable UUID-like suffix) and update all places that reference it (for example
strip_system_prompt_cache_boundary) to use the new constant; ensure any
serialization, tests, or template files that embed the marker are updated to the
new unique value so split_once/replace behavior cannot be triggered by
user-supplied text.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 1c278440-ba0d-44ab-a666-55dd80da3876

📥 Commits

Reviewing files that changed from the base of the PR and between f3163da and 720df56.

📒 Files selected for processing (7)
  • docs/content/docs/(core)/prompts.mdx
  • prompts/en/channel.md.j2
  • src/llm/anthropic/params.rs
  • src/llm/model.rs
  • src/mcp.rs
  • src/prompts.rs
  • src/prompts/engine.rs
✅ Files skipped from review due to trivial changes (4)
  • prompts/en/channel.md.j2
  • src/prompts.rs
  • docs/content/docs/(core)/prompts.mdx
  • src/llm/model.rs

Comment on lines 134 to 143

```diff
 if let Some(preamble) = &request.preamble {
-    let mut preamble_block = serde_json::json!({
-        "type": "text",
-        "text": preamble,
-    });
-    if let Some(cc) = cache_control {
-        preamble_block["cache_control"] = cc.clone();
-    }
-    system_blocks.push(preamble_block);
+    if let Some((stable_prefix, volatile_suffix)) =
+        crate::prompts::engine::split_system_prompt_cache_boundary(preamble)
+    {
+        push_system_text_block(&mut system_blocks, stable_prefix, cache_control);
+        push_system_text_block(&mut system_blocks, volatile_suffix, &None);
+    } else {
+        push_system_text_block(&mut system_blocks, preamble, cache_control);
+    }
 }
```
@coderabbitai Bot (Contributor) commented:

⚠️ Potential issue | 🟡 Minor

Multiple boundary markers: inconsistent handling between providers.

split_system_prompt_cache_boundary uses split_once, so if the preamble contains more than one SYSTEM_PROMPT_CACHE_BOUNDARY marker, any subsequent markers remain verbatim inside the volatile suffix text block sent to Anthropic. Meanwhile the non-Anthropic path in src/llm/model.rs uses strip_system_prompt_cache_boundary (a replace), which removes all occurrences. Today there's only one marker in the channel template, so this is latent rather than exploitable, but if another template or fragment ever adds the marker Anthropic users would see the raw HTML comment leak into the system prompt while other providers would not.

Consider either splitting on first and stripping the marker from the remaining suffix, or asserting/documenting single-marker invariant.

🔧 Proposed tweak

```diff
         if let Some((stable_prefix, volatile_suffix)) =
             crate::prompts::engine::split_system_prompt_cache_boundary(preamble)
         {
+            let volatile_suffix =
+                crate::prompts::strip_system_prompt_cache_boundary(volatile_suffix);
             push_system_text_block(&mut system_blocks, stable_prefix, cache_control);
-            push_system_text_block(&mut system_blocks, volatile_suffix, &None);
+            push_system_text_block(&mut system_blocks, &volatile_suffix, &None);
         } else {
             push_system_text_block(&mut system_blocks, preamble, cache_control);
         }
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/llm/anthropic/params.rs` around lines 134 - 143, The Anthropic branch
currently uses crate::prompts::engine::split_system_prompt_cache_boundary which
returns a (stable_prefix, volatile_suffix) via split_once but leaves any
additional SYSTEM_PROMPT_CACHE_BOUNDARY markers inside volatile_suffix; update
the logic in the block that calls split_system_prompt_cache_boundary (and then
push_system_text_block) to ensure remaining markers are removed from the
volatile suffix—e.g., after receiving volatile_suffix call the same
strip/replace helper used elsewhere (strip_system_prompt_cache_boundary or
equivalent replace) before passing it to push_system_text_block—so Anthropic and
non-Anthropic paths handle multiple markers consistently (alternative: assert a
single-marker invariant if you prefer).
