feat: Enhance ResponseGuardrailSpec with additional fields #4231

natedemoss · 2025-11-25T18:28:50Z

What does this PR do?

Adds a production-oriented ResponseGuardrailSpec model to src/llama_stack_api/agents.py enabling structured guardrail configuration during response generation. Supports both string guardrail IDs and inline specs via the union ResponseGuardrail = str | ResponseGuardrailSpec. Fields include: type, description, enabled, severity, action, policy_id, version, categories, thresholds, max_violations, config, tags, metadata. Enforces strict schema (extra='forbid') and provides a normalized() helper for category cleanup.

Test Plan

Default construction

g = ResponseGuardrailSpec(type="llama-guard")
assert g.enabled is True
assert g.severity is None
 ```
### Enum validation (should fail)
```python
from pydantic import ValidationError
try:
 ResponseGuardrailSpec(type="x", severity="critical")
 assert False, "Expected ValidationError"
except ValidationError:
 pass

Extra key rejection

try:
    ResponseGuardrailSpec(type="x", unknown=1)
    assert False
except ValidationError:
    pass

Category normalization

g = ResponseGuardrailSpec(type="x", categories=[" Violence ", "Self-Harm"]).normalized()
assert g.categories == ["violence", "self-harm"]

Union usage in API (integration)

POST with guardrails: ["llama-guard"] → succeeds.
POST with guardrails: [{"type":"llama-guard","severity":"warn","categories":["violence"]}] → succeeds.

OpenAPI/spec regeneration

Run schema generation script.
Verify guardrails now shows oneOf (string | object) and object schema lists all new fields.

Negative thresholds (optional future test if validation added)

Add validator; ensure invalid values raise ValidationError.

All tests pass locally (manual execution). Add automated unit test file in follow-up PR if not already present.

-- Nate DeMoss

Added fields for guardrail configuration including description, enabled, severity, action, policy_id, version, categories, thresholds, max_violations, config, tags, and metadata.

meta-cla · 2025-11-25T18:28:57Z

Hi @natedemoss!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

meta-cla · 2025-11-25T18:33:57Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

cdoern

one question to get started

cdoern · 2025-11-26T19:37:01Z

src/llama_stack_api/agents.py

+
+    Fields
+    ------
+    type: Identifier for the guardrail implementation (e.g. 'llama-guard', 'content-filter').


these look great, but can I ask where these are coming from?

Hi, the “type” values come from the backend guardrail config, not from agents.py. They’re IDs the server maps to concrete handlers (e.g., llama-guard, content-filter). Resolution happens during create_openai_response on the backend. If helpful, I can add a doc note in ResponseGuardrailSpec pointing to the registry module or config path.

Also, I can make a new PR with a comment with less specificality.

Enhance ResponseGuardrailSpec with additional fields

836275d

Added fields for guardrail configuration including description, enabled, severity, action, policy_id, version, categories, thresholds, max_violations, config, tags, and metadata.

natedemoss requested review from ashwinb, bbrowning, cdoern, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners November 25, 2025 18:28

natedemoss changed the title ~~Enhance ResponseGuardrailSpec with additional fields~~ feat: Enhance ResponseGuardrailSpec with additional fields Nov 25, 2025

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 25, 2025

cdoern reviewed Nov 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Enhance ResponseGuardrailSpec with additional fields #4231

feat: Enhance ResponseGuardrailSpec with additional fields #4231

Uh oh!

natedemoss commented Nov 25, 2025 •

edited

Loading

Uh oh!

meta-cla bot commented Nov 25, 2025

Uh oh!

meta-cla bot commented Nov 25, 2025

Uh oh!

cdoern left a comment

Uh oh!

cdoern Nov 26, 2025

Uh oh!

natedemoss Nov 26, 2025 •

edited

Loading

Uh oh!

natedemoss Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Enhance ResponseGuardrailSpec with additional fields #4231

Are you sure you want to change the base?

feat: Enhance ResponseGuardrailSpec with additional fields #4231

Uh oh!

Conversation

natedemoss commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Default construction

Extra key rejection

Category normalization

Union usage in API (integration)

OpenAPI/spec regeneration

Negative thresholds (optional future test if validation added)

Uh oh!

meta-cla bot commented Nov 25, 2025

Action Required

Process

Uh oh!

meta-cla bot commented Nov 25, 2025

Uh oh!

cdoern left a comment

Choose a reason for hiding this comment

Uh oh!

cdoern Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

natedemoss Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

natedemoss Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

natedemoss commented Nov 25, 2025 •

edited

Loading

natedemoss Nov 26, 2025 •

edited

Loading