Implements a LiteLLM guardrail plugin that integrates with Rubrik's security service. Supports both OpenAI and Anthropic formats in streaming and non-streaming modes with fail-open error handling. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Force-pushed from fa8f57c to 36e738f.
Greptile Summary
This PR adds the Rubrik LiteLLM plugin. Key design points:
Issues found:
Confidence Score: 4/5
| Filename | Overview |
|---|---|
| litellm/integrations/rubrik.py | Core plugin implementation (1139 lines). Well-structured with fail-open behavior throughout; all issues from previous review (#22935) addressed. Two new minor issues: _detect_llm_response_format is unguarded against None proxy_server_request in the streaming hook (unlike the non-streaming hook), and buffered_choice becomes a stale reference after deep copy in _replay_filtered_tool_chunks. |
| litellm/proxy/guardrails/guardrail_hooks/rubrik/init.py | Guardrail hook initializer; registers RubrikLogger in guardrail_initializer_registry and guardrail_class_registry. Minimal and correct. |
| litellm/types/guardrails.py | Adds RUBRIK = "rubrik" to SupportedGuardrailIntegrations enum. Single-line, correct change. |
| tests/test_litellm/integrations/test_rubrik_openai.py | 28 tests covering initialization, URL/sampling config, batch logging, OpenAI non-streaming blocking, streaming blocking (all/partial/GPT-5 style), and fail-open recovery. All network calls are mocked; real ModelResponse fixtures used for the model_validate path. No real network calls detected. |
| tests/test_litellm/integrations/test_rubrik_anthropic.py | 14 tests for Anthropic non-streaming and streaming tool blocking, format detection, and pass-through for text-only responses. All mocked; SSE fixture files exercised via real file reads. |
Sequence Diagram
sequenceDiagram
participant Client
participant LiteLLM Proxy
participant RubrikLogger
participant LLM API
participant Rubrik Blocking Service
participant Rubrik Logging Service
Client->>LiteLLM Proxy: POST /v1/chat/completions (or /v1/messages)
LiteLLM Proxy->>LLM API: Forward request
LLM API-->>LiteLLM Proxy: Response (streaming or non-streaming)
alt Non-streaming (async_post_call_success_hook)
LiteLLM Proxy->>RubrikLogger: async_post_call_success_hook(response)
RubrikLogger->>RubrikLogger: _detect_llm_response_format()
alt OpenAI format with tool_calls
RubrikLogger->>Rubrik Blocking Service: POST /v1/after_completion/openai/v1
Rubrik Blocking Service-->>RubrikLogger: Filtered response (allowed tools only)
RubrikLogger->>RubrikLogger: ModelResponse.model_validate(modified_dict)
else Anthropic format with tool_use blocks
RubrikLogger->>RubrikLogger: _anthropic_response_to_openai_dict()
RubrikLogger->>Rubrik Blocking Service: POST /v1/after_completion/openai/v1
Rubrik Blocking Service-->>RubrikLogger: Filtered OpenAI dict
RubrikLogger->>RubrikLogger: _openai_dict_to_anthropic_response()
else Text-only or unknown format
RubrikLogger-->>LiteLLM Proxy: Return original response unchanged
end
RubrikLogger-->>LiteLLM Proxy: Modified response
else Streaming (async_post_call_streaming_iterator_hook)
LiteLLM Proxy->>RubrikLogger: async_post_call_streaming_iterator_hook(stream)
loop For each chunk in stream
alt Pre-tool chunks (no tool_calls yet)
RubrikLogger->>Client: Yield chunk immediately
else Tool-call chunk
RubrikLogger->>RubrikLogger: Buffer + accumulate tool call deltas
else Finish chunk (finish_reason set)
RubrikLogger->>RubrikLogger: Append to buffer
RubrikLogger->>Rubrik Blocking Service: POST /v1/after_completion/openai/v1
Rubrik Blocking Service-->>RubrikLogger: Allowed tool list + explanation
alt All tools allowed
RubrikLogger->>Client: Replay buffered chunks unchanged
else Some/all tools blocked
RubrikLogger->>Client: Replay filtered chunks + explanation + finish
end
end
end
end
Note over RubrikLogger,Rubrik Logging Service: Async batch logging (sampled)
RubrikLogger->>RubrikLogger: async_log_success_event → log_queue
RubrikLogger->>Rubrik Logging Service: POST /v1/litellm/batch (when queue full or periodic flush)
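The sampled batch-logging path at the bottom of the diagram can be sketched as follows. This is a minimal illustration only — the class name, method names, and in-memory "sent batches" list are hypothetical stand-ins; the real plugin inherits queueing and periodic flushing from CustomBatchLogger and POSTs to /v1/litellm/batch.

```python
import random

class BatchLogSketch:
    """Hypothetical sketch of sampled, size-triggered batch logging."""

    def __init__(self, batch_size: int = 512, sampling_rate: float = 1.0):
        self.batch_size = batch_size
        self.sampling_rate = sampling_rate
        self.log_queue: list[dict] = []
        self.sent_batches: list[list[dict]] = []  # stands in for POST /v1/litellm/batch

    def log_event(self, payload: dict) -> None:
        # Sampling: keep only the configured fraction of events
        if random.random() >= self.sampling_rate:
            return
        self.log_queue.append(payload)
        # Flush when the queue reaches the batch size (periodic flush omitted here)
        if len(self.log_queue) >= self.batch_size:
            self.sent_batches.append(self.log_queue)
            self.log_queue = []

logger = BatchLogSketch(batch_size=2, sampling_rate=1.0)
for i in range(5):
    logger.log_event({"id": i})
print(len(logger.sent_batches), len(logger.log_queue))  # 2 batches sent, 1 event pending
```

With sampling_rate=1.0 every event is kept, so five events with batch_size=2 produce two full batches and one pending event.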
Comments Outside Diff (2)
- litellm/integrations/rubrik.py, line 360 (link)
Unguarded AttributeError in streaming hook if proxy_server_request is None

_detect_llm_response_format performs:

proxy_request = request_data.get("proxy_server_request", {})
url = proxy_request.get("url", "")

dict.get(key, default) only returns the default when the key is absent. If request_data is {"proxy_server_request": None} (key present, value None), proxy_request is None and the chained .get("url", "") raises AttributeError.

In async_post_call_success_hook this is harmless — the outer try/except Exception catches it and returns the original response unchanged (fail-open). However, async_post_call_streaming_iterator_hook calls _detect_llm_response_format with no surrounding exception handler, so the AttributeError would propagate out of the generator and abort the streaming response entirely rather than failing open.

A minimal fix is to guard against None. Or wrap the call in the streaming hook:

try:
    response_format = self._detect_llm_response_format(request_data)
except Exception:
    verbose_logger.warning("Could not detect LLM response format — passing stream through")
    async for chunk in response:
        yield chunk
    return
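The None-guard variant can be sketched in isolation; this is an illustrative snippet, not the plugin's actual code, demonstrating why `dict.get`'s default does not protect against an explicit None value:

```python
# dict.get's default only applies when the key is absent, not when its value is None
request_data = {"proxy_server_request": None}

# "or {}" coerces an explicit None back to an empty dict before chaining .get()
proxy_request = request_data.get("proxy_server_request") or {}
url = proxy_request.get("url", "")
print(repr(url))  # → ''
```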
- litellm/integrations/rubrik.py, line 499-530 (link)
Stale buffered_choice reference after deep copy

buffered_choice is assigned once at line 500 from the original buffered_chunk.choices[0]. When the code takes the GPT-5 all-blocked path at lines 513-514:

buffered_chunk = buffered_chunk.model_copy(deep=True)
buffered_chunk.choices[0].delta.tool_calls = None

buffered_chunk is now a new object, but buffered_choice still refers to the old choices[0]. The subsequent check at line 528 reads buffered_choice.finish_reason from the stale reference.

In practice this produces the correct result today — a deep copy preserves finish_reason unchanged, so both the stale and fresh references agree — but it creates a maintenance hazard. If a future change sets finish_reason on the deep copy before line 528, the stale reference would silently disagree.

Consider refreshing the alias after the copy, or using buffered_chunk.choices[0].finish_reason consistently throughout the function:

if not filtered_calls:
    if not buffered_choice.finish_reason:
        continue
    buffered_chunk = buffered_chunk.model_copy(deep=True)
    buffered_choice = buffered_chunk.choices[0]  # refresh after copy
    buffered_choice.delta.tool_calls = None
Last reviewed commit: "Apply Black formatti..."
…remove alias - Add early-return guard for Anthropic text-only responses (skip blocking service) - Remove dead _send_batch method (parent calls async_send_batch directly) - Remove unnecessary _REPLACED_TYPES local alias Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
)

self.log_queue = []
asyncio.create_task(self.periodic_flush())
asyncio.create_task in synchronous __init__ requires a running event loop
asyncio.create_task(self.periodic_flush()) is called inside __init__, but __init__ is a synchronous method. In Python 3.10+, asyncio.create_task() raises RuntimeError: no running event loop if there is no currently-running event loop.
The initialize_guardrail function in litellm/proxy/guardrails/guardrail_hooks/rubrik/__init__.py is a plain synchronous function — it has no async def signature. Depending on when LiteLLM calls it during the proxy startup lifecycle, there may be no running event loop at that point. The tests work around this by patching asyncio.create_task with a Mock(), which confirms the fragility.
A safer pattern used by other LiteLLM integrations is to start the periodic flush lazily on first use (e.g., from inside an async callback), or to ensure construction only happens inside an already-running event loop (e.g., via an async factory method or a __post_init__ async hook).
# Instead of calling in __init__:
asyncio.create_task(self.periodic_flush())

# Consider lazily initialising in the first async call, e.g.:
async def _ensure_flush_task_started(self):
    if not hasattr(self, "_flush_task") or self._flush_task.done():
        self._flush_task = asyncio.create_task(self.periodic_flush())
This is the established pattern across LiteLLM integrations — at least 15 other CustomBatchLogger subclasses call asyncio.create_task(self.periodic_flush()) in __init__ the same way (langsmith, datadog, s3_v2, gcs_bucket, slack_alerting, sqs, argilla, opik, azure_storage, azure_sentinel, posthog, generic_api_callback, etc.).
In practice, the proxy constructs these loggers during startup after the event loop is already running (FastAPI's lifespan), so the create_task call succeeds. Diverging from the established pattern for this one integration would be inconsistent without a broader refactor of all batch loggers.
Strategy:
1. Pass through chunks until the first tool call appears
2. Buffer all subsequent chunks once a tool call is detected
3. On message_delta (stream end), validate tools with blocking service
4. Emit buffered chunks excluding blocked tools (with sequential reindexing)
5. Append explanation text block if any tools were blocked
6. Emit final message_delta and message_stop

Returns:
    SSE-encoded bytes ready to be sent to the client.
"""
parsed_stream = self._parse_anthropic_sse_stream(response)
accumulated_tools: dict[str, AnthropicToolCallData] = {}
index_to_tool: dict[int, AnthropicToolCallData] = {}
buffered_chunks: list[dict[str, Any]] = []
is_buffering = False
replay_index_base = -1

async for chunk in parsed_stream:
    event_type = chunk.get("type")
    is_tool_chunk = self._is_tool_related_anthropic_chunk(chunk)

    if not is_buffering and not is_tool_chunk and event_type in _CONTENT_BLOCK_EVENTS:
        replay_index_base = chunk.get("index", 0)

    if is_tool_chunk:
        is_buffering = True
        self._accumulate_anthropic_tool_call(chunk, accumulated_tools, index_to_tool)

    # Pass through non-tool chunks before any tools appear
    if not is_buffering:
        yield self._encode_anthropic_chunk_to_sse(chunk)
        continue

    buffered_chunks.append(chunk)

    # Continue buffering until stream ends
    if event_type != _EVENT_MESSAGE_DELTA:
        continue

    # Stream complete - validate tools and emit filtered results
    blocked_indices, explanation = await self._get_blocked_anthropic_tool_calls(accumulated_tools)

    # Emit allowed chunks with sequential content block reindexing
    for buffered_chunk in buffered_chunks:
        if not self._should_yield_anthropic_chunk(buffered_chunk, blocked_indices):
            continue

        if buffered_chunk.get("type") in _CONTENT_BLOCK_EVENTS:
            if buffered_chunk.get("type") == _EVENT_CONTENT_BLOCK_START:
                replay_index_base += 1
            buffered_chunk["index"] = replay_index_base
        yield self._encode_anthropic_chunk_to_sse(buffered_chunk)

    # Add explanation text block if any tools were blocked
    if explanation:
        for explanation_chunk in self._generate_anthropic_text_block(explanation, replay_index_base + 1):
            yield self._encode_anthropic_chunk_to_sse(explanation_chunk)

    # Adjust stop_reason if all tools were blocked
    if accumulated_tools and len(blocked_indices) == len(accumulated_tools):
        chunk["delta"]["stop_reason"] = "end_turn"
    yield self._encode_anthropic_chunk_to_sse(chunk)

    # Successfully processed — clear buffer so fail-open guard doesn't re-emit
    buffered_chunks.clear()
    is_buffering = False

    # Drain remaining stream events (e.g., message_stop)
    async for remaining_chunk in parsed_stream:
        yield self._encode_anthropic_chunk_to_sse(remaining_chunk)

# Post-loop fail-open: if we were still buffering when the stream ended
# (e.g., no message_delta was received), emit buffered non-terminal chunks as-is
if is_buffering and buffered_chunks:
    verbose_logger.warning("Anthropic stream ended while still buffering — emitting buffered chunks (fail-open)")
    for buffered_chunk in buffered_chunks:
        event_type = buffered_chunk.get("type")
        if event_type not in _TERMINAL_MESSAGE_EVENTS:
            yield self._encode_anthropic_chunk_to_sse(buffered_chunk)

@staticmethod
def _accumulate_openai_tool_calls(
    choice: StreamingChoices,
Provider-specific Anthropic SSE code lives outside llms/ directory
The custom rule for this repository says to avoid writing provider-specific code outside the llms/ directory. _handle_anthropic_streaming, _parse_anthropic_sse_stream, _encode_anthropic_chunk_to_sse, _decode_all_anthropic_sse_events, _is_tool_related_anthropic_chunk, _accumulate_anthropic_tool_call, and the related constants (_EVENT_CONTENT_BLOCK_START, etc.) all contain Anthropic-specific SSE protocol handling directly in litellm/integrations/rubrik.py. While the plugin does delegate the format conversion work to helpers imported from litellm.llms.anthropic.*, the streaming state machine and raw SSE encode/decode logic is self-contained here.
Ideally the SSE parsing/encoding helpers should live under litellm/llms/anthropic/ and be imported here, keeping the integration file focused on orchestration rather than protocol implementation.
Rule Used: What: Avoid writing provider-specific code outside... (source)
This was raised on the previous PR (#22935) as well. The Anthropic SSE handling here is specific to the plugin's buffering/replay architecture for tool blocking — it's not general-purpose Anthropic SSE parsing that other code would reuse. The llms/anthropic/ directory contains the canonical format transformation utilities (which we import and use), while the streaming state machine here is tightly coupled to the tool-blocking orchestration logic. Extracting it would create a module in llms/anthropic/ that has exactly one consumer.
# Create a dedicated httpx client for tool blocking to avoid connection pooling issues
# with LiteLLM's shared client
self.tool_blocking_client = httpx.AsyncClient(
    timeout=httpx.Timeout(5.0, connect=2.0),  # 2s connect timeout, 5s total timeout
    limits=httpx.Limits(max_connections=10, max_keepalive_connections=5),
)
tool_blocking_client has no registered shutdown/cleanup hook
A bare httpx.AsyncClient is created and stored at self.tool_blocking_client, and an aclose() method is defined (line 166-168). However, aclose() is never wired up to any lifecycle manager, shutdown handler, or atexit hook. If the plugin is swapped out (e.g., during a config reload) or the proxy exits unexpectedly, the underlying TCP connection pool will be leaked.
Other LiteLLM integrations that create dedicated httpx clients typically register cleanup in a proxy shutdown event or expose the client through a context manager. Consider registering aclose() with FastAPI's shutdown lifecycle or using LiteLLM's shared client via get_async_httpx_client (which handles pooling and cleanup centrally) for the tool-blocking calls as well.
This was also raised on #22935. We added the `aclose()` method (line 167) for this purpose. We use a dedicated client rather than `get_async_httpx_client` because the tool-blocking service is on the critical path with different timeout requirements (2s connect, 5s total) than the shared logging client.
Wiring `aclose()` into FastAPI's shutdown lifecycle would require changes to the proxy's guardrail management layer, which is outside the scope of this plugin PR. The connection pool will be cleaned up by the OS on process exit, and in the config-reload case, the guardrail manager would need to call `aclose()` on the old instance — that's a framework-level concern rather than something this plugin should self-manage.
Adds CustomGuardrail as a parent class (multiple inheritance with CustomBatchLogger) so RubrikLogger has get_config_model() and other guardrail infrastructure methods required by the guardrail registry. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
def _decode_all_anthropic_sse_events(raw_chunk: bytes) -> list[dict[str, Any]]:
    """
    Decode all Anthropic SSE events from a raw chunk.

    A single raw chunk may contain multiple SSE events separated by blank lines.
    """
    data_prefix = "data:"
    events: list[dict[str, Any]] = []

    for line in raw_chunk.decode("utf-8").split("\n"):
        stripped_line = line.strip()
        if stripped_line.startswith(data_prefix):
            json_payload = stripped_line[len(data_prefix) :].strip()
            events.append(json.loads(json_payload))

    return events
SSE event: line is discarded during decoding, making encode/decode asymmetric
_decode_all_anthropic_sse_events silently drops the event: <type> line from each SSE event and relies exclusively on the "type" field inside the data JSON to carry the event type. While this works for the current Anthropic SSE format (the type is always present in the payload), the encode path in _encode_anthropic_chunk_to_sse re-attaches an event: line derived from the same "type" field, making the pipeline asymmetric with the upstream bytes.
A more robust decoder would parse the event: line alongside data:, letting the SSE envelope be the authoritative source of the event type (as the spec intends) and validating it matches the JSON payload:
current_event_type: str | None = None
for line in raw_chunk.decode("utf-8").split("\n"):
    stripped = line.strip()
    if stripped.startswith("event:"):
        current_event_type = stripped[len("event:"):].strip()
    elif stripped.startswith("data:"):
        json_payload = stripped[len("data:"):].strip()
        event = json.loads(json_payload)
        if current_event_type and "type" not in event:
            event["type"] = current_event_type
        events.append(event)
        current_event_type = None
The asymmetry is intentional. Anthropic's API contract guarantees that every SSE data payload includes a type field — it's how their SDK identifies event types too (see the Anthropic streaming docs). The type field in the JSON is the authoritative source, not the SSE event: envelope line.
Parsing the event: line would add complexity to handle a scenario that doesn't exist with the Anthropic API. The encode side re-derives event: from type to produce well-formed SSE, which is correct — the client receives properly formatted SSE with matching event: and data.type values.
We do sanitize the type field during encoding (stripping \r/\n) as a defense-in-depth measure, which was a fix from the previous PR review.
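For illustration, the encode side described above — re-deriving the event: line from the payload's type field, with CR/LF sanitization — can be sketched as a standalone function. The name and signature here are hypothetical; the plugin's actual _encode_anthropic_chunk_to_sse is a method and may differ in detail.

```python
import json
from typing import Any

def encode_anthropic_chunk_to_sse(chunk: dict[str, Any]) -> bytes:
    # Re-derive the SSE "event:" line from the JSON "type" field,
    # stripping CR/LF as a defense-in-depth sanitization step
    event_type = str(chunk.get("type", "")).replace("\r", "").replace("\n", "")
    return f"event: {event_type}\ndata: {json.dumps(chunk)}\n\n".encode("utf-8")

encoded = encode_anthropic_chunk_to_sse({"type": "message_stop"})
print(encoded)  # b'event: message_stop\ndata: {"type": "message_stop"}\n\n'
```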
@greptile review |
…ix comment - Remove redundant self.log_queue assignment (parent already initializes) - Add explicit return after Anthropic stream drain loop - Fix misleading comment about httpx error behavior Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…uard, dedup system msg - Use model_validate instead of setattr to preserve Pydantic types - Convert Anthropic usage (input_tokens/output_tokens) to OpenAI format - Add per-event JSONDecodeError handling in SSE decoder - Skip system message insertion if one already exists Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…fer layer The outer buffering_generator/chunks_forwarded mechanism had a bug: chunks_forwarded counted handler yields while buffered_chunks tracked input consumption — different numbers, causing incorrect slicing in the fail-open path. Each handler now owns its own try/except around the blocking service call and emits buffered chunks directly on failure, eliminating the dual-buffer complexity entirely. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…st helper Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…nthropic round-trip Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…rgument integrity Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
… during reindex Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
# Conflicts: # litellm/types/guardrails.py
…ue immutability Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Please use the apply_guardrail function instead of async_post_call_success_hook. This will ensure it works across all endpoints, where we normalize the inputs.
Thanks for the review. I'll update the implementation. When I looked before, apply_guardrail didn't fit our use case (e.g. the docs here, which show that guardrails can only operate on text, not the tool calls we need). The current version in the codebase is perfect for our use case though.
@krrishdholakia OK, so I tried updating this plugin to use apply_guardrail and ran into the following limitation. It looks like the guardrail code only allows two approaches to tool moderation: modifying the tool call (e.g. redacting PII from arguments) or throwing an exception to abort the response entirely.
Neither of these fit our intent: we want to remove tool calls entirely from the response while not making it an error. We add some additional text to the response stating that the tool call has been removed to allow agents to respond to the block.
So it appears that we don't fit LiteLLM's intended model here. I'll think about this on our side; we may be able to adapt our implementation to account for this different model, but it would take some doing. In the meantime, do you have any other suggestions (other than reverting to my bespoke async_post_call_success_hook strategy)?
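The behavior described above — dropping blocked tool calls from the response without raising, and appending explanatory text so agents can react to the block — can be sketched on an OpenAI-style response dict. All names and the dict shape here are illustrative only, not the plugin's actual code:

```python
from typing import Any

def strip_blocked_tool_calls(response: dict[str, Any], allowed: set[str], explanation: str) -> dict[str, Any]:
    """Remove disallowed tool calls from a response dict without erroring (sketch)."""
    message = response["choices"][0]["message"]
    original = message.get("tool_calls") or []
    kept = [tc for tc in original if tc["function"]["name"] in allowed]
    if len(kept) < len(original):
        # Substitute explanatory text for the blocked calls instead of raising
        message["content"] = (message.get("content") or "") + explanation
    message["tool_calls"] = kept or None
    if not kept:
        response["choices"][0]["finish_reason"] = "stop"  # no longer a tool-call turn
    return response

resp = {"choices": [{"finish_reason": "tool_calls",
                     "message": {"content": None,
                                 "tool_calls": [{"function": {"name": "delete_files"}}]}}]}
out = strip_blocked_tool_calls(resp, allowed=set(), explanation="[Tool call removed by policy]")
print(out["choices"][0]["finish_reason"], out["choices"][0]["message"]["tool_calls"])
```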
I posted #24262 that adds the guardrail functionality we need (ability to delete tool calls from responses) - iterating with greptile now.
Adds the Rubrik LiteLLM plugin (litellm/integrations/rubrik.py) — a callback logger that provides two capabilities:
Tool call blocking: Intercepts LLM responses (both streaming and non-streaming), validates tool calls against a Rubrik blocking service, and filters disallowed tools from the response before it reaches the client.
Request/response logging: Batches and logs LLM interactions to a Rubrik backend with configurable sampling.
The plugin supports both OpenAI and Anthropic response formats, handles streaming correctly (including GPT-5 style finish chunks), and uses a fail-open design — if the blocking service is unavailable, responses are returned unchanged.
This plugin has been running in production at Rubrik for some time and is well tested against real-world traffic patterns.
Configuration
Environment variables:
RUBRIK_WEBHOOK_URL (required) — Base URL for the Rubrik service
RUBRIK_API_KEY (optional) — Bearer token for authentication
RUBRIK_BATCH_SIZE (optional) — Batch size for logging (default 512)
RUBRIK_SAMPLING_RATE (optional) — Fraction of requests to log, e.g. 0.5 (default 1.0)
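A minimal sketch of how these variables might be parsed, using the defaults listed above. The helper name and returned dict shape are hypothetical; only the variable names and defaults come from this PR description:

```python
def load_rubrik_config(env: dict[str, str]) -> dict:
    """Parse Rubrik plugin settings from an environment-style mapping (sketch)."""
    url = env.get("RUBRIK_WEBHOOK_URL")
    if not url:
        raise ValueError("RUBRIK_WEBHOOK_URL is required")
    return {
        "webhook_url": url.rstrip("/"),        # normalize trailing slash
        "api_key": env.get("RUBRIK_API_KEY"),  # optional bearer token
        "batch_size": int(env.get("RUBRIK_BATCH_SIZE", "512")),
        "sampling_rate": float(env.get("RUBRIK_SAMPLING_RATE", "1.0")),
    }

cfg = load_rubrik_config({"RUBRIK_WEBHOOK_URL": "https://rubrik.example.com/"})
print(cfg["batch_size"], cfg["sampling_rate"], cfg["webhook_url"])
# 512 1.0 https://rubrik.example.com
```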
Tests
Includes 42 unit tests across two test files with a comprehensive test framework:
test_rubrik_openai.py (28 tests) — Initialization, URL stripping, sampling rate parsing, non-streaming tool blocking (allow/block/partial), streaming tool blocking with accumulated deltas, GPT-5 duplicate-args regression, fail-open error recovery, real SSE fixture data
test_rubrik_anthropic.py (14 tests) — Anthropic non-streaming tool blocking (allow/block/partial/text-only), OpenAI↔Anthropic format round-trip with and without usage key, streaming with tool blocking/allowing/text-only pass-through, format detection, real SSE fixture data
Test fixtures in rubrik_test_sample_data/ include captured OpenAI and Anthropic responses (JSON and SSE streams) for realistic end-to-end testing.
Files
litellm/integrations/rubrik.py — Plugin implementation
tests/test_litellm/integrations/test_rubrik_openai.py — OpenAI format tests
tests/test_litellm/integrations/test_rubrik_anthropic.py — Anthropic format tests
tests/test_litellm/integrations/rubrik_test_sample_data/ — 13 test fixture files
Note that this is the second PR introducing this plugin. The previous one, #22935, was inadvertently based off an old version of the plugin. This PR incorporates all of the actionable feedback that greptile gave on that PR.
Relevant issues
Pre-Submission checklist
Please complete all items before asking a LiteLLM maintainer to review your PR
I have added a test in the tests/test_litellm/ directory (adding at least 1 test is a hard requirement - see details)
I have run make test-unit
I have asked @greptileai for a review and received a Confidence Score of at least 4/5 before requesting a maintainer review
If you're seeing a delay in your PR being merged, ping the LiteLLM Team on Slack (#pr-review).
CI (LiteLLM team)
Branch creation CI run
Link:
CI run for the last commit
Link:
Merge / cherry-pick CI run
Links:
Type
🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation
🚄 Infrastructure
✅ Test
Changes