feat(llm): pass llm params directly #1387
base: develop
Conversation
Implements tool call extraction and passthrough functionality in LLMRails:
- Add tool_calls_var context variable for storing LLM tool calls
- Refactor llm_call utils to extract and store tool calls from responses
- Support tool calls in both GenerationResponse and dict message formats
- Add ToolMessage support for langchain message conversion
- Comprehensive test coverage for tool calling integration
Add example configuration and documentation for using NVIDIA NeMoGuard NIMs, including content moderation, topic control, and jailbreak detection.
Update verbose logging to safely handle cases where log records may not have 'id' or 'task' attributes. Prevents potential AttributeError and improves robustness of LLM and prompt log output formatting.
… Runnable protocol support
- Implement comprehensive async/sync invoke, batch, and streaming support
- Add robust input/output transformation for all LangChain formats (ChatPromptValue, BaseMessage, dict, string)
- Enhance chaining behavior with intelligent __or__ method handling RunnableBinding and complex chains
- Add concurrency controls, error handling, and configurable blocking messages
- Implement proper tool calling support with tool call passthrough
- Add extensive test suite (14 test files, 2800+ lines) covering all major functionality including batching, streaming, composition, piping, and tool calling
- Reorganize and expand test structure for better maintainability

apply review suggestions
…Rails Ensure AIMessage responses from RunnableRails contain the same metadata fields (response_metadata, usage_metadata, additional_kwargs, id) as direct LLM calls, enabling consistent LangChain integration behavior.
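As an illustration of the metadata parity this commit describes, here is a minimal sketch of an `AIMessage` carrying the listed fields; the field names come from the commit message and langchain-core, while the values are invented:

```python
# Illustrative only: an AIMessage with the metadata fields listed above.
# Values are made up; RunnableRails would populate them from the LLM response.
from langchain_core.messages import AIMessage

msg = AIMessage(
    content="The weather is sunny.",
    response_metadata={"model_name": "gpt-4o-mini", "finish_reason": "stop"},
    usage_metadata={"input_tokens": 12, "output_tokens": 6, "total_tokens": 18},
    additional_kwargs={},
    id="run-abc123",
)
print(msg.response_metadata, msg.usage_metadata)
```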
Enhance streaming in RunnableRails to include generation metadata in streamed chunks. Skips END_OF_STREAM markers and updates chunk formatting to support metadata for AIMessageChunk outputs. This improves compatibility with consumers expecting metadata in streaming responses.

fix
fix
Introduce tool output/input rails configuration and Colang flows for tool call validation and parameter security checks. Add support for BotToolCall event emission in passthrough mode, enabling tool call guardrails before execution.
…ion and processing
- Add UserToolMessages event handling and tool input rails processing
- Fix message-to-event conversion to properly handle tool messages in conversation history
- Preserve tool call context in passthrough mode by using full conversation history
- Support tool_calls and tool message metadata in LangChain format conversion
- Include comprehensive test suite for tool input rails functionality

test(runnable_rails): fix prompt format in passthrough mode
feat: support ToolMessage in message dicts
refactor: rename BotToolCall to BotToolCalls
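As a rough illustration of the "support ToolMessage in message dicts" item, a sketch of converting a tool-role message dict into LangChain's `ToolMessage`; the helper name is hypothetical, and only `ToolMessage` and its fields come from langchain-core:

```python
# Hypothetical helper sketching the dict -> ToolMessage conversion mentioned
# above; not the actual conversion code from this PR.
from langchain_core.messages import ToolMessage

def tool_dict_to_message(msg: dict) -> ToolMessage:
    """Convert a {"role": "tool", ...} message dict into a LangChain ToolMessage."""
    return ToolMessage(
        content=msg["content"],
        tool_call_id=msg["tool_call_id"],
    )

print(tool_dict_to_message({"role": "tool", "content": "72°F", "tool_call_id": "call_1"}))
```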
Extend llm_call to accept an optional llm_params dictionary for passing configuration parameters (e.g., temperature, max_tokens) to the language model. This enables more flexible control over LLM behavior during calls.

refactor(llm): replace llm_params context manager with argument
Update all usages of the llm_params context manager to pass llm_params as an argument to llm_call instead. This simplifies parameter handling and improves code clarity for LLM calls.

docs: clarify prompt customization and llm_params usage
update LLMChain config usage
add unit and e2e tests
fix failing tests
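A hedged sketch of the calling convention these commits describe. `llm_call` lives in `nemoguardrails.actions.llm.utils`, but the exact signature of the new `llm_params` argument is an assumption here:

```python
# Sketch of per-call parameter passing; the llm_params keyword is the addition
# this PR describes, so treat the exact signature as an assumption.
import asyncio

from langchain_openai import ChatOpenAI
from nemoguardrails.actions.llm.utils import llm_call

async def main() -> None:
    llm = ChatOpenAI(model="gpt-4o-mini")
    # Previously: `with llm_params(llm, temperature=0.7, max_tokens=200): ...`
    result = await llm_call(
        llm,
        "Summarize the guardrails concept in one sentence.",
        llm_params={"temperature": 0.7, "max_tokens": 200},
    )
    print(result)

asyncio.run(main())
```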
Pull Request Overview
This PR migrates from using context managers for LLM parameter management to passing parameters directly to the `llm_call` function. The change leverages LangChain's universal `.bind()` method to pass parameters like temperature and max_tokens directly to LLM models without temporarily modifying their state.
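For context, a minimal sketch of the `.bind()` mechanism in isolation; the model name and parameter values are arbitrary examples, not taken from this PR:

```python
# .bind() returns a new runnable with the kwargs attached to every invocation;
# the original model object is left untouched, unlike the old context manager,
# which temporarily mutated it.
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini")
bound_llm = llm.bind(temperature=1.0, max_tokens=100)

print(bound_llm.invoke("Say hello in five words or fewer").content)
```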
Key changes:
- Added `llm_params` parameter to the `llm_call` function for direct parameter passing
- Replaced all `with llm_params(...)` context manager usage with direct parameter passing
- Updated tests to cover the new parameter passing approach
Reviewed Changes
Copilot reviewed 21 out of 21 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| `nemoguardrails/actions/llm/utils.py` | Added llm_params parameter to llm_call and implemented LLM binding |
| `tests/test_tool_calling_utils.py` | Added comprehensive tests for new parameter passing functionality |
| `tests/test_llm_params_e2e.py` | New end-to-end tests for LLM parameter functionality with real providers |
| `tests/test_llm_params.py` | Added migration tests comparing context manager with direct parameter approach |
| Various action files | Updated all LLM calls to use direct parameter passing instead of context managers |
| `docs/user-guides/advanced/prompt-customization.md` | Updated documentation example to show new parameter passing syntax |
```diff
 chain = LLMChain(prompt=last_bot_prompt, llm=llm)

-# Generate multiple responses with temperature 1.
-with llm_params(llm, temperature=1.0, n=num_responses):
-    extra_llm_response = await chain.agenerate(
-        [{"text": last_bot_prompt_string}],
-        run_manager=logging_callback_manager_for_chain,
-    )
+# Use chain.with_config for runtime parameters
+configured_chain = chain.with_config(
+    configurable={"temperature": 1.0, "n": num_responses}
+)
+extra_llm_response = await configured_chain.agenerate(
```
The use of `chain.with_config()` with the `configurable` parameter differs from the pattern used elsewhere in the codebase. This should use the `llm_params` approach for consistency, or the LLM should be bound directly with `.bind()` before creating the chain.
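For reference, a sketch of the second alternative mentioned here (binding the LLM before building the chain). The variable names are taken from the diff above, the fragment assumes the same surrounding async function, and whether LLMChain accepts a bound runnable depends on the installed LangChain version:

```python
# Sketch only: bind per-call parameters to the LLM, then build the chain.
# Names (llm, last_bot_prompt, num_responses, ...) come from the diff above.
bound_llm = llm.bind(temperature=1.0, n=num_responses)
chain = LLMChain(prompt=last_bot_prompt, llm=bound_llm)

extra_llm_response = await chain.agenerate(
    [{"text": last_bot_prompt_string}],
    run_manager=logging_callback_manager_for_chain,
)
```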
```python
)
negative_answer_result = create_negatives_chain.invoke(
    {"evidence": evidence, "answer": answer},
    config={"temperature": 0.8, "max_tokens": 300},
```
The use of `chain.invoke()` with the `config` parameter is inconsistent with the migration pattern used throughout the rest of the codebase. This should follow the same `llm_params` pattern for consistency.
config={"temperature": 0.8, "max_tokens": 300}, | |
llm_params={"temperature": 0.8, "max_tokens": 300}, |
Looks good overall, but 4k LOC is too large for a single PR.
I'm a little confused about a few things:
- Why did we use a context manager to pass a dict of LLM parameters in the first place? Normally they're used to make sure we close files/DB connections so we don't forget.
- Does a context manager break some LangChain functionality?
- Can you add some local integration tests to make sure this works when calling tools with production LLMs?
Description
langchain-community models support the `.bind()` method:

```
langchain-core (contains Runnable interface with .bind())
├── langchain-openai (inherits .bind())
├── langchain-community (inherits .bind())
├── langchain-anthropic (inherits .bind())
└── langchain-* (all inherit .bind()?)
```
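A quick, hedged way to sanity-check this locally (assuming langchain-openai and langchain-anthropic are installed; other providers can be checked the same way):

```python
# Confirms that .bind() is inherited from langchain-core's Runnable rather than
# re-implemented by each provider package.
from langchain_core.runnables import Runnable
from langchain_openai import ChatOpenAI
from langchain_anthropic import ChatAnthropic

for cls in (ChatOpenAI, ChatAnthropic):
    assert issubclass(cls, Runnable)
    assert "bind" not in cls.__dict__  # not overridden; comes from Runnable

print("All checked chat models expose .bind() via langchain-core's Runnable")
```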
Related Issue(s)
Checklist