@tradeqvest tradeqvest commented Sep 22, 2025

Fix parallel tool call limit enforcement

Problem

The tool_calls_limit in UsageLimits was not properly enforced for parallel tool execution. When multiple tools were called in parallel, all tools would start executing before the limit was checked, allowing the limit to be exceeded before raising UsageLimitExceeded.

For example, if tool_calls_limit=6 and the model returned 8 parallel tool calls, all 8 tools would start executing before the error was raised.

Solution

This PR modifies the parallel tool execution logic in _agent_graph.py to enforce the limit before starting tool tasks:

  1. Pre-execution limit check: Before creating async tasks for parallel tools, we now check how many tool calls remain within the limit
  2. Limited execution: Only start tasks for the allowed number of tool calls
  3. Error after completion: Raise UsageLimitExceeded once the allowed tools complete, if more calls were requested than the limit permits
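The actual change lives in _agent_graph.py; the three steps above can be sketched standalone roughly as follows (the function name run_parallel_tool_calls and the stand-in run_tool are illustrative, not pydantic-ai's real API):

```python
import asyncio


class UsageLimitExceeded(Exception):
    """Raised when a usage limit (here: tool_calls_limit) is exceeded."""


async def run_tool(name: str) -> str:
    # Stand-in for real tool execution.
    await asyncio.sleep(0)
    return f'{name}:ok'


async def run_parallel_tool_calls(calls: list[str], tool_calls_made: int, tool_calls_limit: int) -> list[str]:
    # 1. Pre-execution limit check: how many calls still fit within the limit?
    remaining = max(tool_calls_limit - tool_calls_made, 0)
    allowed, rejected = calls[:remaining], calls[remaining:]
    # 2. Limited execution: only start tasks for the allowed calls.
    results = await asyncio.gather(*(run_tool(c) for c in allowed))
    # 3. Error after completion: raise if the model requested more than the limit.
    if rejected:
        raise UsageLimitExceeded(
            f'{len(rejected)} tool call(s) would exceed the tool_calls_limit of {tool_calls_limit}'
        )
    return results
```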

@tradeqvest tradeqvest marked this pull request as ready for review September 22, 2025 13:10
DouweM commented Sep 26, 2025

@tradeqvest Since we'll raise an error regardless, is it worth executing tool calls up to the limit, instead of just raising immediately?

I was thinking we could do something like this:

usage = ctx.state.usage
if ctx.deps.usage_limits.count_tokens_before_request:
    # Copy to avoid modifying the original usage object with the counted usage
    usage = deepcopy(usage)
    counted_usage = await ctx.deps.model.count_tokens(message_history, model_settings, model_request_parameters)
    usage.incr(counted_usage)
ctx.deps.usage_limits.check_before_request(usage)

Where we optimistically increment a copied version of the usage, and check the usage limit against that.

tradeqvest commented Sep 26, 2025

@tradeqvest Since we'll raise an error regardless, is it worth executing tool calls up to the limit, instead of just raising immediately?

I was thinking we could do something like this:

usage = ctx.state.usage
if ctx.deps.usage_limits.count_tokens_before_request:
    # Copy to avoid modifying the original usage object with the counted usage
    usage = deepcopy(usage)
    counted_usage = await ctx.deps.model.count_tokens(message_history, model_settings, model_request_parameters)
    usage.incr(counted_usage)
ctx.deps.usage_limits.check_before_request(usage)

Where we optimistically increment a copied version of the usage, and check the usage limit against that.

@DouweM It would definitely be simpler, but I was thinking that the tool output produced up to the UsageLimit violation could still be valuable: it could be captured and processed further. Let me know what you think.

DouweM commented Sep 29, 2025

@tradeqvest I think it'd be misleading if those results never get sent back to the model to use, and the user will think their action failed even though some tools (with side effects) may have in fact been executed. If we had a way to, instead of failing hard, tell the model "this call was not executed because you hit the limit" for the calls over the limit, executing the earlier ones makes sense, but until we have such a mode I'd rather not run the tools at all.
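The "soft" mode described here does not exist in the library yet; if it did, the core of it might look something like this hypothetical helper, which executes calls up to the limit and synthesizes "not executed" results for the rest instead of raising (all names here are invented for illustration):

```python
def partition_tool_calls(calls: list[str], remaining: int) -> tuple[list[str], list[dict]]:
    """Split requested calls into ones to execute and synthetic 'not executed' results.

    The skipped entries would be sent back to the model as tool results, so it
    knows those calls were rejected due to the limit rather than silently dropped.
    """
    allowed = calls[:max(remaining, 0)]
    skipped = [
        {'tool': name, 'content': 'Tool call not executed: tool_calls_limit reached.'}
        for name in calls[max(remaining, 0):]
    ]
    return allowed, skipped
```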
