TODO: throttle on async validators #755

lla-dane · 2025-07-10T17:52:33Z

Fixed a TODO: Implement throttle on async validators in libp2p/pubsub/pubsub.py::validate_msg().

Used semaphores to limit concurrency while running concurrent async_validators:

semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS)

async def run_async_validator(func: AsyncValidatorFn) -> None:
    async with semaphore:
        result = await func(msg_forwarder, msg)
        results.append(result)

async with trio.open_nursery() as nursery:
    for async_validator in async_topic_validators:
        nursery.start_soon(run_async_validator, async_validator)

lla-dane · 2025-07-10T18:00:13Z

Will add the tests for this and make the concurrency limit configurable. By the mean time, please do check if the solution is correct @seetadev @pacrob

lla-dane · 2025-07-11T03:57:29Z

Added the tests to check that the concurrency limit is respected. Please check i f everything is alright, and the default concurrency limit is correct. @pacrob @seetadev

paschal533 · 2025-07-11T17:08:40Z

Hi @lla-dane, This looks like a solid fix for the async validator throttling TODO, but there's a critical issue where the default parameter limit: trio.Semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) creates a new semaphore on every call instead of sharing one globally, you should move the semaphore to be an instance variable in the __init__ method like self._validator_semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) and use that directly in the run_async_validator function. Also consider making the concurrency limit configurable via a constructor parameter, and the test could be simplified to focus more on the actual semaphore behavior rather than mocking the entire validate_msg method. Overall it's a good implementation that properly addresses resource exhaustion concerns, just needs that parameter fix to actually work correctly.

lla-dane · 2025-07-12T02:54:03Z

Hey @paschal533

Hi @lla-dane, This looks like a solid fix for the async validator throttling TODO, but there's a critical issue where the default parameter limit: trio.Semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) creates a new semaphore on every call instead of sharing one globally, you should move the semaphore to be an instance variable in the init method like self._validator_semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) and use that directly in the run_async_validator function. Also consider making the concurrency limit configurable via a constructor parameter.

Did this, added a self._validator_semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) in the PubSub constructor, and is also configurable via the constructor parameter.

the test could be simplified to focus more on the actual semaphore behavior rather than mocking the entire validate_msg method.

For this, added the concurrency checker part in the original validate_msg test. so now the throttle and validate_msg both are getting tested in the same test.

Does this work @paschal533 ?

seetadev · 2025-07-12T14:15:15Z

@lla-dane : Hi Abhinav. Thank you for submitting the PR. Appreciate your great efforts and initiative.

Wish to ask whether you got a chance to review Varun's efforts at #647 and #710 (reference issue: #709 )

pacrob · 2025-07-12T20:23:05Z

@lla-dane - Looking good! I'm concerned about copying so much code directly from Pubsub though. What do you think about extracting run_async_validator out to a class method _run_async_validator and then just mocking that? You'd need to adjust arguments, but way less copied code in test.

paschal533 · 2025-07-13T13:03:45Z

Hi @lla-dane, This looks like a solid fix for the async validator throttling TODO, but there's a critical issue where the default parameter limit: trio.Semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) creates a new semaphore on every call instead of sharing one globally, you should move the semaphore to be an instance variable in the init method like self._validator_semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) and use that directly in the run_async_validator function. Also consider making the concurrency limit configurable via a constructor parameter.

Did this, added a self._validator_semaphore = trio.Semaphore(MAX_CONCURRENT_VALIDATORS) in the PubSub constructor, and is also configurable via the constructor parameter.

the test could be simplified to focus more on the actual semaphore behavior rather than mocking the entire validate_msg method.

For this, added the concurrency checker part in the original validate_msg test. so now the throttle and validate_msg both are getting tested in the same test.

Does this work @paschal533 ?

This is a great approach. this definitely fixes the main issue, and integrating the concurrency checking into the existing validate_msg test is actually a cleaner solution than having separate tests. The combined test approach makes sense since you're testing that the throttling works correctly within the actual validation flow rather than in isolation. This should work well now. The semaphore will now properly limit concurrency across all calls to validate_msg on the same Pubsub instance, which is exactly what we want for resource management. Great job and Well done @lla-dane 👏

lla-dane · 2025-07-13T17:22:21Z

libp2p/pubsub/pubsub.py

+    async def _run_async_validator(
+        self,
+        func: AsyncValidatorFn,
+        msg_forwarder: ID,
+        msg: rpc_pb2.Message,
+        results: list[bool],
+    ) -> None:
+        async with self._validator_semaphore:
+            result = await func(msg_forwarder, msg)
+            results.append(result)


@pacrob: separated the run_async_validator method to a separate class method as you suggested. Now there is only this much duplicated code from the pubsub.py in the tests:

async def mock_run_async_validator( self, func: AsyncValidatorFn, msg_forwarder: ID, msg: rpc_pb2.Message, results: list[bool], ) -> None: async with self._validator_semaphore: async with lock: state["concurrency_counter"] += 1 if state["concurrency_counter"] > state["max_observed"]: state["max_observed"] = state["concurrency_counter"] try: result = await func(msg_forwarder, msg) results.append(result) finally: async with lock: state["concurrency_counter"] -= 1

lla-dane · 2025-07-13T17:23:31Z

@lla-dane - Looking good! I'm concerned about copying so much code directly from Pubsub though. What do you think about extracting run_async_validator out to a class method _run_async_validator and then just mocking that? You'd need to adjust arguments, but way less copied code in test.

@pacrob : Did a few changes as you suggested here. Please see if everything is alright.

pacrob · 2025-07-18T01:24:52Z

libp2p/pubsub/pubsub.py

            async def run_async_validator(func: AsyncValidatorFn) -> None:
-                result = await func(msg_forwarder, msg)
-                results.append(result)
+                async with self._validator_semaphore:
+                    result = await func(msg_forwarder, msg)
+                    results.append(result)


Since you extracted the logic out, this code can be deleted, right?

My bad, forgot to remove this. Now removed!!

…case

lla-dane force-pushed the todo/throttle-async-val branch from 79620cf to 0731202 Compare July 12, 2025 02:56

lla-dane commented Jul 13, 2025

View reviewed changes

pacrob reviewed Jul 18, 2025

View reviewed changes

lla-dane added 6 commits July 18, 2025 10:25

fixed todo: throttle on async validators

d52779b

added test: validate message respects concurrency limit

d8e8949

added newsfragment

794dbee

added configurable validator semaphore in the PubSub constructor

95cf2c0

added the concurrency-checker in the original test-validate-msg test …

2ab0936

…case

separate out a _run_async_validator function

3ef5fe8

lla-dane force-pushed the todo/throttle-async-val branch from e3bc7bb to 3ef5fe8 Compare July 18, 2025 04:55

remove redundant run_async_validator

468e341

pacrob approved these changes Jul 18, 2025

View reviewed changes

pacrob merged commit 11560f5 into libp2p:main Jul 18, 2025
28 checks passed

seetadev mentioned this pull request Jul 22, 2025

Implement validation throttler for message validation in Pubsub #710

Closed

3 tasks

lla-dane deleted the todo/throttle-async-val branch September 1, 2025 11:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TODO: throttle on async validators #755

TODO: throttle on async validators #755

Uh oh!

lla-dane commented Jul 10, 2025 •

edited

Loading

Uh oh!

lla-dane commented Jul 10, 2025 •

edited

Loading

Uh oh!

lla-dane commented Jul 11, 2025

Uh oh!

paschal533 commented Jul 11, 2025

Uh oh!

lla-dane commented Jul 12, 2025

Uh oh!

seetadev commented Jul 12, 2025

Uh oh!

pacrob commented Jul 12, 2025

Uh oh!

paschal533 commented Jul 13, 2025 •

edited

Loading

Uh oh!

lla-dane Jul 13, 2025

Uh oh!

lla-dane commented Jul 13, 2025

Uh oh!

pacrob Jul 18, 2025

Uh oh!

lla-dane Jul 18, 2025

Uh oh!

Uh oh!

Uh oh!

TODO: throttle on async validators #755

TODO: throttle on async validators #755

Uh oh!

Conversation

lla-dane commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lla-dane commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lla-dane commented Jul 11, 2025

Uh oh!

paschal533 commented Jul 11, 2025

Uh oh!

lla-dane commented Jul 12, 2025

Uh oh!

seetadev commented Jul 12, 2025

Uh oh!

pacrob commented Jul 12, 2025

Uh oh!

paschal533 commented Jul 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lla-dane Jul 13, 2025

Choose a reason for hiding this comment

Uh oh!

lla-dane commented Jul 13, 2025

Uh oh!

pacrob Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

lla-dane Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lla-dane commented Jul 10, 2025 •

edited

Loading

lla-dane commented Jul 10, 2025 •

edited

Loading

paschal533 commented Jul 13, 2025 •

edited

Loading