Improve search tool to extract resolved urls by amrit110 · Pull Request #28 · VectorInstitute/eval-agents

amrit110 · 2026-02-02T18:27:34Z

This pull request improves the search tool and URL handling in the agent evaluation framework so that agents receive real, fetchable URLs from Google Search. The changes include a new redirect resolution utility, an overhaul of the search tool to return resolved URLs, dependency updates to support these features, and expanded test coverage. There are also minor improvements and fixes in other modules.

Search tool improvements:

Introduced a new google_search function and revamped create_google_search_tool to use FunctionTool, enabling agents to receive search results with real, resolved URLs instead of redirect links. This supports a research workflow where agents can search, fetch, verify, and answer using actual web sources. (aieng/agent_evals/tools/search.py, aieng/agent_evals/tools/__init__.py, tests/aieng/agent_evals/tools/test_search.py) [1] [2] [3] [4] [5] [6] [7]
Added a new _redirect.py module that provides asynchronous utilities for resolving known redirect URLs (such as those from Vertex AI Search) to their final destinations, with caching and retry logic. This is used by the updated search tool to ensure agents get usable URLs. (aieng/agent_evals/tools/_redirect.py)
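
The redirect resolution described above can be sketched roughly as follows. This is a minimal illustration and not the code in `_redirect.py`: the class name `RedirectResolver`, the `REDIRECT_PREFIXES` value, and the injected `follow` coroutine are all assumptions made for the example, and the HTTP request is left to the caller so the sketch runs without network access. A real implementation would catch its HTTP client's own error types rather than `OSError`.

```python
import asyncio
from typing import Awaitable, Callable, Iterable

# Assumed prefix for redirect links; the real module may match them differently.
REDIRECT_PREFIXES = (
    "https://vertexaisearch.cloud.google.com/grounding-api-redirect/",
)


class RedirectResolver:
    """Resolve known redirect URLs to their final destination, with cache and retries."""

    def __init__(
        self,
        follow: Callable[[str], Awaitable[str]],
        max_attempts: int = 3,
        base_delay: float = 0.5,
    ) -> None:
        self._follow = follow  # hypothetical coroutine that follows the redirect
        self._max_attempts = max_attempts
        self._base_delay = base_delay
        self._cache: dict[str, str] = {}

    async def resolve(self, url: str) -> str:
        # URLs that are not known redirects are returned unchanged.
        if not url.startswith(REDIRECT_PREFIXES):
            return url
        # Serve repeated lookups from the cache.
        if url in self._cache:
            return self._cache[url]
        for attempt in range(self._max_attempts):
            try:
                final_url = await self._follow(url)
            except OSError:
                # Treat as transient: wait with exponential backoff, then retry.
                await asyncio.sleep(self._base_delay * 2**attempt)
                continue
            self._cache[url] = final_url
            return final_url
        # All attempts failed: fall back to the original URL instead of raising.
        return url

    async def resolve_many(self, urls: Iterable[str]) -> list[str]:
        # Resolve all URLs concurrently, preserving input order.
        return list(await asyncio.gather(*(self.resolve(u) for u in urls)))
```

Injecting `follow` keeps the caching and retry behaviour testable in isolation; a search function could call `resolve_many` on the URLs in its raw results before returning them to the agent.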

Dependency and configuration updates:

Updated pyproject.toml to add new dependencies required for HTTP requests, PDF handling, and HTML conversion, and incremented the package version. Also added a new CLI entry point for the knowledge agent. (pyproject.toml) [1] [2]

Testing enhancements:

Expanded tests for the new search tool, including checks that ensure returned URLs are real and not redirects, and that the response structure is as expected. (tests/aieng/agent_evals/tools/test_search.py)
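
A check of the kind described above might look like the sketch below. The response shape (a dict with a `status` field and a `results` list of `title`/`url` entries), the helper names, and the redirect marker are assumptions for illustration; the actual tests in `test_search.py` may be structured differently.

```python
from urllib.parse import urlparse

# Assumed substring that identifies an unresolved redirect link.
REDIRECT_MARKER = "grounding-api-redirect"


def is_real_url(url: str) -> bool:
    """Return True for an http(s) URL that does not look like a redirect link."""
    parsed = urlparse(url)
    return (
        parsed.scheme in ("http", "https")
        and bool(parsed.netloc)
        and REDIRECT_MARKER not in url
    )


def check_search_response(response: dict) -> None:
    """Assert the (hypothetical) response structure and that every URL is resolved."""
    assert response.get("status") == "success"
    results = response.get("results")
    assert isinstance(results, list)
    for item in results:
        # Each result is expected to carry at least a title and a URL.
        assert item.keys() >= {"title", "url"}
        assert is_real_url(item["url"]), f"unresolved or invalid URL: {item['url']}"
```

In a pytest suite, `check_search_response` could be called on the output of the search function, with the underlying search client mocked.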

Other improvements and fixes:

Improved the report generation agent to accept both LocalExperimentItem and DatasetItemClient types, handling both dict and class attribute access patterns. (aieng/agent_evals/report_generation/evaluation.py) [1] [2] [3] [4]
Small typing fixes caught by stricter mypy rules. (implementations/aml_investigation/agent.py) [1] [2] [3]
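
The dual access pattern mentioned for the report generation agent (dict access versus class attribute access) can be illustrated with a small helper. This is a sketch under assumptions: `get_field` and `ExampleItem` are hypothetical names, and the field names are placeholders rather than the real attributes of `LocalExperimentItem` or `DatasetItemClient`.

```python
from collections.abc import Mapping
from dataclasses import dataclass
from typing import Any


def get_field(item: Any, name: str, default: Any = None) -> Any:
    """Read `name` from a mapping-style item or from an attribute-style item."""
    if isinstance(item, Mapping):
        return item.get(name, default)
    return getattr(item, name, default)


@dataclass
class ExampleItem:
    """Hypothetical stand-in for an item that exposes fields as attributes."""

    input: str
    expected_output: str


# Both item styles are read through the same call.
dict_item = {"input": "question", "expected_output": "answer"}
class_item = ExampleItem(input="question", expected_output="answer")
fields = [get_field(i, "expected_output") for i in (dict_item, class_item)]
```

Centralising the lookup in one helper keeps the evaluation code free of repeated `isinstance` branches.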

aieng-eval-agents/aieng/agent_evals/tools/search.py

aieng-eval-agents/aieng/agent_evals/report_generation/evaluation.py

aieng-eval-agents/aieng/agent_evals/tools/_redirect.py

aieng-eval-agents/aieng/agent_evals/tools/search.py

implementations/aml_investigation/data/cli.py

lotif · 2026-02-03T20:57:06Z

aieng-eval-agents/pyproject.toml

Why do we have two pyproject.toml files?

One for the project as a whole mostly used for development, and one for the package.

I see. I believe there is a way to do it with a single top-level toml file, but it is low priority; we can investigate later.

Improve search tool to extract resolved urls

438a383

amrit110 self-assigned this Feb 2, 2026

amrit110 added the enhancement (New feature or request) and refactor (Refactor or clean up code structure) labels Feb 2, 2026

amrit110 added 3 commits February 2, 2026 13:29

Add web fetch tool

c2b09d2

Add html_to_markdown dependency

67bb38d

Add html_to_markdown dependency

da81995

amrit110 changed the title ~~Improve search tool to extract resolved urls~~ Improve search tool to extract resolved urls, add web fetch tool Feb 2, 2026

Refactor out the redirect url code

05471dc

amrit110 changed the title ~~Improve search tool to extract resolved urls, add web fetch tool~~ Improve search tool to extract resolved urls Feb 2, 2026

amrit110 requested review from fcogidi and lotif February 2, 2026 19:09

amrit110 added 8 commits February 2, 2026 15:14

Remove unused synchronous version

5685b65

Fix merge conflict

fc29b79

Merge branch 'main' into ak/improve_search_tool

ce62b75

Merge branch 'main' into ak/improve_search_tool

ca7d319

Merge branch 'ak/improve_search_tool' of github.com:VectorInstitute/e…

3e8938a

…val-agents into ak/improve_search_tool

Fix merge conflicts

5ada2fa

Fix typing issues

1caea9e

Merge branch 'main' into ak/improve_search_tool

c6b1207

fcogidi approved these changes Feb 3, 2026

aieng-eval-agents/aieng/agent_evals/tools/search.py (outdated)

lotif reviewed Feb 3, 2026

amrit110 and others added 8 commits February 3, 2026 23:40

Update search fn to async

fa7a951

Merge branch 'ak/improve_search_tool' of github.com:VectorInstitute/e…

871530f

…val-agents into ak/improve_search_tool

Use modern operator to denote union of types

f6792cf

[pre-commit.ci] Add auto fixes from pre-commit.com hooks

5ecb97f

for more information, see https://pre-commit.ci

Use tenacity for retries

1fee32a

Merge branch 'ak/improve_search_tool' of github.com:VectorInstitute/e…

11655f8

…val-agents into ak/improve_search_tool

Fix config in test using mock

5eebe9f

Improve return docstring

9a73472

amrit110 added 2 commits February 4, 2026 00:23

Fix test

6f93ea0

Remove use of cast, lets stop lying to the type checker

2a9f4bb

amrit110 merged commit b4eda32 into main Feb 4, 2026
3 checks passed

amrit110 deleted the ak/improve_search_tool branch February 4, 2026 14:43

amrit110 merged 23 commits into main from ak/improve_search_tool


3 participants
