Skip to content

Evals UI: session 404, non-clickable results link, undocumented port change, unclear evalset discovery #5423

@kellyoconor

Description

@kellyoconor

Environment

  • ADK version: 1.29.x (installed via pip)
  • Platform: macOS
  • Running: adk web --session_service_uri="sqlite:///./sessions.db" .

Issues found

1. Eval session 404 — user_id mismatch between eval runner and UI

The eval runner saves sessions using session_input.user_id from the evalset file (defaults to test_user_id if not set). The frontend UI hardcodes userId="user" when fetching session results:

GET /apps/app/users/user/sessions/___eval___session___<uuid> → 404

The session exists in the DB under a different user_id (e.g. eval_user). The UI never finds it.

Workaround: Set "user_id": "user" in all evalset files' session_input blocks.

Suggested fix: The UI should use the user_id returned in the eval result rather than hardcoding "user".

2. "View eval run result" tooltip is not clickable

Even after fixing the 404 (sessions return 200 OK), the "View eval run result" text appears only as a tooltip on the Fail/Pass badge — it's not a hyperlink or button. There's no way to navigate to the session trace from the eval results view.

3. Undocumented port change from 8501 to 8000

adk web now serves on port 8000 instead of 8501. This isn't called out in migration/upgrade notes, causing confusion when switching between versions.

4. Evalset file discovery is fragile and undocumented

It's not obvious that .evalset.json files must live directly inside the agent directory (e.g. app/) for adk web to discover them. Attempting to organize them into a subdirectory (e.g. app/eval_sets/) causes them to silently disappear from the UI. The LocalEvalSetsManager only scans the immediate agent directory. This should be documented, or subdirectory scanning should be supported.

Steps to reproduce

  1. Create evalset files with "user_id": "eval_user" in session_input
  2. Run adk web --session_service_uri="sqlite:///./sessions.db" .
  3. Run an eval from the UI
  4. Observe 404 in server logs for session fetch
  5. Change user_id to "user" — 404 resolves to 200
  6. Observe "View eval run result" tooltip is still not clickable

Metadata

Metadata

Labels

request clarification[Status] The maintainer need clarification or more information from the authorweb[Component] This issue will be transferred to adk-web

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions