feat: implement repository analysis agent with recommendation API #27

naaa760 · 2025-11-17T08:23:29Z

fix: #22

Add a new RepositoryAnalysisAgent that analyzes GitHub repositories to automatically generate personalized Watchflow rule recommendations. The agent uses LangGraph for multi-step analysis including repository structure examination, contributing guidelines parsing, and pattern-based rule generation with confidence scoring.

Key Features:

Repository Analysis: Examines workflows, contributors, branch protection, and code patterns
Contributing Guidelines Parsing: Uses LLM to extract requirements from CONTRIBUTING.md files
Rule Recommendations: Generates valid Watchflow YAML rules with confidence scores and reasoning
API Integration: Adds POST /api/v1/rules/recommend endpoint with caching and rate limiting
Agent Factory: Integrated into the existing agent system via get_agent("repository_analysis")

Files Added:

src/agents/repository_analysis_agent/ - Complete agent implementation
src/api/recommendations.py - REST API endpoints
Updated src/agents/factory.py and src/main.py for integration

Acceptance Criteria:

Analyzes repositories and generates valid rule recommendations
Recommendations include confidence scores and clear reasoning
API endpoint accepts repository identifiers and returns structured data
Integrates seamlessly with existing Watchflow architecture
All recommendations are valid Watchflow rule YAML
Comprehensive error handling and validation

dkargatzis · 2025-11-17T21:46:18Z

Hello @naaa760 - thanks for the PR, it adds really cool functionality for better / smother UX. Could we please address the pre-commit and test issues? Also please give an example of the expected outcome - what are the rules that this endpoint suggests for a high volume repo.

…ine agent’s evaluation prompt.

…y cases) stable.

naaa760 · 2025-11-29T02:20:10Z

@dkargatzis
please check!

dkargatzis · 2025-11-29T14:27:09Z

@naaa760 I guess we still need to instruct the agent / pass that context to the agent?

…d_field_in_diff) and wired them into the registry

…tors introduced in code

…f using snippets.

codecov-commenter · 2025-12-10T13:04:49Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

❌ Patch coverage is 50.09901% with 252 lines in your changes missing coverage. Please review.

❌ Your patch status has failed because the patch coverage (50.0%) is below the target coverage (80.0%). You can increase the patch coverage or adjust the target coverage.
❌ Your project status has failed because the head coverage (32.0%) is below the target coverage (80.0%). You can increase the head coverage or adjust the target coverage.
❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

@@           Coverage Diff           @@
##            main     #27     +/-   ##
=======================================
+ Coverage   24.7%   32.0%   +7.2%     
=======================================
  Files         79      85      +6     
  Lines       4497    5000    +503     
=======================================
+ Hits        1115    1604    +489     
- Misses      3382    3396     +14

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a518bcb...2f1c0b9. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

dkargatzis · 2025-12-10T13:14:43Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a new Repository Analysis Agent for recommending Watchflow rules, along with a corresponding API endpoint and new diff-aware validators. The implementation is extensive and adds significant new capabilities.

My review has identified a few key areas for improvement:

Inconsistent Documentation: The documentation for the new diff-aware validators is duplicated and contains conflicting information, which could confuse users.
Incomplete Agent Logic: The repository analysis agent's implementation appears incomplete. The analysis of CONTRIBUTING.md is not fully implemented, and the core rule recommendations are hardcoded instead of being dynamically generated based on repository analysis, which doesn't align with the feature's description.
Inconsistent Validator Implementation: There's a critical issue in src/rules/validators.py where a new validator uses an inconsistent method for file path matching, which could lead to incorrect behavior.
Testing and Utilities: I've also included some suggestions to improve test case assertions and make the new caching utility safer for future use.

Overall, this is a great feature addition, but the agent's core logic needs to be completed to match its intended design, and the inconsistencies in documentation and implementation should be resolved.

gemini-code-assist · 2025-12-10T13:16:59Z

src/rules/validators.py

+    @staticmethod
+    def _matches_any(path: str, patterns: list[str]) -> bool:
+        if not path:
+            return False
+        posix_path = PurePosixPath(path)
+        return any(posix_path.match(pattern) for pattern in patterns)


This _matches_any static method inside RequiredFieldInDiffCondition is inconsistent with the more powerful, custom _matches_any helper function defined at the module level (line 69). This implementation uses PurePosixPath.match, which does not handle ** for recursive directory matching in the same way as the custom glob logic. This will lead to inconsistent and potentially incorrect file matching behavior compared to other new validators like DiffPatternCondition. This class should use the shared _matches_any helper from the module to ensure consistent glob pattern matching across all validators. Please remove this static method so the module-level one is used.

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/nodes.py

+                    prompt = CONTRIBUTING_GUIDELINES_ANALYSIS_PROMPT.format(content=content)
+                    await llm.ainvoke(prompt)
+
+                    # TODO: Parse JSON response and create ContributingGuidelinesAnalysis


This TODO indicates that the LLM response from analyzing CONTRIBUTING.md is not being parsed and used. The function currently falls back to creating a ContributingGuidelinesAnalysis object with just the raw content. This is a significant gap in the implementation, as the structured analysis of contribution guidelines is a key part of the agent's described functionality.

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/nodes.py

+async def generate_rule_recommendations(state: RepositoryAnalysisState) -> dict[str, Any]:
+    """
+    Generate Watchflow rule recommendations based on repository analysis.
+    """
+    try:
+        logger.info(f" Generating rule recommendations for {state.repository_full_name}")
+
+        recommendations = []
+
+        features = state.repository_features
+        contributing = state.contributing_analysis
+
+        # Diff-aware: enforce filter handling in core RAG/query code
+        recommendations.append(
+            RuleRecommendation(
+                yaml_content="""description: "Block merges when PRs change filter validation logic without failing on invalid inputs"
+enabled: true
+severity: "high"
+event_types: ["pull_request"]
+parameters:
+  file_patterns:
+    - "packages/core/src/**/vector-query.ts"
+    - "packages/core/src/**/graph-rag.ts"
+    - "packages/core/src/**/filters/*.ts"
+  require_patterns:
+    - "throw\\\\s+new\\\\s+Error"
+    - "raise\\\\s+ValueError"
+  forbidden_patterns:
+    - "return\\\\s+.*filter\\\\s*$"
+  how_to_fix: "Ensure invalid filters raise descriptive errors instead of silently returning unfiltered results."
+""",
+                confidence=0.85,
+                reasoning="Filter handling regressions were flagged in historical fixes; enforce throws on invalid input.",
+                source_patterns=["pr_history"],
+                category="quality",
+                estimated_impact="high",
+            )
+        )
+
+        # Diff-aware: enforce test updates when core code changes
+        recommendations.append(
+            RuleRecommendation(
+                yaml_content="""description: "Require regression tests when modifying tool schema validation or client tool execution"
+enabled: true
+severity: "medium"
+event_types: ["pull_request"]
+parameters:
+  source_patterns:
+    - "packages/core/src/**/tool*.ts"
+    - "packages/core/src/agent/**"
+    - "packages/client/**"
+  test_patterns:
+    - "packages/core/tests/**"
+    - "tests/**"
+  min_test_files: 1
+  rationale: "Tool invocation changes have previously caused regressions in clientTools streaming."
+""",
+                confidence=0.8,
+                reasoning="Core tool changes often broke client tools; require at least one related test update.",
+                source_patterns=["pr_history"],
+                category="quality",
+                estimated_impact="medium",
+            )
+        )
+
+        # Diff-aware: ensure agent descriptions exist
+        recommendations.append(
+            RuleRecommendation(
+                yaml_content="""description: "Ensure every agent exposes a user-facing description for UI profiles"
+enabled: true
+severity: "low"
+event_types: ["pull_request"]
+parameters:
+  file_patterns:
+    - "packages/core/src/agent/**"
+  required_text:
+    - "description"
+  message: "Add or update the agent description so downstream UIs can render capabilities."
+""",
+                confidence=0.75,
+                reasoning="Agent profile UIs require descriptions; ensure new/updated agents include them.",
+                source_patterns=["pr_history"],
+                category="process",
+                estimated_impact="low",
+            )
+        )
+
+        # Diff-aware: preserve URL handling for supported providers
+        recommendations.append(
+            RuleRecommendation(
+                yaml_content="""description: "Block merges when URL or asset handling changes bypass provider capability checks"
+enabled: true
+severity: "high"
+event_types: ["pull_request"]
+parameters:
+  file_patterns:
+    - "packages/core/src/agent/message-list/**"
+    - "packages/core/src/llm/**"
+  require_patterns:
+    - "isUrlSupportedByModel"
+  forbidden_patterns:
+    - "downloadAssetsFromMessages\\(messages\\)"
+  how_to_fix: "Preserve remote URLs for providers that support them natively; only download assets for unsupported providers."
+""",
+                confidence=0.8,
+                reasoning="Past URL handling bugs; ensure capability checks remain intact.",
+                source_patterns=["pr_history"],
+                category="quality",
+                estimated_impact="high",
+            )
+        )
+
+        # Legacy structural signals retained for completeness
+        if features.has_workflows:
+            recommendations.append(
+                RuleRecommendation(
+                    yaml_content="""description: "Require CI checks to pass"
+enabled: true
+severity: "high"
+event_types:
+  - pull_request
+conditions:
+  - type: "ci_checks_passed"
+    parameters:
+      required_checks: []
+actions:
+  - type: "block_merge"
+    parameters:
+      message: "All CI checks must pass before merging"
+""",
+                    confidence=0.9,
+                    reasoning="Repository has CI workflows configured, so requiring checks to pass is a standard practice",
+                    source_patterns=["has_workflows"],
+                    category="quality",
+                    estimated_impact="high",
+                )
+            )
+
+        if features.has_codeowners:
+            recommendations.append(
+                RuleRecommendation(
+                    yaml_content="""description: "Require CODEOWNERS approval for changes"
+enabled: true
+severity: "medium"
+event_types:
+  - pull_request
+conditions:
+  - type: "codeowners_approved"
+    parameters: {}
+actions:
+  - type: "require_approval"
+    parameters:
+      message: "CODEOWNERS must approve changes to owned files"
+""",
+                    confidence=0.8,
+                    reasoning="CODEOWNERS file exists, indicating ownership requirements for code changes",
+                    source_patterns=["has_codeowners"],
+                    category="process",
+                    estimated_impact="medium",
+                )
+            )
+
+        if contributing.requires_tests:
+            recommendations.append(
+                RuleRecommendation(
+                    yaml_content="""description: "Require test coverage for code changes"
+enabled: true
+severity: "medium"
+event_types:
+  - pull_request
+conditions:
+  - type: "test_coverage_threshold"
+    parameters:
+      minimum_coverage: 80
+actions:
+  - type: "block_merge"
+    parameters:
+      message: "Test coverage must be at least 80%"
+""",
+                    confidence=0.7,
+                    reasoning="Contributing guidelines mention testing requirements",
+                    source_patterns=["requires_tests"],
+                    category="quality",
+                    estimated_impact="medium",
+                )
+            )
+
+        if features.contributor_count > 10:
+            recommendations.append(
+                RuleRecommendation(
+                    yaml_content="""description: "Require at least one approval for pull requests"
+enabled: true
+severity: "medium"
+event_types:
+  - pull_request
+conditions:
+  - type: "minimum_approvals"
+    parameters:
+      count: 1
+actions:
+  - type: "block_merge"
+    parameters:
+      message: "Pull requests require at least one approval"
+""",
+                    confidence=0.6,
+                    reasoning="Repository has multiple contributors, indicating collaborative development",
+                    source_patterns=["contributor_count"],
+                    category="process",
+                    estimated_impact="medium",
+                )
+            )


The rule recommendations, especially the core 'diff-aware' ones, appear to be hardcoded rather than dynamically generated based on the repository analysis. The analysis data from repository_features and contributing_analysis is only used for a few legacy rules, but the main new recommendations are added unconditionally. This doesn't align with the PR description which suggests 'pattern-based rule generation'. The prompts defined in prompts.py for dynamic analysis and rule generation also seem to be unused here. This implementation should be updated to dynamically generate rules based on the actual analysis results from previous steps to fulfill the feature's goal.

gemini-code-assist · 2025-12-10T13:17:00Z

docs/getting-started/configuration.md

+## Diff-Aware Validators
+
+Watchflow supports advanced validators that inspect actual PR diffs to enforce code-level patterns:
+
+### diff_pattern
+Enforce regex requirements or prohibitions within file patches.
+
+```yaml
+parameters:
+  file_patterns: ["packages/core/src/**/vector-query.ts"]
+  require_patterns: ["throw\\s+new\\s+Error"]
+  forbid_patterns: ["silent.*skip"]
+```
+
+### related_tests
+Require test file updates when core code changes.
+
+```yaml
+parameters:
+  file_patterns: ["packages/core/src/**"]
+  require_test_updates: true
+  min_test_files: 1
+```
+
+### required_field_in_diff
+Ensure new additions include required fields (e.g., agent descriptions).
+
+```yaml
+parameters:
+  file_patterns: ["packages/core/src/agent/**"]
+  required_text: "description"
+```


There appears to be duplicated and inconsistent documentation for Diff-Aware Validators. This section is added here, but a similar, slightly different version is also added earlier in the file (starting around line 114). The two versions use conflicting parameter names and formats, which will be confusing for users. For example:

The other section uses forbidden_patterns, while this one uses forbid_patterns.

The related_tests example here uses require_test_updates: true, while the other uses source_patterns, test_patterns, and min_test_files.

The required_text parameter is a string here, but a list in the other example.

Please consolidate these into a single, consistent section to avoid confusion.

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/README.md

@@ -0,0 +1,86 @@
+# Repository Analysis Agent
+
+the Repository Analysis Agent analyzes GitHub repositories to generate personalized Watchflow rule recommendations based on repository structure, contributing guidelines, and development patterns.


Minor grammatical fix: The sentence should start with a capital letter for better readability.

Suggested change

the Repository Analysis Agent analyzes GitHub repositories to generate personalized Watchflow rule recommendations based on repository structure, contributing guidelines, and development patterns.

The Repository Analysis Agent analyzes GitHub repositories to generate personalized Watchflow rule recommendations based on repository structure, contributing guidelines, and development patterns.

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/README.md

+
+## Overview
+
+this agent performs comprehensive analysis of repositories and provides actionable governance rule recommendations with confidence scores and reasoning.


Minor grammatical fix: The sentence should start with a capital letter for better readability.

Suggested change

this agent performs comprehensive analysis of repositories and provides actionable governance rule recommendations with confidence scores and reasoning.

This agent performs comprehensive analysis of repositories and provides actionable governance rule recommendations with confidence scores and reasoning.

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/agent.py

+            if isinstance(result, dict):
+                state = RepositoryAnalysisState(**result)
+            else:
+                state = result


The logic to handle the result from self.graph.ainvoke can be simplified. The ainvoke method on a compiled graph returns the final state object directly, not a dictionary that needs to be converted. You can remove the isinstance check and directly assign the result to the state variable.

state = result

gemini-code-assist · 2025-12-10T13:17:00Z

src/agents/repository_analysis_agent/test_agent.py

+            result = await agent.execute("test-owner/test-repo")
+
+            assert isinstance(result, object)


The assertion assert isinstance(result, object) in this error handling test is too weak, as almost everything in Python is an object. The test should be more specific to be meaningful. It should verify that the agent's execution correctly reports a failure.

Suggested change

result = await agent.execute("test-owner/test-repo")

assert isinstance(result, object)

result = await agent.execute("test-owner/test-repo")

assert not result.success

assert "analysis failed" in result.message

gemini-code-assist · 2025-12-10T13:17:00Z

src/api/recommendations.py

+@router.get("/v1/rules/recommend/{repository_full_name}")
+async def get_cached_recommendations(repository_full_name: str) -> JSONResponse:
+    """
+    Get cached recommendations for a repository.
+
+    Args:
+        repository_full_name: Full repository name (owner/repo)
+
+    Returns:
+        Cached analysis results or 404 if not found
+    """
+    cache_key = f"repo_analysis:{repository_full_name}"
+    cached_result = await get_cache(cache_key)
+
+    if not cached_result:
+        raise HTTPException(status_code=404, detail="No cached analysis found for repository")
+
+    return JSONResponse(content=cached_result)


The get_cached_recommendations endpoint is a great addition for retrieving cached results. However, it's defined with a path that could be more consistent with REST principles. Since it's retrieving a specific resource (the analysis for a repository), using query parameters might be more idiomatic.

Suggested change

@router.get("/v1/rules/recommend/{repository_full_name}")

async def get_cached_recommendations(repository_full_name: str) -> JSONResponse:

"""

Get cached recommendations for a repository.

Args:

repository_full_name: Full repository name (owner/repo)

Returns:

Cached analysis results or 404 if not found

"""

cache_key = f"repo_analysis:{repository_full_name}"

cached_result = await get_cache(cache_key)

if not cached_result:

raise HTTPException(status_code=404, detail="No cached analysis found for repository")

return JSONResponse(content=cached_result)

@router.get("/v1/rules/recommend", response_model=RepositoryAnalysisResponse)

async def get_cached_recommendations(repository_full_name: str) -> RepositoryAnalysisResponse:

"""

Get cached recommendations for a repository.

Args:

repository_full_name: Full repository name (owner/repo)

Returns:

Cached analysis results or 404 if not found

"""

cache_key = f"repo_analysis:{repository_full_name}"

cached_result = await get_cache(cache_key)

if not cached_result:

raise HTTPException(status_code=404, detail="No cached analysis found for repository")

return RepositoryAnalysisResponse(**cached_result)

gemini-code-assist · 2025-12-10T13:17:00Z

src/core/utils/caching.py

+    if ttl and ttl != _GLOBAL_CACHE.ttl:
+        _GLOBAL_CACHE.ttl = ttl


Modifying the ttl of the global cache instance _GLOBAL_CACHE within the set_cache helper can lead to unpredictable behavior. If another part of the application calls this function with a different ttl, it will alter the cache's expiration policy for all subsequent calls. It would be safer to either not allow overriding the TTL here or to use a different caching mechanism if variable TTLs are required.

naaa760 added 8 commits November 17, 2025 13:49

feat: add RepositoryAnalysisAgent base structure

8ff6641

feat: implement RepositoryAnalysisAgent with LangGraph workflow

519ab06

feat: define Pydantic models for repository analysis

b2e3205

feat: implement LangGraph nodes for repository analysis workflow

8c6a580

feat: add LLM prompts for contributing guidelines analysis

fc607b3

test: add comprehensive tests for RepositoryAnalysisAgent

4a0ad3f

docs: add README documentation for RepositoryAnalysisAgent

a65019a

feat: add API endpoints for rule recommendations

3ae89e1

naaa760 requested a review from dkargatzis as a code owner November 17, 2025 08:23

naaa760 added 2 commits November 17, 2025 14:02

update

719cb76

update

212bcfd

dkargatzis assigned naaa760 Nov 17, 2025

naaa760 added 4 commits November 28, 2025 03:10

update

1db7a60

add normalized diff metadata and LLM-friendly summaries for PR files.

23b3710

surface the new diff context (details + top changed files) in the eng…

3dd71e1

…ine agent’s evaluation prompt.

regression test to keep the diff-summarizer behavior (truncation/empt…

544b3eb

…y cases) stable.

naaa760 added 11 commits November 29, 2025 22:45

added the diff-aware validators (diff_pattern, related_tests, require…

bdb4227

…d_field_in_diff) and wired them into the registry

added unit tests covering the new validators’ behavior

1c6cf40

refine the glob handling and keep the diff-aware validators passing

23d60e1

switch Mastra rule bundle to the new diff-aware parameters

4fefa76

document that the recommended rules now rely on the diff-aware valida…

63adcf9

…tors introduced in code

add a 'diff-aware validators' section describing the new parameter

9d40d7a

add Diff-Aware Validators section

ecbcbd4

update to reference new validators and inline sample rules instead o…

b8e8154

…f using snippets.

remove problematic analytics section

37b9e60

docs: align mastra rules with diff-aware validators

1b58651

chore: remove old mastra sample path

cf410a0

naaa760 added 3 commits December 9, 2025 12:27

feat: add PR history sampling and diff-aware rule recommendations

4bb73b3

fix: restore caching and structured logging helpers

b773cee

chore: fix imports, lint, and test path setup

2f1c0b9

gemini-code-assist bot reviewed Dec 10, 2025

View reviewed changes

		@@ -0,0 +1,86 @@
		# Repository Analysis Agent

		the Repository Analysis Agent analyzes GitHub repositories to generate personalized Watchflow rule recommendations based on repository structure, contributing guidelines, and development patterns.


		## Overview

		this agent performs comprehensive analysis of repositories and provides actionable governance rule recommendations with confidence scores and reasoning.

		result = await agent.execute("test-owner/test-repo")

		assert isinstance(result, object)

feat: implement repository analysis agent with recommendation API #27

Are you sure you want to change the base?

feat: implement repository analysis agent with recommendation API #27

Conversation

naaa760 commented Nov 17, 2025

Key Features:

Files Added:

Acceptance Criteria:

Uh oh!

dkargatzis commented Nov 17, 2025

Uh oh!

naaa760 commented Nov 29, 2025

Uh oh!

dkargatzis commented Nov 29, 2025

Uh oh!

codecov-commenter commented Dec 10, 2025

Codecov Report

Uh oh!

dkargatzis commented Dec 10, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants