Analyzer_agent by Jul434 · Pull Request #67 · ebzych/amphimixis

Jul434 · 2026-03-12T09:40:10Z

Add agent for optional analysis with LLM

ebzych

I think we need to migrate to openai and not use gigachat; generalize for any model, as it is implemented in perf_analyzer.

ebzych · 2026-04-19T10:39:20Z

+def create_model():
+    """Create GigaChat model."""
+    credentials = os.getenv("GIGACHAT_CREDENTIALS")
+    if not credentials:
+        raise ValueError("GigaChat credentials environment variable not set")
+
+    model = GigaChat(
+        credentials=credentials,
+        scope="GIGACHAT_API_PERS",
+        model="GigaChat-2-pro",
+        verify_ssl_certs=False,
+        timeout=120,
+        temperature=0.3,
+    )
+    return model


Think about the tool's usage scenario: a user may want to use their own model.
Look at perf_analyzer: it is implemented generically there via environment variables. If generalization is a problem in langchain, try openai.
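
A minimal sketch of the kind of environment-driven configuration this comment asks for. The variable names (`LLM_BASE_URL`, `LLM_API_KEY`, `LLM_MODEL`) and the `ModelConfig` type are hypothetical and are not taken from perf_analyzer; the defaults mirror the values in the quoted hunk.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Hypothetical variable names; perf_analyzer may use different ones.
REQUIRED_VARS = ("LLM_BASE_URL", "LLM_API_KEY", "LLM_MODEL")


@dataclass(frozen=True)
class ModelConfig:
    """Provider-agnostic settings for an OpenAI-compatible endpoint."""

    base_url: str
    api_key: str
    model: str
    temperature: float = 0.3  # same value as in the quoted hunk
    timeout: int = 120  # same value as in the quoted hunk


def load_model_config(env: Optional[Mapping[str, str]] = None) -> ModelConfig:
    """Read model settings from the environment instead of hard-coding a vendor."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("Missing environment variables: " + ", ".join(missing))
    return ModelConfig(
        base_url=env["LLM_BASE_URL"],
        api_key=env["LLM_API_KEY"],
        model=env["LLM_MODEL"],
    )
```

The resulting config could then be handed to whichever chat client the project settles on, so switching providers only means changing the environment.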

ebzych · 2026-04-19T10:42:10Z

+import yaml
+from langchain_core.messages import AIMessage, HumanMessage, SystemMessage, ToolMessage
+from langchain_core.tools import tool
+from langchain_gigachat.chat_models import GigaChat


Why do we need to hard-wire gigachat into the tool?

ebzych · 2026-04-19T10:48:22Z

+    """
+    Returns file tree in the project. Each line contains relative path to one file.
+    Returns max 300 files to avoid token limits.
+
+    Args:
+        proj_path: Absolute path to the project directory
+    """


This is Google docstring style; we use reST style everywhere, check the other modules.

Don't forget about types:
:param str proj_path: bubuububu
:rtype:
:return:

I hope this does not cause problems with the tool description passed to the LLM.
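
For illustration, the docstring from the hunk rewritten in reST style might look like the sketch below. The function name `directory_tree` comes from the prompt quoted later in this review; the value `300` for `MAX_FILES_IN_TREE` is an assumption based on the original docstring text, and the `@tool` decorator is left out so the snippet runs without langchain.

```python
from pathlib import Path

MAX_FILES_IN_TREE = 300  # assumed from "Returns max 300 files" in the original docstring


def directory_tree(proj_path: str) -> str:
    """Return the file tree of the project, one relative path per line.

    At most ``MAX_FILES_IN_TREE`` files are returned to avoid token limits.

    :param str proj_path: absolute path to the project directory
    :rtype: str
    :return: newline-separated relative file paths
    """
    base = Path(proj_path)
    # Body kept as in the PR; only the docstring style changes here.
    paths = [str(p.relative_to(base)) for p in base.rglob("*") if p.is_file()]
    return "\n".join(sorted(paths)[:MAX_FILES_IN_TREE])
```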

ebzych · 2026-04-19T11:04:31Z

+        proj_path: Absolute path to the project directory
+    """
+    base = Path(proj_path)
+    paths = [str(p.relative_to(base)) for p in base.rglob("*") if p.is_file()]


What if the LLM uses this tool several times but never finds a file in the project root because rglob is implemented as DFS? Take a look at general.BuildSystem.find_relative_path; BFS is implemented there.

ebzych · 2026-04-19T11:13:08Z

+    """
+    base = Path(proj_path)
+    paths = [str(p.relative_to(base)) for p in base.rglob("*") if p.is_file()]
+    return "\n".join(sorted(paths)[:MAX_FILES_IN_TREE])


E.g. the src directory in the grpc repository has ~4700 files; if src is the first directory walked, you may not find CMakeLists.txt.
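
A breadth-first listing along these lines would emit shallow files before deep ones, so a root-level CMakeLists.txt survives the limit. This is a sketch, not the code of general.BuildSystem.find_relative_path; the name `list_files_bfs` and its signature are hypothetical.

```python
from collections import deque
from pathlib import Path
from typing import List


def list_files_bfs(proj_path: str, limit: int) -> List[str]:
    """List up to ``limit`` files, visiting shallower directories first."""
    base = Path(proj_path)
    queue = deque([base])
    found: List[str] = []
    while queue and len(found) < limit:
        current = queue.popleft()
        entries = sorted(current.iterdir(), key=lambda p: p.name)
        # Emit the files of this directory before descending any further.
        for entry in entries:
            if entry.is_file():
                found.append(entry.relative_to(base).as_posix())
                if len(found) == limit:
                    return found
        # Subdirectories are queued and visited only after the current level.
        # Symlinked directories are skipped to avoid cycles.
        queue.extend(e for e in entries if e.is_dir() and not e.is_symlink())
    return found
```

With this ordering the truncation cuts off the deepest files first, which is usually where build-system entry points are least likely to be.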

ebzych · 2026-04-19T12:01:08Z

+1. Use directory_tree to discover the project structure
+2. Get information about project by presence of files and directories
+3. Use get_file_content to examine build configs (CMakeLists.txt, meson.build, Makefile, etc.) and CI files
+4. Analyze CMakeLists.txt, meson.build, CI configs, etc. for test/benchmark paths
+5. Analyze all build system files to find what systems are used
+6. Analyze third-party directory, CMakeLists.txt find_package, etc. to find dependencies
+7. Put found information in YAML file format
+8. Repeat until you have all information


OpenAI supports skills. With them you can write clearer and more comprehensive instructions for a specific build system or anything else, because they do not enlarge the system prompt and are loaded only when a condition is satisfied.

ebzych · 2026-04-19T12:44:19Z

+    except ValueError as e:
+        raise e
+
+    model_with_tools = model.bind_tools(TOOLS)


Maybe TOOLS_MAP would be better? You may also need to specify model.tool_choice, since it is None by default?
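
One way to read the TOOLS_MAP suggestion is a name-to-callable dictionary used to dispatch the tool calls returned by the model. The sketch below uses stand-in tool bodies and a hypothetical `run_tool_call` helper; the `file_path` argument name is also an assumption. The tool-call dictionary shape (`name`, `args`, `id`) follows what LangChain exposes on `AIMessage.tool_calls`.

```python
from typing import Any, Callable, Dict


def directory_tree(proj_path: str) -> str:
    """Stand-in for the real tool, used only to make the sketch runnable."""
    return f"tree of {proj_path}"


def get_file_content(file_path: str) -> str:
    """Stand-in for the real tool; the argument name is an assumption."""
    return f"content of {file_path}"


# Tools looked up by name instead of scanning a list.
TOOLS_MAP: Dict[str, Callable[..., str]] = {
    "directory_tree": directory_tree,
    "get_file_content": get_file_content,
}


def run_tool_call(tool_call: Dict[str, Any]) -> str:
    """Dispatch one tool call dictionary to the matching function."""
    name = tool_call["name"]
    func = TOOLS_MAP.get(name)
    if func is None:
        # Report the problem back to the model instead of raising.
        return f"Unknown tool: {name}"
    return func(**tool_call.get("args", {}))
```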

ebzych · 2026-04-19T12:47:53Z

+        messages.append(result)
+
+        if len(messages) > MAX_MESSAGES:
+            messages = messages[-MAX_MESSAGES:]


Slicing from the end so that the part of the conversation with the YAML formatting is kept?
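
A related point: `messages[-MAX_MESSAGES:]` drops everything at the front, including any system prompt stored as the first message. A possible variant, sketched here with plain dictionaries instead of LangChain message objects, keeps the leading system message and fills the rest from the end. It assumes the system prompt is the first element, which the quoted hunk does not show.

```python
from typing import Dict, List

Message = Dict[str, str]


def trim_messages(messages: List[Message], max_messages: int) -> List[Message]:
    """Keep the leading system message (if any) plus the most recent messages."""
    if len(messages) <= max_messages:
        return list(messages)
    if max_messages < 1:
        raise ValueError("max_messages must be at least 1")
    # Assumption: the system prompt, when present, is the first message.
    head = messages[:1] if messages[0]["role"] == "system" else []
    tail_len = max_messages - len(head)
    # Slicing from an explicit index avoids the messages[-0:] pitfall.
    tail = messages[len(messages) - tail_len:]
    return head + tail
```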

ebzych · 2026-04-19T12:52:06Z

+    last_message = messages[-1]
+    content = last_message.content
+    if isinstance(content, list):
+        output_text = str(content)


maybe "\n".join(content)?
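
Note that in LangChain a list-valued `content` may hold dictionaries (content blocks) as well as strings, in which case a bare `"\n".join(content)` raises `TypeError`. A small helper such as the hypothetical `content_to_text` below handles both; which block keys actually appear depends on the provider, so reading only `"text"` is an assumption.

```python
from typing import Any, List, Union


def content_to_text(content: Union[str, List[Any]]) -> str:
    """Flatten message content into plain text."""
    if isinstance(content, str):
        return content
    parts: List[str] = []
    for block in content:
        if isinstance(block, str):
            parts.append(block)
        elif isinstance(block, dict) and isinstance(block.get("text"), str):
            # Assumption: textual blocks carry their payload under "text".
            parts.append(block["text"])
        # Other block types (images, tool use, ...) are skipped.
    return "\n".join(parts)
```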

ebzych · 2026-04-19T12:54:57Z

+from langchain_core.tools import tool
+from langchain_gigachat.chat_models import GigaChat
+
+LLM_ANALYSIS_FILE = "amphimixis_llm.analyzed"


Maybe centralize this data in one file so that other modules can reuse it?

ebzych · 2026-04-22T14:56:03Z

+    if use_llm is True:
+        _logger.info("Analyzing with llm")
+        analyze_with_agent(proj_path)
+        _logger.info("Analyzing with llm done")
+


You don't return after this.
I understand that you run a primary analysis with heuristics in case the LLM fails, but I don't see the LLM results being reflected in the final results in any way.
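
One possible shape for the control flow being discussed: return the LLM result when it succeeds and fall back to heuristics otherwise. The function name `analyze` and both analyzer callables are hypothetical placeholders, not the PR's actual API.

```python
import logging
from typing import Callable, Optional

_logger = logging.getLogger(__name__)

Analyzer = Callable[[str], Optional[dict]]


def analyze(
    proj_path: str,
    use_llm: bool,
    llm_analyzer: Analyzer,
    heuristic_analyzer: Analyzer,
) -> Optional[dict]:
    """Prefer the LLM result; use heuristics when the LLM is off or fails."""
    if use_llm:
        _logger.info("Analyzing with llm")
        try:
            result = llm_analyzer(proj_path)
        except Exception:
            _logger.exception("LLM analysis failed, falling back to heuristics")
            result = None
        if result:
            _logger.info("Analyzing with llm done")
            return result  # the LLM result is actually used
    return heuristic_analyzer(proj_path)
```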

Jul434 force-pushed the analyzer_llm branch 2 times, most recently from d2f0444 to 7b6bfa4 Compare March 12, 2026 10:22

refactor(analyzer): add optional llm usage for analysis

fec3b2e

Jul434 force-pushed the analyzer_llm branch from 7b6bfa4 to fec3b2e Compare April 9, 2026 20:40

Jul434 requested a review from ebzych April 10, 2026 11:27

ebzych requested changes Apr 19, 2026

ebzych requested changes Apr 22, 2026

ebzych added the "fix wanted" label (PR wants to be fixed) May 1, 2026

2 participants