Tune your AI Agent to best meet its KPI with a cyclic process of analyze, improve and simulate
-
Updated
Jul 24, 2025 - Python
Tune your AI Agent to best meet its KPI with a cyclic process of analyze, improve and simulate
Exploratory data analysis and interactive model-understanding and evaluation tool for chatbot training data and feedback
This is a repository for a Jupyter based tool to calculate Greedy Matching, Vector Extrema and Average Embedding evaluation metrics for generative AI chatbots
Evaluation results and experimental data for TRACER, demonstrating its effectiveness in discovering chatbot functionalities and detecting errors with coverage analysis and mutation testing.
An open-source framework for robust, LLM-powered testing and tracing of conversational AI applications.
Add a description, image, and links to the chatbot-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the chatbot-evaluation topic, visit your repo's landing page and select "manage topics."