Thank you for your interest in contributing to AgentEval! 🎉
AgentEval is a .NET toolkit for evaluating, testing, and benchmarking agentic AI, built first for Microsoft Agent Framework (MAF). Community contributions are essential to its success.
- 🐛 Report Bugs - Found an issue? Open a bug report
- 💡 Request Features - Have an idea? Submit a feature request
- 📖 Improve Documentation - Fix typos, clarify explanations, add examples
- 🔧 Submit Code - Bug fixes, new features, refactoring
- 🧪 Write Tests - Increase test coverage, add edge cases
- 💬 Answer Questions - Help others in GitHub Discussions
- .NET 8.0 SDK or later
- Git
- (Optional) Azure OpenAI access for running samples with real AI
```bash
# Fork the repository on GitHub, then:
git clone https://github.com/YOUR_USERNAME/AgentEval.git
cd AgentEval
dotnet build
dotnet test
```

All tests should pass across all target frameworks (net8.0, net9.0, net10.0).
```bash
git checkout -b feature/my-new-feature
# or
git checkout -b fix/issue-123
```

Use descriptive branch names:

- `feature/` - New functionality
- `fix/` - Bug fixes
- `docs/` - Documentation changes
- `refactor/` - Code refactoring
- `test/` - Test improvements
- Follow the existing code style
- Add XML documentation for public APIs
- Include unit tests for new functionality
- Update documentation if needed
```bash
# Run all tests
dotnet test

# Run tests with coverage (optional)
dotnet test --collect:"XPlat Code Coverage"
```

We follow Conventional Commits:
git commit -m "feat: add new assertion for tool timing"
git commit -m "fix: handle null tool arguments correctly"
git commit -m "docs: update walkthrough with new API"
git commit -m "test: add edge cases for snapshot comparer"Commit message format:
```
<type>(<scope>): <description>

[optional body]

[optional footer]
```

Types: `feat`, `fix`, `docs`, `style`, `refactor`, `test`, `chore`
```bash
git push origin feature/my-new-feature
```

Then create a Pull Request on GitHub.
- Use C# 12 features where appropriate
- Follow Microsoft's C# Coding Conventions
- Use file-scoped namespaces
- Use primary constructors for simple types
- Prefer `required` properties over constructor parameters for models
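As an illustration, a simple model type combining these features might look like the sketch below (the `EvaluationOptions` type and its members are hypothetical, not part of the AgentEval API):

```csharp
namespace AgentEval.Samples;   // file-scoped namespace

// Primary constructor for a simple type; a 'required' property instead of a
// constructor parameter for the model-style setting.
public sealed class EvaluationOptions(string runName)
{
    public string RunName { get; } = runName;

    public required double PassingScore { get; init; }
}
```

Callers then write `new EvaluationOptions("smoke") { PassingScore = 0.8 }`, and the compiler enforces that `PassingScore` is set.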
Naming conventions:

- Classes/Methods: PascalCase (`TestHarness`, `RunTestAsync`)
- Parameters/Variables: camelCase (`testCase`, `result`)
- Private Fields: _camelCase (`_logger`, `_evaluator`)
- Constants: PascalCase (`DefaultPassingScore`)
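Taken together, a class following these conventions might look like this minimal sketch (the members shown are illustrative only):

```csharp
using System.Threading.Tasks;
using Microsoft.Extensions.Logging;

public class TestHarness                                // class: PascalCase
{
    private readonly ILogger _logger;                   // private field: _camelCase
    public const double DefaultPassingScore = 0.7;      // constant: PascalCase

    public TestHarness(ILogger logger) => _logger = logger;

    public async Task<double> RunTestAsync(string testCase)  // method: PascalCase, parameter: camelCase
    {
        var result = DefaultPassingScore;               // local variable: camelCase
        await Task.CompletedTask;
        _logger.LogDebug("Ran {TestCase}", testCase);
        return result;
    }
}
```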
All public APIs must have XML documentation:
```csharp
/// <summary>
/// Runs a single test case against the agent.
/// </summary>
/// <param name="adapter">The agent adapter to test.</param>
/// <param name="testCase">The test case to run.</param>
/// <returns>The test result with score and details.</returns>
public async Task<TestResult> RunTestAsync(IAgentAdapter adapter, TestCase testCase)
```

Use the pattern: `MethodName_StateUnderTest_ExpectedBehavior`
```csharp
[Fact]
public async Task RunTestAsync_WithNullAdapter_ThrowsArgumentNullException()

[Fact]
public async Task HaveCalledTool_WhenToolWasCalled_ShouldPass()
```
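As a fuller illustration, a complete test following this pattern might look like the self-contained sketch below (the `TestHarness` and `TestCase` stand-ins are hypothetical, not the actual AgentEval types):

```csharp
using System;
using System.Threading.Tasks;
using Xunit;

// Hypothetical stand-ins for illustration only.
public record TestCase(string Name);

public class TestHarness
{
    public Task<double> RunTestAsync(object? adapter, TestCase testCase)
    {
        ArgumentNullException.ThrowIfNull(adapter);
        return Task.FromResult(1.0);
    }
}

public class TestHarnessTests
{
    [Fact]
    public async Task RunTestAsync_WithNullAdapter_ThrowsArgumentNullException()
    {
        // Arrange: a harness that will be handed a null adapter.
        var harness = new TestHarness();

        // Act + Assert: the call should reject the null argument.
        await Assert.ThrowsAsync<ArgumentNullException>(
            () => harness.RunTestAsync(adapter: null, new TestCase("smoke")));
    }
}
```

The name reads as method, state under test, expected behavior, which keeps failure output self-describing.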
Project structure:

```
AgentEval/
├── src/
│   ├── AgentEval.Abstractions/    # Public contracts: interfaces, models
│   ├── AgentEval.Core/            # Implementations: metrics, assertions, comparison, tracing
│   ├── AgentEval.DataLoaders/     # Data loaders, exporters, output formatting
│   ├── AgentEval.MAF/             # Microsoft Agent Framework integration
│   ├── AgentEval.RedTeam/         # Security scanning, attack types, compliance
│   └── AgentEval/                 # Umbrella packaging project (NuGet: AgentEval)
├── samples/
│   └── AgentEval.Samples/         # Sample code
├── tests/
│   └── AgentEval.Tests/           # Unit tests
└── docs/                          # DocFX documentation
```
When adding new files, choose the correct sub-project based on these guidelines:
| If your code is... | Put it in... |
|---|---|
| An interface, model, or public contract | AgentEval.Abstractions |
| A metric, assertion, comparison, or tracing implementation | AgentEval.Core |
| A data loader, exporter, or output formatter | AgentEval.DataLoaders |
| MAF-specific (MAFAgentAdapter, MAFEvaluationHarness) | AgentEval.MAF |
| Red team / security scanning | AgentEval.RedTeam |
All six projects use `RootNamespace=AgentEval`; namespaces should match the folder structure (e.g., `AgentEval.Metrics.RAG`, `AgentEval.Core`), not the project name.
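For example, a hypothetical metric file placed under a Metrics/RAG folder in AgentEval.Core would take its namespace from the folder path (the file path and type name here are illustrative):

```csharp
// Hypothetical file: src/AgentEval.Core/Metrics/RAG/GroundednessMetric.cs
// With RootNamespace=AgentEval, the namespace tracks the folder path,
// not the AgentEval.Core project name.
namespace AgentEval.Metrics.RAG;

public sealed class GroundednessMetric
{
    // ...
}
```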
Each sub-project has its own DI extension method:
- `AddAgentEval()` - Core services (in `AgentEval.Core`)
- `AddAgentEvalDataLoaders()` - DataLoaders + Exporters
- `AddAgentEvalRedTeam()` - Red Team security testing
- `AddAgentEvalAll()` - Umbrella convenience (all of the above)
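Wiring these into a host might look like the following sketch (it assumes the standard Microsoft.Extensions.DependencyInjection `ServiceCollection` and parameterless extension methods; only the method names come from the list above):

```csharp
using Microsoft.Extensions.DependencyInjection;

var services = new ServiceCollection();

// Register only the pieces you need...
services.AddAgentEval();             // core services
services.AddAgentEvalDataLoaders();  // data loaders + exporters

// ...or pull in everything via the umbrella helper:
// services.AddAgentEvalAll();

using var provider = services.BuildServiceProvider();
```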
- Tests pass locally (`dotnet test`)
- New code has unit tests
- XML documentation added for public APIs
- No compiler warnings
- PR description explains the change
- Create a draft PR early for feedback
- Request review when ready
- Address feedback promptly
- Squash commits if requested
- PRs require at least one approval
- All CI checks must pass
- Use "Squash and merge" for clean history
Bug reports should include:
- AgentEval version
- .NET version
- Steps to reproduce
- Expected vs actual behavior
- Error messages/stack traces
Feature requests should include:
- Problem you're trying to solve
- Proposed solution
- Alternative solutions considered
- Impact on existing functionality
- GitHub Discussions: Ask questions, share ideas
- Issues: Bug reports and feature requests
By contributing to AgentEval, you agree that your contributions will be licensed under the MIT License.
If you use AI tools (such as GitHub Copilot, ChatGPT, or similar) to generate contributed code, you are responsible for:
- Ensuring the generated code does not violate any third-party licenses or intellectual property rights.
- Reviewing and understanding all AI-generated code before submitting it.
- Disclosing significant AI-assisted contributions in your PR description.
AgentEval is currently in preview (pre-1.0). The following rules apply to all releases:
- All GitHub releases before 1.0 must be marked as "This is a pre-release."
- Release notes must include the standard preview banner:
  > **Warning**
  > This is a preview release. APIs may change without notice. Not for production use. See DISCLAIMER.md for full details.
- Breaking changes must be documented in the release notes.
Contributors are recognized in:
- Release notes
- README acknowledgments section (for significant contributions)
Thank you for helping make AgentEval better! 🚀