Skip to content

Commit 2028489

Browse files
nagkumar91Nagkumar ArkalgudNagkumar ArkalgudCopilot
authored
Add properties to FDP, fix changelog formatting (#42208)
* Prepare evals SDK Release * Fix bug * Fix for ADV_CONV for FDP projects * Update release date * re-add pyrit to matrix * Change grader ids * Update unit test * replace all old grader IDs in tests * Update platform-matrix.json Add pyrit and not remove the other one * Update test to ensure everything is mocked * tox/black fixes * Skip that test with issues * update grader ID according to API View feedback * Update test * remove string check for grader ID * Update changelog and officialy start freeze * update the enum according to suggestions * update the changelog * Finalize logic * Initial plan * Fix client request ID headers in azure-ai-evaluation Co-authored-by: nagkumar91 <[email protected]> * Fix client request ID header format in rai_service.py Co-authored-by: nagkumar91 <[email protected]> * Passing threshold in AzureOpenAIScoreModelGrader * Add changelog * Adding the self.pass_threshold instead of pass_threshold * Add the python grader * Remove redundant test * Add class to exception list and format code * Add properties to evaluation upload run for FDP * Remove debug * Remove the redundant property * Fix changelog * Fix the multiple features added section * removed the properties in update --------- Co-authored-by: Nagkumar Arkalgud <[email protected]> Co-authored-by: Nagkumar Arkalgud <[email protected]> Co-authored-by: copilot-swe-agent[bot] <[email protected]> Co-authored-by: nagkumar91 <[email protected]>
1 parent 815ddd9 commit 2028489

File tree

2 files changed

+2
-4
lines changed

2 files changed

+2
-4
lines changed

sdk/evaluation/azure-ai-evaluation/CHANGELOG.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,12 +9,11 @@
99
### Features Added
1010

1111
- Added support for Azure OpenAI Python grader via `AzureOpenAIPythonGrader` class, which serves as a wrapper around Azure Open AI Python grader configurations. This new grader object can be supplied to the main `evaluate` method as if it were a normal callable evaluator.
12-
13-
### Features Added
1412
- Added `attack_success_thresholds` parameter to `RedTeam` class for configuring custom thresholds that determine attack success. This allows users to set specific threshold values for each risk category, with scores greater than the threshold considered successful attacks (i.e. higher threshold means higher
1513
tolerance for harmful responses).
1614
- Enhanced threshold reporting in RedTeam results to include default threshold values when custom thresholds aren't specified, providing better transparency about the evaluation criteria used.
1715

16+
1817
### Bugs Fixed
1918

2019
- Fixed red team scan `output_path` issue where individual evaluation results were overwriting each other instead of being preserved as separate files. Individual evaluations now create unique files while the user's `output_path` is reserved for final aggregated results.

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_utils.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -178,7 +178,6 @@ def _log_metrics_and_instance_results_onedp(
178178

179179
properties = {
180180
EvaluationRunProperties.RUN_TYPE: "eval_run",
181-
EvaluationRunProperties.EVALUATION_RUN: "promptflow.BatchRun",
182181
EvaluationRunProperties.EVALUATION_SDK: f"azure-ai-evaluation:{VERSION}",
183182
"_azureml.evaluate_artifacts": json.dumps([{"path": artifact_name, "type": "table"}]),
184183
}
@@ -191,6 +190,7 @@ def _log_metrics_and_instance_results_onedp(
191190
upload_run_response = client.start_evaluation_run(
192191
evaluation=EvaluationUpload(
193192
display_name=evaluation_name,
193+
properties=properties,
194194
)
195195
)
196196

@@ -202,7 +202,6 @@ def _log_metrics_and_instance_results_onedp(
202202
outputs={
203203
"evaluationResultId": create_evaluation_result_response.id,
204204
},
205-
properties=properties,
206205
),
207206
)
208207

0 commit comments

Comments
 (0)