Improve error message for duplicate pipeline run names #3701

strickvl · 2025-05-24T17:30:07Z

Summary

- Improve the error message when users try to run a pipeline with a duplicate run_name
- Handle duplicate run name errors directly in SQLZenStore, where they occur
- Provide a user-friendly error message with actionable solutions
- Fix a SQLAlchemy autoflush timing issue to ensure proper error handling

Problem

When users run a pipeline with a duplicate run name (whether from a config file, programmatically, or any other method), they get a confusing database error. The error can come in two forms:

1. A raw SQL IntegrityError (when using the REST API):

RuntimeError: (pymysql.err.IntegrityError) (1062, "Duplicate entry 'test_run_name-6e23c0466cc4411c8b9f75f0c8a1a818' for key 'pipeline_run.unique_run_name_in_project'")

2. An EntityExistsError with a technical message

Additionally, there was a subtle SQLAlchemy autoflush timing issue where IntegrityErrors were being raised during _get_reference_schema_by_id calls instead of during the explicit session.commit(), preventing proper error handling.

Solution

This PR catches IntegrityError in SQLZenStore's _create_run method and provides a much more helpful error message with actionable solutions. It also fixes the SQLAlchemy autoflush issue to ensure errors are properly caught and handled.
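The translation described above can be sketched in isolation. This is a minimal illustration and not the code from this PR: it uses the standard-library sqlite3 module instead of SQLAlchemy, and `EntityExistsError`, `make_store`, `create_run`, and `duplicate_run_name_message` are hypothetical stand-ins defined here only for the example.

```python
import sqlite3


class EntityExistsError(Exception):
    """Hypothetical stand-in for a store-level 'already exists' error."""


def duplicate_run_name_message(name: str) -> str:
    """Build a user-facing message for a duplicate run name."""
    return (
        f"Pipeline run name '{name}' already exists in this project. "
        "Each pipeline run must have a unique name.\n\n"
        "To fix this, you can:\n"
        "1. Use a different run name\n"
        '2. Use a dynamic run name with placeholders like: "my_run_{date}_{time}"\n'
        "3. Remove the run name from your configuration to auto-generate unique names"
    )


def make_store() -> sqlite3.Connection:
    """Create an in-memory table with a per-project unique run name."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE pipeline_run ("
        "project TEXT NOT NULL, name TEXT NOT NULL, UNIQUE (name, project))"
    )
    return conn


def create_run(conn: sqlite3.Connection, project: str, name: str) -> None:
    """Insert a run, translating the raw constraint error for the caller."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO pipeline_run (project, name) VALUES (?, ?)",
                (project, name),
            )
    except sqlite3.IntegrityError as e:
        # Keep the database error as the cause, but show a readable message.
        raise EntityExistsError(duplicate_run_name_message(name)) from e
```

Chaining with `from e` keeps the original database error available for debugging while the user sees only the readable message.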

Before (Raw SQL Error)

RuntimeError: (pymysql.err.IntegrityError) (1062, "Duplicate entry 'my_run-6e23c0466cc4411c8b9f75f0c8a1a818' for key 'pipeline_run.unique_run_name_in_project'")
[SQL: INSERT INTO pipeline_run ...]

After (User-Friendly Error)

Pipeline run name 'my_run' already exists in this project. Each pipeline run must have a unique name.

To fix this, you can:
1. Use a different run name
2. Use a dynamic run name with placeholders like: "my_run_{date}_{time}"
3. Remove the run name from your configuration to auto-generate unique names

For more information on run naming, see: https://docs.zenml.io/concepts/steps_and_pipelines/yaml_configuration#run-name
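Option 2 in the message relies on placeholders being expanded at run time. ZenML performs this substitution itself; the sketch below only illustrates the idea. The function name `expand_run_name` and the exact date and time formats are assumptions made for this example, not ZenML's implementation.

```python
from datetime import datetime


def expand_run_name(template: str, now: datetime) -> str:
    """Fill {date} and {time} placeholders (formats assumed for illustration)."""
    return template.format(
        date=now.strftime("%Y_%m_%d"),
        time=now.strftime("%H_%M_%S_%f"),
    )


# A fixed name stays the same on every run, so a second run collides.
fixed = expand_run_name("my_run", datetime(2025, 5, 24, 17, 30, 7))

# A templated name changes with the timestamp, so reruns get distinct names.
first = expand_run_name("my_run_{date}_{time}", datetime(2025, 5, 24, 17, 30, 7))
second = expand_run_name("my_run_{date}_{time}", datetime(2025, 5, 24, 17, 30, 8))
```

Here `first` is `my_run_2025_05_24_17_30_07_000000` and `second` differs in the seconds field, while `fixed` is always `my_run`.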

Changes Made

- Enhanced error handling in SQLZenStore's _create_run() method to catch IntegrityError and provide user-friendly messages
- Fixed the SQLAlchemy autoflush issue by wrapping logs processing in session.no_autoflush to prevent a premature IntegrityError during reference lookups
- Replaced the mocked unit test with a proper integration test that runs real pipelines twice with the same name
- Made the error message more generic by removing the specific mention of "config file", since run names can be set in multiple ways
- Updated integration tests to verify the fix works end-to-end with real pipeline execution

Test Plan

- Integration test verifies duplicate run names are properly detected when running real pipelines
- Integration test confirms EntityExistsError is raised (not a raw IntegrityError)
- Integration test validates that a clear error message is shown to users
- All existing tests continue to pass
- Mypy type checking passes
- Formatting and linting checks pass

Technical Details

The key fix addresses SQLAlchemy's autoflush behavior: _get_reference_schema_by_id calls triggered early database flushes, so IntegrityErrors were raised before the explicit session.commit() and escaped the try/except block around it. Wrapping the logs processing in session.no_autoflush defers the flush to the commit, so the error handling works as intended.
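The timing difference can be reproduced outside ZenML. The following is a minimal sketch, assuming SQLAlchemy 1.4 or later is installed and using an in-memory SQLite database; the `Run` model and the `where_error_surfaces` helper are hypothetical and exist only for this illustration.

```python
from sqlalchemy import Column, Integer, String, create_engine, select
from sqlalchemy.exc import IntegrityError
from sqlalchemy.orm import Session, declarative_base

Base = declarative_base()


class Run(Base):
    """Hypothetical table with a unique run name."""

    __tablename__ = "pipeline_run"
    id = Column(Integer, primary_key=True)
    name = Column(String, unique=True, nullable=False)


def where_error_surfaces(use_no_autoflush: bool) -> str:
    """Report whether the duplicate is detected at 'lookup' or at 'commit'."""
    engine = create_engine("sqlite://")
    Base.metadata.create_all(engine)
    with Session(engine) as session:
        session.add(Run(name="my_run"))
        session.commit()

    with Session(engine) as session:
        session.add(Run(name="my_run"))  # pending duplicate, not yet flushed
        try:
            if use_no_autoflush:
                # Queries inside this block do not flush pending objects.
                with session.no_autoflush:
                    session.execute(select(Run)).all()
            else:
                # Autoflush sends the pending INSERT before running the SELECT.
                session.execute(select(Run)).all()
        except IntegrityError:
            return "lookup"
        try:
            session.commit()
        except IntegrityError:
            session.rollback()
            return "commit"
    return "none"
```

With the default autoflush the unrelated lookup raises the error; inside `session.no_autoflush` the error is deferred to `session.commit()`, which is where a try/except around the commit can catch it.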

🤖 Generated with Claude Code

coderabbitai · 2025-05-24T17:30:13Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.


github-actions · 2025-05-24T17:31:27Z

Documentation Link Check Results

❌ Absolute links check failed
There are broken absolute links in the documentation. See workflow logs for details
✅ Relative links check passed
Last checked: 2025-09-23 07:16:03 UTC

Copilot

Pull Request Overview

This PR enhances the user experience by providing clearer guidance when a pipeline run name conflict occurs, along with tests and documentation to support the change.

Enhanced error handling in create_placeholder_run to catch duplicate run-name errors and surface a helpful, actionable message.
Added unit tests to verify the new error message and ensure other EntityExistsError cases remain unchanged.
Updated docs to warn about run-name uniqueness and suggest best practices.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
src/zenml/pipelines/run_utils.py	Improved duplicate run-name error detection and messaging
tests/unit/pipelines/test_run_utils.py	Added tests for duplicate run-name error and non-duplicate behavior
docs/book/how-to/steps-pipelines/yaml_configuration.md	Warn about unique run names and provide guidance in YAML examples
run_alerter_tests.sh	New script to run alerter tests (scope seems unrelated)

Comments suppressed due to low confidence (1)

run_alerter_tests.sh:1

[nitpick] This script for running alerter tests appears unrelated to the pipeline run-name improvements. Consider moving it to a separate PR or isolating it under a more relevant feature grouping to keep this change focused.

#!/bin/bash

src/zenml/pipelines/run_utils.py

docs/book/how-to/steps-pipelines/yaml_configuration.md

When users run a pipeline with a fixed `run_name` in their config.yaml and then rerun the same pipeline, they would get a confusing database error about entity existence. This change catches both EntityExistsError and RuntimeError (with IntegrityError) specifically for duplicate run names and provides a much more helpful error message.

## Changes

- Add improved error handling in `create_placeholder_run()` to catch duplicate run name errors (both EntityExistsError and raw SQL IntegrityError)
- Provide actionable guidance with 3 specific solutions:
  1. Change the run_name to a unique value
  2. Use dynamic placeholders like `run_name: "my_run_{date}_{time}"`
  3. Remove the run_name to auto-generate unique names
- Add comprehensive unit tests to verify the improved error message
- Update documentation in yaml_configuration.md to warn about run name uniqueness

## User Experience

Instead of seeing confusing database errors, users now get:

```
Pipeline run name 'my_run_name' already exists in this project. Each pipeline run must have a unique name.

To fix this, you can:
1. Change the 'run_name' in your config file to a unique value
2. Use a dynamic run name with placeholders like: run_name: "my_run_name_{date}_{time}"
3. Remove the 'run_name' from your config to auto-generate unique names

For more information on run naming, see: https://docs.zenml.io/concepts/steps_and_pipelines/yaml_configuration#run-name
```

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

As suggested in PR review, this commit adds clearer YAML comments to the run_name placeholder examples to make it more obvious what each example demonstrates.

- Use TYPE_CHECKING to handle the optional sqlalchemy import properly
- Rename to SQLIntegrityError to avoid confusion with other exceptions
- This ensures mypy doesn't complain about assigning None to a type

- Change broad Exception catch to specific RuntimeError
- Add parentheses for clarity in boolean logic
- Align documentation wording with error message

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

schustmi · 2025-05-26T07:53:21Z

src/zenml/pipelines/run_utils.py

-    run, _ = Client().zen_store.get_or_create_run(run_request)
-    return run
+
+    try:


This should be handled in the SQLZenStore, not in this random place (which is only one occurrence where a run is created).

Specifying the run name in a config file is not the only way to do it; the message can simply be generic and talk about configuration instead of files.

…or-message

- Moved error handling from run_utils.py to sql_zen_store.py where it architecturally belongs
- Database-specific error handling now stays in the database layer
- Made error message more generic (removed specific mention of 'config file')
- Simplified run_utils.py by removing 40+ lines of error handling code
- Updated tests to reflect the new error handling location
- All code paths that create runs now benefit from improved error messages

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Remove EntityExistsError from Raises section since this function no longer explicitly raises exceptions - they are now handled in SQLZenStore.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

src/zenml/zen_stores/sql_zen_store.py

tests/unit/pipelines/test_run_utils.py

Previously the test only verified that mocking worked correctly. This adds a proper integration test that actually runs pipelines twice with the same name to verify the duplicate name detection behavior.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Wrap logs processing in session.no_autoflush to prevent premature IntegrityError during _get_reference_schema_by_id calls. This ensures duplicate name errors are properly caught by the try/except block and converted to helpful EntityExistsError messages.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Remove "when implemented" and "should now" comments since the improved error handling is already working. Simplify test to only expect EntityExistsError now that the fix is in place.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Copilot

Pull Request Overview

This PR improves the error messaging for duplicate pipeline run names by handling IntegrityError exceptions in SQLZenStore and updating integration tests and documentation to demonstrate the enhanced behavior.

Enhanced error handling in SQLZenStore to catch duplicate run names and present a user-friendly message
Fixed the SQLAlchemy autoflush issue to ensure errors are raised at the correct time
Updated integration tests and docs to reflect and validate the changes made

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File	Description
tests/integration/functional/pipelines/test_pipeline_run.py	Added an integration test to verify improved error handling for duplicate pipeline run names
src/zenml/zen_stores/sql_zen_store.py	Enhanced error message handling for duplicate run names and fixed the autoflush issue with session.no_autoflush
src/zenml/pipelines/run_utils.py	Minor update (additional newline)
docs/book/how-to/steps-pipelines/yaml_configuration.md	Added a warning hint to document the uniqueness requirement for run names and provide best practices

src/zenml/zen_stores/sql_zen_store.py

schustmi

Accidentally approved, see my latest comment

Move user-friendly error message logic from _create_run to get_or_create_run method. This removes fragile database-specific error message parsing and follows the established pattern where get_or_create_run handles user-facing errors.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

Remove code duplication by creating _get_duplicate_run_name_error_message helper method that generates the user-friendly error message for duplicate pipeline run names.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>

src/zenml/pipelines/run_utils.py

…or-message # Conflicts: # src/zenml/zen_stores/sql_zen_store.py

strickvl added the enhancement (New feature or request) and internal (To filter out internal PRs and issues) labels May 24, 2025

strickvl requested a review from schustmi May 24, 2025 17:30

strickvl requested a review from Copilot May 24, 2025 17:30

strickvl force-pushed the feature/better-error-message branch from b38ae4f to 32b4a7a Compare May 24, 2025 17:31

Copilot AI reviewed May 24, 2025

src/zenml/pipelines/run_utils.py (outdated)

docs/book/how-to/steps-pipelines/yaml_configuration.md

strickvl force-pushed the feature/better-error-message branch from 32b4a7a to c28564c Compare May 24, 2025 20:11

strickvl added 2 commits May 24, 2025 22:16

Add more specific examples to run_name documentation

0b903f7

Fix mypy type errors for IntegrityError import

659288c

strickvl requested a review from Copilot May 24, 2025 20:25

This comment was marked as outdated.

schustmi requested changes May 26, 2025

strickvl and others added 3 commits May 26, 2025 09:59

Merge remote-tracking branch 'origin/develop' into feature/better-err…

22bc668

…or-message

strickvl requested a review from schustmi May 26, 2025 11:41

schustmi requested changes May 26, 2025

src/zenml/zen_stores/sql_zen_store.py (outdated)

schustmi requested changes May 26, 2025

tests/unit/pipelines/test_run_utils.py (outdated)

htahir1 and others added 6 commits May 26, 2025 21:03

Merge branch 'develop' into feature/better-error-message

508a616

Merge branch 'develop' into feature/better-error-message

86307b4

Merge branch 'develop' into feature/better-error-message

816b870

strickvl requested a review from Copilot June 25, 2025 13:11

Copilot AI reviewed Jun 25, 2025

src/zenml/zen_stores/sql_zen_store.py (outdated)

strickvl requested a review from schustmi June 25, 2025 13:13

strickvl added the run-slow-ci label Jun 25, 2025

Merge branch 'develop' into feature/better-error-message

419a8e8

schustmi approved these changes Jul 3, 2025

src/zenml/zen_stores/sql_zen_store.py (outdated)

schustmi requested changes Jul 3, 2025

strickvl and others added 3 commits July 3, 2025 17:05

Merge branch 'develop' into feature/better-error-message

af7ae3c

strickvl commented Jul 3, 2025

src/zenml/pipelines/run_utils.py (outdated)

Update src/zenml/pipelines/run_utils.py

7b05e07

strickvl requested a review from schustmi July 4, 2025 06:45

schustmi approved these changes Jul 29, 2025

strickvl and others added 4 commits July 29, 2025 10:01

Merge remote-tracking branch 'origin/develop' into feature/better-err…

c15f153

…or-message # Conflicts: # src/zenml/zen_stores/sql_zen_store.py

Merge branch 'develop' into feature/better-error-message

7bfe723

Merge branch 'develop' into feature/better-error-message

03700a9

Merge branch 'develop' into feature/better-error-message

8107860

strickvl merged commit fd0dc66 into develop Sep 23, 2025
76 of 79 checks passed

strickvl deleted the feature/better-error-message branch September 23, 2025 10:43
