Skip to content

Conversation

@raresgaia123
Copy link
Contributor

@raresgaia123 raresgaia123 commented Jan 16, 2026

Description

If some worker can't be recovered, logic is to remove the pipeline of these workers and check if the job is still able to run. In the case of incomplete data loop after pipeline removal, the job will transition to Failing state. Failing state will then wait for all workers to have certain status before transitioning to Failed state. The bug was in the way we updated the worker status in job context. _reconcile_wrk_status was using workers from wrong source - class attribute 'self._new_cfg' instead of 'new_cfg' received as argument.

Type of Change

  • Bug Fix
  • New Feature
  • Breaking Change
  • Refactor
  • Documentation
  • Other (please describe)

Checklist

  • I have read the contributing guidelines
  • Existing issues have been referenced (where applicable)
  • I have verified this change is not present in other open pull requests
  • Functionality is documented
  • All code style checks pass
  • New code contribution is covered by automated tests
  • All new and existing tests pass

If some worker can't be recovered, logic is to remove the pipeline of these workers and check if the job is still able to run. In the case of incomplete data loop after pipeline removal, the job will transition to Failing state. Failing state will then wait for all workers to have certain status before transitioning to Failed state. The bug was in the way we updated the worker status in job context. _reconcile_wrk_status was using workers from wrong source - class attribute 'self._new_cfg' instead of 'new_cfg' received as argument.
@myungjin myungjin merged commit 12d0924 into cisco-open:main Jan 16, 2026
1 check passed
@raresgaia123 raresgaia123 deleted the failing_stop branch January 20, 2026 08:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants