Fail mark-for-deployment when re-deploying same version without --wait-for-deployment by cuza · Pull Request #4307 · Yelp/paasta

cuza · 2026-05-07T14:19:37Z

Prevent deployment when attempting to redeploy the same version without the --wait-for-deployment flag.

…t-for-deployment

nemacysts · 2026-05-08T20:50:11Z

-        )
-        print(deployment_version)
-        print("Continuing anyway.")
+        if not args.block:


i think we'd want to flip this, no?
args.block is True with --wait-for-deployment, and we want to wait until the deploy group is healthy in that case rather than forging ahead?

nemacysts · 2026-05-08T20:55:52Z

+                f"what is set to be deployed in deploy group {deploy_group}:"
+            )
+            print(f"  {deployment_version}")
+            print("Checking if all instances are healthy before proceeding...")


i think we might also want to do something slightly different here - i think we probably want to then essentially pretend that we're doing a normal --wait-for-deployment bounce and poll until the deploy group is empty (and then timeout after whatever we have the usual timeout set to)

i.e., we want to treat an unhealthy deploy group as if it was previously on another version and wait until the "new" (really the same version, we're just re-polling again) version is healthy before continuing

(and if --wait-for-deployment is not set, then don't do anything different: just yolo as usual)

… same version

…config for deployment validation

nemacysts · 2026-05-19T13:46:55Z

+        instance_health = [
+            check_if_instance_is_done(
+                service=service,
+                instance=instance_config.get_instance(),
+                cluster=cluster,
+                version=deployment_version,
+                instance_config=instance_config,
+            )
+            for cluster, instance_configs in instance_configs_per_cluster.items()
+            for instance_config in instance_configs
+        ]
+        all_healthy = all(instance_health)
+        if all_healthy:
+            print(
+                "All instances are healthy at this version. "
+                "Safe to proceed to the next deploy group."
+            )
+            return 0
+        else:
+            print(
+                "Error: Not all instances are healthy for this version. "
+                "A previous deploy may have failed or timed out. "
+                "Not safe to proceed to the next deploy group."
+            )
+            return 1


might be worth essentially doing what a normal m-f-d does nad run this logic in a loop until all the instances are healthy (or i guess some percentage are healthy - i think we only require bounce_margin_factor % of instances to be healthy to proceed?) rather than doing the check once and exiting

i think doing the logic in a loop would probably also allow us to remove the special-casing here since if everything is healthy, we'd excit that loop immediately and if not, we'd keep rechecking

cuza added 2 commits May 7, 2026 07:18

Fail mark-for-deployment when re-deploying same version without --wai…

94dc596

…t-for-deployment

Add health check for instances when re-deploying the same version

1b08ffa

cuza mentioned this pull request May 8, 2026

Add PaaSTA Playground Claude Skill #4309

Merged

cuza requested review from ilkinmammadzada and nemacysts May 8, 2026 15:34

nemacysts reviewed May 8, 2026

View reviewed changes

cuza added 6 commits May 12, 2026 06:10

checking instance health before proceeding with re-deployments of the…

c864e40

… same version

Refactor instance health check to use a list comprehension for clarity

4060302

Update tests to use check_if_instance_is_done and load_system_paasta_…

3f785b7

…config for deployment validation

Add mock for get_currently_deployed_version in mark-for-deployment test

76c1b9b

Merge branch 'master' into u/cuza/PAASTA-18862

76e6497

Merge branch 'master' into u/cuza/PAASTA-18862

1a04794

nemacysts reviewed May 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail mark-for-deployment when re-deploying same version without --wait-for-deployment#4307

Fail mark-for-deployment when re-deploying same version without --wait-for-deployment#4307
cuza wants to merge 8 commits into
masterfrom
u/cuza/PAASTA-18862

cuza commented May 7, 2026

Uh oh!

nemacysts May 8, 2026

Uh oh!

nemacysts May 8, 2026

Uh oh!

nemacysts May 8, 2026

Uh oh!

nemacysts May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cuza commented May 7, 2026

Uh oh!

nemacysts May 8, 2026

Choose a reason for hiding this comment

Uh oh!

nemacysts May 8, 2026

Choose a reason for hiding this comment

Uh oh!

nemacysts May 8, 2026

Choose a reason for hiding this comment

Uh oh!

nemacysts May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants