Skip to content

Conversation

@cweibel
Copy link
Contributor

@cweibel cweibel commented Jan 6, 2026

Changes proposed in this pull request:

  • Runs a daily check to see of resurrection is enabled for each BOSH director, if not, drops a warning in Slack. With resurrection enabled, if a VM is unreachable after 30 seconds BOSH health monitor will replace it automatically without operator intervention (aka self healing)
  • Added master and tooling creds to /concourse/main/bosh-director-info on concourse-credhub-production so we don't have to manage access via individual per pipeline bosh creds. Will consolidate these over time.
  • Been meaning to do this for 2 years, finally resolves a sticky note I've had that long. Yay IP sprints.
  • Part of https://github.com/cloud-gov/product/issues/2836

security considerations

None, secrets, as noted above, are stashed in credhub.

@cweibel cweibel requested a review from a team as a code owner January 6, 2026 16:36
@cweibel cweibel changed the title Add Resurrection checks Add BOSH Resurrection checks Jan 6, 2026
@cweibel cweibel merged commit b9ea223 into main Jan 6, 2026
5 checks passed
@cweibel cweibel deleted the resurrection-checks branch January 6, 2026 16:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants