Skip to content

retry on monitorhub launch failure#67

Open
ColinOrionChandler wants to merge 1 commit intomainfrom
monitoring_retries
Open

retry on monitorhub launch failure#67
ColinOrionChandler wants to merge 1 commit intomainfrom
monitoring_retries

Conversation

@ColinOrionChandler
Copy link
Collaborator

Added try/except to retry launching the workflow for cases where the MonitorHub fails to start (typically because of lag on the cluster).

Also amended the number of compute nodes in the USDF configuration for Roma.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants