@danpawlik danpawlik commented Oct 29, 2025

The "oc adm must-gather" command sometimes takes longer
than expected, so the CI job times out and log collection
is interrupted. In some places, we spotted that the
'--timeout' parameter of the 'oc adm' command is simply ignored:

TASK [os_must_gather : Run openstack-must-gather command
    output_dir={{ cifmw_os_must_gather_output_dir }}/artifacts,
    script=oc adm must-gather --image {{ cifmw_os_must_gather_image }}
    --timeout {{ cifmw_os_must_gather_timeout }}
    --host-network={{ cifmw_os_must_gather_host_network }}
    --dest-dir {{ cifmw_os_must_gather_output_log_dir }}
    -- ADDITIONAL_NAMESPACES={{ cifmw_os_must_gather_additional_namespaces }}
       OPENSTACK_DATABASES=$OPENSTACK_DATABASES SOS_EDPM=$SOS_EDPM SOS_DECOMPRESS=$SOS_DECOMPRESS gather 2>&1]
POST-RUN END RESULT_TIMED_OUT: [untrusted : playbooks/baremetal/collect-logs.yaml@main]

so let's also put a timeout in front of the 'oc adm' command to make
sure that it is "killed" within a reasonable amount of time.
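
For illustration only, here is a minimal sketch of the idea, not the actual change in this PR. It assumes GNU coreutils `timeout` is available on the host; `run_with_deadline` is a hypothetical helper name, and `sleep 10` stands in for a slow `oc adm must-gather` invocation.

```shell
#!/usr/bin/env bash
# Sketch: bound a long-running command with coreutils `timeout`.
# run_with_deadline is a hypothetical helper, not part of the os_must_gather role.
run_with_deadline() {
    local limit="$1"
    shift
    # Send SIGTERM once the limit is reached; if the command is still
    # alive 5 seconds later, follow up with SIGKILL.
    timeout --kill-after=5s "$limit" "$@"
}

# Stand-in for a slow `oc adm must-gather ...` call.
rc=0
run_with_deadline 1s sleep 10 || rc=$?

# GNU timeout exits with status 124 when the time limit was hit.
echo "exit status: $rc"
```

The outer limit would presumably be set somewhat above the value passed to '--timeout', so that 'oc adm' still gets a chance to stop on its own first.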

OSPRH-21369

@danpawlik danpawlik requested a review from a team as a code owner October 29, 2025 13:09

openshift-ci bot commented Oct 29, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

stuggi previously approved these changes Oct 29, 2025

@stuggi stuggi left a comment

/lgtm

The "oc adm must-gather" command sometimes takes longer
than expected, so the CI job times out and log collection
is interrupted. In some places, we spotted that the
'--timeout' parameter of the 'oc adm' command is simply ignored:

    TASK [os_must_gather : Run openstack-must-gather command
        output_dir={{ cifmw_os_must_gather_output_dir }}/artifacts,
        script=oc adm must-gather --image {{ cifmw_os_must_gather_image }}
        --timeout {{ cifmw_os_must_gather_timeout }}
        --host-network={{ cifmw_os_must_gather_host_network }}
        --dest-dir {{ cifmw_os_must_gather_output_log_dir }}
        -- ADDITIONAL_NAMESPACES={{ cifmw_os_must_gather_additional_namespaces }}
           OPENSTACK_DATABASES=$OPENSTACK_DATABASES SOS_EDPM=$SOS_EDPM SOS_DECOMPRESS=$SOS_DECOMPRESS gather 2>&1]
    POST-RUN END RESULT_TIMED_OUT: [untrusted : playbooks/baremetal/collect-logs.yaml@main]

so let's also put a timeout in front of the 'oc adm' command to make
sure that it is "killed" within a reasonable amount of time.

Signed-off-by: Daniel Pawlik <[email protected]>

@fmount fmount left a comment

/lgtm

@softwarefactory-project-zuul

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://softwarefactory-project.io/zuul/t/rdoproject.org/buildset/b05b0f1a21754500a576a05ed9c44cb1

✔️ openstack-k8s-operators-content-provider SUCCESS in 1h 55m 43s
podified-multinode-edpm-deployment-crc FAILURE in 36m 15s
✔️ cifmw-crc-podified-edpm-baremetal SUCCESS in 1h 31m 50s
✔️ cifmw-pod-zuul-files SUCCESS in 4m 46s
✔️ noop SUCCESS in 0s
✔️ cifmw-pod-ansible-test SUCCESS in 9m 26s
cifmw-pod-pre-commit FAILURE in 6m 40s
✔️ build-push-container-cifmw-client SUCCESS in 17m 20s
✔️ cifmw-molecule-os_must_gather SUCCESS in 20m 32s

@danpawlik

Need to test it first.

@danpawlik

Checking in testproject 1675

@danpawlik

recheck

@danpawlik danpawlik merged commit ae5f34c into openstack-k8s-operators:main Nov 4, 2025
6 checks passed
@danpawlik danpawlik deleted the add-timeout-must-gather branch November 4, 2025 07:36