Cluster hardening app via ArgoCD using sync-waves between/across apps #39

ghost · 2025-09-04T22:09:00Z

This deploys a ScanSetting and a ScanSettingBinding for the profile defined in values.yaml, in a sync-wave that runs before the other layer-0, 1, or 2 apps.

mhjacks · 2025-09-05T19:29:53Z

charts/compliance-scanning/templates/remediation-job.yaml

+---
+# ClusterRole with permissions to manage compliance resources
+apiVersion: rbac.authorization.k8s.io/v1
+kind: ClusterRole


Does this need to be a ClusterRole? If all the objects you're touching are in openshift-compliance, it might not need to be.

It may not need to be. That I set it that way shows my unfamiliarity with the remediation controller in the Compliance Operator. I scoped every element to the openshift-compliance namespace, but I don't know whether namespace-scoped authorization is sufficient for something that can reboot nodes during remediation.
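
If the remediation job only reads and patches Compliance Operator resources inside openshift-compliance, a namespaced Role and RoleBinding may be enough. A minimal sketch, assuming a hypothetical service account named remediation-job; the resource and verb lists are a guess at what the script needs, and whether namespace scope really suffices is still the open question above:

```yaml
# Sketch only: all names are hypothetical, not taken from the chart.
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: remediation-job
  namespace: openshift-compliance
rules:
  # Read scans and remediations, and patch spec.apply on remediations.
  - apiGroups: ["compliance.openshift.io"]
    resources: ["compliancescans", "complianceremediations"]
    verbs: ["get", "list", "watch", "patch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: remediation-job
  namespace: openshift-compliance
subjects:
  - kind: ServiceAccount
    name: remediation-job
    namespace: openshift-compliance
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: remediation-job
```

Under this assumption the job would not need node-level permissions itself, since it only flips spec.apply and leaves applying the change to the operator.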

mhjacks · 2025-09-05T19:33:30Z

charts/compliance-scanning/templates/scan-setting-binding.yaml

+  name: {{ .Values.compliance.scanSettingBinding.name }}
+  namespace: openshift-compliance
+  annotations:
+    argocd.argoproj.io/sync-wave: '-10'


We generally only use sync-waves when not using them leads to errors or other problems with deployments. I am very unfamiliar with the compliance operator, so they may indeed be necessary. But if they aren't it's better not to have them. (As discussed we definitely need them in clustergroup, though).

This was me trying to ensure that the scan resources were in place before the results viewer and the remediation updaters became available.
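
For reference, Argo CD applies lower sync-wave numbers first, and resources without the annotation default to wave 0. A sketch of the ordering intended here, assuming a hypothetical binding name, a placeholder job image, and an arbitrary later wave number:

```yaml
# Wave -10: the binding is applied before anything in the default wave 0.
apiVersion: compliance.openshift.io/v1alpha1
kind: ScanSettingBinding
metadata:
  name: stig-compliance            # hypothetical name
  namespace: openshift-compliance
  annotations:
    argocd.argoproj.io/sync-wave: "-10"
profiles:
  - name: rhcos4-stig
    kind: Profile
    apiGroup: compliance.openshift.io/v1alpha1
settingsRef:
  name: default
  kind: ScanSetting
  apiGroup: compliance.openshift.io/v1alpha1
---
# Wave 5 (arbitrary): applied only after the earlier waves have synced.
apiVersion: batch/v1
kind: Job
metadata:
  name: remediation-job
  namespace: openshift-compliance
  annotations:
    argocd.argoproj.io/sync-wave: "5"
spec:
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: remediate
          image: registry.example.com/tools:latest   # placeholder image
          command: ["/bin/sh", "-c", "echo placeholder"]
```

As far as I know, a wave only waits for the previous wave's resources to be applied and reported healthy. Without a custom health check for the compliance CRs, that does not mean the scan itself has finished.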

ghost · 2025-09-08T14:11:46Z

Compliance scans currently run, and remediations are identified.
Problem #1: enabling automated remediation doesn't actually perform a remediation. All the complianceremediation CRs are marked with spec.apply: false.
Problem #2: when I create a remediation job that looks for those CRs and patches them to spec.apply: true, the script fails to execute any kubectl/oc commands (I am using the VP team's imperative-container) because it cannot find a KUBECONFIG to use to connect to the cluster.
Problem #3: when I set problem #2 aside and test the bash script independently, it does run (because I am logged into the cluster) and changes spec.apply to true, but the remediation status doesn't change from Pending to Applied. The remediation controller never does anything after the patch to spec.apply.
That third problem may be due to my running it from a local oc login. However, I am logged in as a cluster-admin.
I am only literature-deep on the Compliance Operator scanning. Per the docs, setting compliance.openshift.io/default-auto-apply: "true" should have been enough to run the remediations without the extra effort of creating a remediation-job that runs a shell script inside a container image to oc patch the system. That does not appear to be the case.
Another set of eyes (and mind) would be very much appreciated.
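
For comparison, automatic remediation can also be requested on the ScanSetting itself. A minimal sketch, assuming a hypothetical name; the field names follow the Compliance Operator ScanSetting API as I understand it, where these fields sit at the top level rather than under spec:

```yaml
apiVersion: compliance.openshift.io/v1alpha1
kind: ScanSetting
metadata:
  name: hardening-auto-apply       # hypothetical name
  namespace: openshift-compliance
# Ask the operator to apply remediations itself instead of patching each CR.
autoApplyRemediations: true
# Also apply updated remediations when the profile content changes.
autoUpdateRemediations: true
roles:
  - master
  - worker
```

The ScanSettingBinding would then point its settingsRef at this ScanSetting.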

ghost · 2025-09-08T14:13:48Z

This shows what things look like with the current PR deployed. Scans have run and remediations are identified. My remediation-job will eventually time out because the bash script fails to get a proper env for a KUBECONFIG to use when executing the kubectl commands in the script.

ghost · 2025-09-08T14:40:22Z

Updated the PR to remove the env for KUBECONFIG; the imperative-container now executes the bash script in the remediation job and can use oc/kubectl commands.
I'm still not seeing the remediation controller act on the change to spec.apply: true, so while I have addressed Problem #2 from above, Problem #3 still exists.
The script runs and the patch is applied to the complianceremediation CRs. Then nothing...
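
The working setup can be sketched as a Job that relies on the pod's mounted service account token rather than a KUBECONFIG env. Assumptions: the service account name and image reference are placeholders, and the loop is a simplified stand-in for the actual script:

```yaml
apiVersion: batch/v1
kind: Job
metadata:
  name: remediation-job
  namespace: openshift-compliance
spec:
  backoffLimit: 2
  template:
    spec:
      serviceAccountName: remediation-job   # hypothetical; needs patch rights on the CRs
      restartPolicy: Never
      containers:
        - name: remediate
          image: registry.example.com/imperative-container:latest   # placeholder image
          command: ["/bin/bash", "-c"]
          args:
            - |
              set -euo pipefail
              # No KUBECONFIG is set: oc falls back to the in-cluster
              # service account credentials mounted into the pod.
              for r in $(oc get complianceremediations -n openshift-compliance -o name); do
                oc patch "$r" -n openshift-compliance --type=merge -p '{"spec":{"apply":true}}'
              done
```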

ghost · 2025-09-08T14:47:20Z

All of the complianceremediation CRs have been updated by the remediation job to have spec.apply set to true. However, every status is still Pending after the patch.

ghost · 2025-09-08T17:31:18Z

It seems I uncovered an edge case where automated remediation fails silently on ROSA clusters. Things work as expected on a non-ROSA cluster. I spun up AWS 4.18.22 with clusterbot, and remediations run automatically with the ScanSetting and ScanSettingBinding as defined (I did remove the annotation on the ScanSettingBinding). A lot of complianceremediations run, but all show either Applied or Missing Dependencies. Overall, the compliancescans show RESULT as NON-COMPLIANT. Is there a way to automatically trigger another scan after remediations?
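
On re-scanning: as far as I know there are two documented routes. A ScanSetting schedule re-runs the scans on a cron, and annotating a ComplianceScan with compliance.openshift.io/rescan requests a one-off re-run. A sketch of the scheduled variant, assuming a hypothetical ScanSetting name:

```yaml
apiVersion: compliance.openshift.io/v1alpha1
kind: ScanSetting
metadata:
  name: hardening-scheduled        # hypothetical name
  namespace: openshift-compliance
# Standard cron syntax: re-scan daily at 02:00 so later results
# reflect whatever remediations have been applied in the meantime.
schedule: "0 2 * * *"
roles:
  - master
  - worker
```

The one-off route is an empty-valued annotation, for example `oc -n openshift-compliance annotate compliancescans/<scan-name> compliance.openshift.io/rescan=`. Neither route is triggered by remediations finishing, so a re-scan may have to wait until the affected nodes have settled.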

ghost · 2025-09-08T18:59:12Z

Since 15 of these commits were attempts to figure out why auto-remediation wasn't working, I am going to close this PR, rebase, and add only the 5 files required along with the updates to values-hub.yaml.

Phil Osip added 12 commits September 4, 2025 14:31

Revised initial commit for cluster hardening leveraging the compliance operator and existing scan profiles. This app is loaded first by Argo based on leveraging sync-wave annotation in the values-hub.yaml

05cee51

Updated the ScanSetting and ScanSettingBinding yaml to not stick stuff in spec: and then ran linter against updated content

cc6b3ce

Removed resource limits from scan bindings and result viewer

4e181be

Removed extra line from values.yaml

20dcc46

Removed extra api refs

7d5ee8f

Modified calling of single profile for scansettingbinding

a3a7053

Removed apiGroup from profiles

fff995d

Removed apiVersion from settingsRef

971563c

Removed tolerations from result viewer

8f0755d

Updated to use the rhcos4-stig profile

123af1d

Setting a daily at 0200 cron to see if I can get setting to run on initial launch

88be04a

Updated comment for schedule field

e2dfaf3

sabre1041 requested changes Sep 5, 2025

charts/compliance-scanning/Chart.yaml (outdated)

charts/compliance-scanning/templates/scan-setting-binding.yaml (two threads)

Phil Osip added 16 commits September 5, 2025 08:58

Updated description and reintroduced apiGroup elements in scan-setting-binding

732c8bf

Updated cron format for schedule

e4e1fbb

Added roles for master and worker to scansetting since they are not inherited

06095df

Fixed formatting issue on roles

f168626

Removing schedule cron entry so it only runs on deployment of pattern

bbc2021

Used Cursor to generate a job that waits on execution of compliancescan and then for each complianceremediation generated it changes default state of spec.apply false to spec.apply true and then merges that back into cluster to take effect

5992d24

Needs to run as root. We can revisit this in review

d8bfb6d

Updated image refs to ubi9 unauthenticated access container images

202ada7

Removed cursor generated securitycontext so we inherit from default cluster settings

a891903

Updated remediation job to not try and dnf install jq and kubectl but instead use vp container image that has tools preloaded

63adfa5

Modified the image to pull from older quay vp registry org

4ef6068

Updated the compliancescan naming convention to be profile-role

821a784

Fixed while loop logic to get compliancescans correctly

4145d90

Removed brackets around while

71ca8c0

Script fixes for value checks in remediation job

f9caa15

Updated bash script to patch the complianceremediations CRs with spec.apply set to True

d45f55f

mhjacks reviewed Sep 5, 2025

Phil Osip added 2 commits September 8, 2025 09:19

Updated version to chart

187763d

Removed env setting for bash script access to KUBECONFIG based on discussion with VP team and no need to do that in other examples

d5d6a1d

Removed reasonApplied for kubectl create event at end of bash script for remediation-job

71b7522

Removed the autoremediation annotation on scansettingbinding per discussion with compliance operator team. It should get generated at runtime

96ece4f

Removed the result viewer pod since user can see results directly from ComplianceCheckResults in Compliance Operator

e037fdb

ghost requested a review from sabre1041 September 8, 2025 18:27

ghost closed this Sep 8, 2025
