Conversation

@clumens (Contributor) commented Sep 2, 2025

This is exactly the same as #3801 except for changing three calls to deprecated functions and rebasing on main. I'll be making additional changes from patch review here in the future. I don't see any way to do this that isn't confusing.

@clumens clumens changed the title Erase transient attributes Erase transient attributes in attribute manager instead of controller Sep 5, 2025
@nrwahl2 (Contributor) left a comment

Looks pretty good. The remaining questions/uncertainties, which I don't expect us to have answers for:

  • CIB caching discrepancies: #3801 (comment)
    • Might be worth adding a comment about our uncertainty here like we did for the boolean argument
  • Whether constructing the GRegex on each function call is costly enough to justify constructing it only once: #3801 (comment)
  • Consistency among nodes and daemons: #3801 (comment)
  • More about caching (you've already added a TODO comment): #3801 (comment)

And I'd like to try to come up with a commit message improvement related to #3801 (comment).

There's a merge conflict that needs to be resolved too. That's all I've got.

@clumens (Contributor, Author) commented Sep 17, 2025

Pushed once to address review comments.

Ken Gaillot and others added 11 commits September 17, 2025 10:42
Previously, when the attribute manager purged a node, it would purge the
node's transient attributes only from memory, and assumed the controller
would purge them from the CIB. Now, the writer will purge them from the
CIB as well.

This happens by setting the values to NULL rather than dropping them
from memory. If there is a writer, it will also erase the node's entire
transient attributes section. If there is no writer, the next elected
writer will erase each value as part of its normal election-win
write-out. In either case, all nodes will drop the NULL values from
memory after getting the notification that the erasure succeeded.

This fixes a variety of timing issues when multiple nodes including the
attribute writer are shutting down. If the writer leaves before some
other node, the DC wipes that other node's attributes from the CIB when
that other node leaves the controller process group (or all other nodes
do if the DC is the leaving node). If a new writer (possibly even the
node itself) is elected before the node's attribute manager leaves the
cluster layer, it will write the attributes back to the CIB. Once the
other node leaves the cluster layer, all attribute managers remove its
attributes from memory, but they are now "stuck" in the CIB.

As of this commit, the controller still erases the attributes from the CIB when
the node leaves the controller process group, which is redundant but doesn't
cause any new problems and will be corrected in an upcoming commit.

Note: This will cause an insignificant regression if backported to
Pacemaker 2. The Pacemaker 2 controller purges attributes from the CIB
for leaving DCs only if they are at version 1.1.13 or later, because
earlier DCs will otherwise get fenced after a clean shutdown. Since the
attribute manager doesn't know the DC or its version, the attributes
would now always be wiped, so old leaving DCs will get fenced. The
fencing would occur only in the highly unlikely situation of a rolling
upgrade from Pacemaker 2-supported versions 1.1.11 or 1.1.12, and the
upgrade would still succeed without any negative impact on resources.

Co-Authored-By: Reid Wahl <[email protected]>
Co-Authored-By: Chris Lumens <[email protected]>

Fixes T138

This hasn't been needed since attrd_cib_updated_cb() started ignoring changes
from trusted sources (!cib__client_triggers_refresh()). It's especially useless
now that the attribute manager handles clearing of leaving nodes' attributes
from the CIB.

Now that the attribute manager will erase transient attributes from the CIB
when purging a node, we don't need to do that separately in the controller.

…from cache

Nothing uses the new capability yet.
With recent changes, the attribute manager now handles it when the node
leaves the cluster, so the controller purge is redundant.

This does alter the timing somewhat, since the controller's purge
occurred when the node left the controller process group, while the
attribute manager's purge occurs when it leaves the cluster, but that
shouldn't make a significant difference.

This fixes a problem when a node's controller crashes and is respawned
while fencing is disabled. Previously, another node's controller would
remove that node's transient attributes from the CIB, but they would
remain in the attribute managers' memory. Now, the attributes are
correctly retained in the CIB in this situation.

Fixes T137
Fixes T139

Co-Authored-By: Chris Lumens <[email protected]>

... instead of wiping from CIB directly

Co-Authored-By: Chris Lumens <[email protected]>

It now boils down to a bool for whether we want only unlocked resources.

... to controld_delete_node_history(), and
controld_node_state_deletion_strings() to
controld_node_history_deletion_strings(),
since they only delete history now.

This has only ever had two values, which basically just means it's a
bool.
@clumens (Contributor, Author) commented Sep 17, 2025

And again to rebase on main and resolve the merge conflicts.

@clumens (Contributor, Author) commented Sep 17, 2025

> Looks pretty good. The remaining questions/uncertainties, which I don't expect us to have answers for:

Additionally:

  • Someone needs to check out the rolling upgrade testing that Ken mentioned. Ugh.
  • I need to look at whether transient attributes are preserved for remote nodes that get restarted. I'd love to come up with a test for this, since it's the sort of thing we will one day unintentionally break since we don't remember that it's something we support.

@clumens clumens added the review: in progress PRs that are currently being reviewed label Sep 26, 2025
@clumens (Contributor, Author) commented Oct 13, 2025

> I need to look at whether transient attributes are preserved for remote nodes that get restarted. I'd love to come up with a test for this, since it's the sort of thing we will one day unintentionally break since we don't remember that it's something we support.

Tested both full Pacemaker nodes (https://projects.clusterlabs.org/T132#16531) and Pacemaker Remote nodes (https://bugzilla.redhat.com/show_bug.cgi?id=2030869#c21) first on main, and then with these patches. Everything appears to be working as it should.

I haven't thought of a way to automate these tests yet.

@clumens (Contributor, Author) commented Oct 14, 2025

Tested T137, T138, T139, and the original shutdown-related attribute bugs (linked in T137). I think at this point all that's left is checking out the rolling upgrade thing.

@clumens (Contributor, Author) commented Oct 15, 2025

I'm working on the rolling upgrade testing right now, so I want to leave some notes for myself (and anyone else who cares) here for future reference. Consider this comment from #3801:

> Some simple rolling upgrade tests look good except for one minor issue. In a mixed-version cluster, a deleted attribute for a leaving node will show up in queries with an empty value rather than not showing up at all. That implies the NULL values aren't being dropped from memory as desired in drop_removed_values() when the node leaves. The (older) DC should still erase the transient attributes in that case (which appears to be working), and the CIB notification for that should still be received (which may not be) and handled correctly, so I'm not sure what's going on. The notification and handling is working when a single attribute is dropped.

My setup is four nodes - three full pacemaker nodes and one remote node. Let's just call them node1, node2, node3, and remote1. remote1 doesn't really matter for testing this, but having a remote node is part of my standard test cluster so whatever.

I also think it's important to note that for me, node3 is the DC. I think we don't want to touch the DC at all during testing.

First, build packages from the main branch and install them on all nodes. Start the cluster. Then, switch to the branch for this PR and add a patch that increases the minor-minor version of CRM_FEATURE_SET. A difference in that value (which is the #feature-set attribute) is what causes a mixed-version cluster.

Set a transient attribute on node2:

[node1]# attrd_updater -n XYZ -U ABC -N node2

Put node1 into standby, wait for any resources on it to migrate off, stop the cluster, and then copy over the new packages and install them. These are basically the rolling upgrade instructions (https://clusterlabs.org/projects/pacemaker/doc/2.1/Pacemaker_Administration/html/upgrading.html#rolling-node-by-node).

Now start the cluster on node1 and take it out of standby. You should now have a MIXED-VERSION cluster according to crm_mon. At this point, no additional package installation is necessary to reproduce the issue described in the quoted comment above.

Now, put node2 in standby, wait for resources to migrate off, and stop the cluster on it. DO NOT upgrade it to the new packages. Query the value of the transient attribute on both node1 (which was upgraded to the packages built off this PR) and on node3 (the DC, which was not):

[node1]# attrd_updater -n XYZ -A
name="XYZ" host="node2" value=""
[node3]# attrd_updater -n XYZ -A
attrd_updater: Could not query value of XYZ: attribute does not exist

We would expect node1 to have the same output as node3; that discrepancy is what Ken is referring to in that comment. I haven't tested this yet, but I believe the debug cycle looks like:

  • Bring node2 out of standby and start the cluster on it again.
  • Probably stop the cluster everywhere to clear everything out.
  • Do whatever coding needs to be done on this PR, build new packages, and install them only on node1.
  • Start the cluster everywhere again.
  • Add a transient attribute to node2.
  • Put node2 in standby and stop the cluster on just that node.
