Add cgroups CPU quota and throttling metrics #1039

mkapalka · 2025-05-09T15:06:21Z

Add metrics related to CPU quotas and CPU throttling (Linux CFS bandwidth control), as well as the total CPU usage from Linux cgroups CPU accounting. Those metrics can be useful in multi-tenant cloud environments, in particular on Elastic Cloud nodes that use CPU boosting (vCPU credits).

Add metrics related to CPU quotas and CPU throttling (Linux CFS bandwidth control), as well as the total CPU usage from Linux cgroups CPU accounting. Those metrics can be useful in multi-tenant cloud environments, in particular on Elastic Cloud nodes that use CPU boosting (vCPU credits). Signed-off-by: Michal Kapalka <[email protected]>

mkapalka · 2025-06-02T11:00:03Z

@SuperQ @sysadmind it would be great if you could have a look at this PR and tell me if there's anything missing that I should add. Thanks in advance!

mkapalka · 2025-08-08T13:55:21Z

Update: we have been using this branch successfully in production for quite some time now and those new metrics are very helpful, maybe it's worth merging this PR to make it easier for others to benefit from this as well? @SuperQ @sysadmind

sysadmind

I'm not sure why this PR hasn't triggered the CI process. You may need to rebase, or when you update this PR that might trigger the CI to run.

sysadmind · 2025-08-13T02:06:48Z

collector/nodes.go

@@ -286,6 +286,66 @@ func NewNodes(logger *slog.Logger, client *http.Client, url *url.URL, all bool,
 				},
 				Labels: defaultNodeLabelValues,
 			},
+			{


I think these metrics should be converted to seconds. It's typical practice for prometheus metrics to always be in base units - https://prometheus.io/docs/practices/naming/#metric-names

mkapalka force-pushed the feature/add-cgroups-cpu-stats branch from ad384a0 to 56195f7 Compare May 14, 2025 09:04

sysadmind requested changes Aug 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add cgroups CPU quota and throttling metrics #1039

Add cgroups CPU quota and throttling metrics #1039

Uh oh!

mkapalka commented May 9, 2025

Uh oh!

mkapalka commented Jun 2, 2025

Uh oh!

mkapalka commented Aug 8, 2025

Uh oh!

sysadmind left a comment

Uh oh!

sysadmind Aug 13, 2025

Uh oh!

Uh oh!

Add cgroups CPU quota and throttling metrics #1039

Are you sure you want to change the base?

Add cgroups CPU quota and throttling metrics #1039

Uh oh!

Conversation

mkapalka commented May 9, 2025

Uh oh!

mkapalka commented Jun 2, 2025

Uh oh!

mkapalka commented Aug 8, 2025

Uh oh!

sysadmind left a comment

Choose a reason for hiding this comment

Uh oh!

sysadmind Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!