Feature/dcgm gpu backend #1391

AssimilatedCoder · 2025-12-04T15:23:16Z

Added Nvidia unified memory architecture based GB (Grace Blackwell) GPU support by using DCGM support. Tested on Nvidia DGX Spark (GB10)

- Introduce a new DCGM-backed NVIDIA GPU collector on Linux that populates the existing Gpu::gpu_info structures using dcgmUpdateAllFields and dcgmGetLatestValuesForFields. - Prefer DCGM over NVML when built with -DBTOP_DCGM=ON and libdcgm is available, while keeping NVML as a transparent fallback on systems without DCGM. - Track a unified nvidia_device_count so AMD (ROCm SMI) and Intel GPU backends stack correctly after whichever NVIDIA backend is active. - Expose a new CMake option BTOP_DCGM and link libdcgm when enabled, keeping GPU runtime behaviour controlled via existing shown_gpus and show_gpu_info config options. - Update README GPU compatibility and CMake documentation to describe the DCGM backend, including usage on DGX Spark / data center GB-series systems and how to enable it. Tests: - Built with -DBTOP_GPU=ON -DBTOP_DCGM=ON on Linux and verified that btop runs with DCGM present (DGX-style node) and falls back to NVML or no NVIDIA GPUs when DCGM is unavailable.

…rectly.

GPU name retrieval (lines 1290-1305): Added dcgmGetDeviceAttributes() to get proper GPU names, with the same cleanup logic as NVML supported_functions initialization (lines 1482-1496): During collect<1>, supported_functions is now set based on which fields returned valid data, with unsupported features (pwr_state, pcie_txrx, encoder/decoder) explicitly set to false Empty deque fallback (lines 1499-1509): All deques now guaranteed to have at least one value (0) to prevent .back() crashes

deckstose

This lacks Makefile support.

In my opinion the changes can be split out of btop_collect into their own file.

What's up with the changes to the utility functions? Please undo them.

aristocratos · 2025-12-04T19:45:53Z

@deckstose
Likely AI coded.

Thinking about adding rules that PR's that are obviously vide coded should be dismissed unless the author has some proof that they actually understand the code in the PR (like for example that they have other repositories in C++ that aren't vibe coded).

deckstose · 2025-12-04T19:51:36Z

@aristocratos

I totally agree

aristocratos · 2025-12-04T19:56:19Z

@deckstose
Have updated CONTRIBUTING.md:

Submissions where the majority of the code is AI generated must be marked with [AI generated].

"Vibe coded" PR's where it seems like the author doesn't understand the generated code will be dismissed.

ogcadbane and others added 10 commits December 1, 2025 16:37

updates

6cde828

updates to dcgm backend

1e7809c

This commit fixes the issue where the GPU backend was not working cor…

35bb50f

…rectly.

updates

da9312d

updated the GPU util metering and graphing capability

c852b27

final fixes.

036299e

Merge branch 'aristocratos:main' into feature/dcgm-gpu-backend

2e1b0c3

deckstose reviewed Dec 4, 2025

View reviewed changes

aristocratos added the ai generated Majority of included code is AI generated label Dec 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Feature/dcgm gpu backend #1391

Feature/dcgm gpu backend #1391

Uh oh!

AssimilatedCoder commented Dec 4, 2025

Uh oh!

deckstose left a comment

Uh oh!

aristocratos commented Dec 4, 2025

Uh oh!

deckstose commented Dec 4, 2025

Uh oh!

aristocratos commented Dec 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Feature/dcgm gpu backend #1391

Are you sure you want to change the base?

Feature/dcgm gpu backend #1391

Uh oh!

Conversation

AssimilatedCoder commented Dec 4, 2025

Uh oh!

deckstose left a comment

Choose a reason for hiding this comment

Uh oh!

aristocratos commented Dec 4, 2025

Uh oh!

deckstose commented Dec 4, 2025

Uh oh!

aristocratos commented Dec 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants