
Conversation

@PointKernel
Member

This PR adds a best practices guide for NVBench, providing code examples to help users quickly get started and conduct effective performance comparisons in real-world scenarios.

@oleksandr-pavlyk
Collaborator

@PointKernel I like the document as a friendly "getting started guide".

I would expect the best practices guide to provide answers to:

  • nvbench vs. NCU
  • should I or should I not lock GPU frequency
  • should I use a single kernel in the launchable lambda per benchmark, or can I use multiple kernels
  • how to do performance tuning, with a kernel example and reasoning for arriving at an optimal choice of parameters

So perhaps the document could be renamed; otherwise it looks good to me.

@PointKernel
Member Author

PointKernel commented Oct 2, 2025

Regarding the comparison between nvbench and NCU, could you elaborate a bit on what kind of information would be most useful for users? From my perspective, nvbench primarily gathers runtime data and basic metrics, which feels more similar to NSYS, whereas NCU provides deeper kernel-level profiling with detailed hardware utilization insights.

Yes, I should have been clearer. What I had in mind was the difference between the kernel runtime estimates reported by NCU and by nvbench. I was recently bitten by a discrepancy in timings caused by NCU locking the GPU frequency while timing, which nvbench does not do. I needed to pass NCU the --clock-control none option to get its timings to agree with those reported by nvbench.

While NCU may have other reasons to lock the frequency (for example, to make sampling-based estimates such as stall rates more accurate), the GPU Mode talk https://www.youtube.com/watch?v=CtrqBmYtSEk by @gevtushenko makes the point that timings obtained with a locked frequency may not be representative of real-world kernel performance.
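For reference, a minimal sketch of the two NCU invocations discussed above; ./my_benchmark is a placeholder name for any NVBench executable, not a file from this PR:

```shell
# Default behavior: NCU locks GPU clocks to the base frequency while
# profiling, so its kernel timings can disagree with nvbench's.
ncu ./my_benchmark

# Disable NCU's clock control so its timings are taken at whatever
# frequency the GPU actually runs at, matching what nvbench reports
# (nvbench does not lock clocks by default):
ncu --clock-control none ./my_benchmark
```

This is a CLI fragment rather than a runnable script; the key point is simply that the clock-locking policies of the two tools must agree before their timings can be compared.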
