Add benchmark utility to profile peak memory usage #16814

ding-young · 2025-07-18T07:21:50Z

Which issue does this PR close?

Closes Support memory profiling in benchmarks #16720 .

Rationale for this change

What changes are included in this PR?

In each benchmark (clickbench, tpch, sort-tpch, imdb, h2o), we now call print_memory_stats(); to print memory usage statistics via mimalloc. (This only works when compiled with --features mimalloc_extended.)
A new utility mem_profile is added. It builds dfbench with the mimalloc_extended feature enabled, and then runs each benchmark query in a separate subprocess to collect memory stats. The utility captures the subprocess’s stdout and summarizes the results for all queries.

Are these changes tested?

Yes. Via

// Run all queries 
cargo run --profile release-nonlto --bin mem_profile -- tpch --path benchmarks/data/tpch_sf1 --partitions 4 --format parquet
cargo run --profile release-nonlto --bin mem_profile -- h2o
cargo run --profile release-nonlto --bin mem_profile -- clickbench
cargo run --profile release-nonlto --bin mem_profile -- imdb --path benchmarks/data/imdb/
cargo run --profile release-nonlto --bin mem_profile -- sort-tpch --path benchmarks/data/tpch_sf1 --partitions 4

and

// Run specific query
cargo run --profile release-nonlto --bin mem_profile -- tpch --path benchmarks/data/tpch_sf1 --partitions 4 --format parquet --query 1

Are there any user-facing changes?

ding-young · 2025-07-25T03:19:15Z

benchmarks/README.md

+# Profiling Memory Stats for each benchmark query
+The `mem_profile` program wraps benchmark execution to measure memory usage statistics, such as peak RSS. It runs each benchmark query in a separate subprocess, capturing the child process’s stdout to print structured output.
+
+Subcommands supported by mem_profile are the subset of those in `dfbench`.
+Currently supported benchmarks include: Clickbench, H2o, Imdb, SortTpch, Tpch
+
+Before running benchmarks, `mem_profile` automatically compiles the benchmark binary (`dfbench`) using `cargo build` with the same cargo profile (e.g., --release) as mem_profile itself. By prebuilding the binary and running each query in a separate process, we can ensure accurate memory statistics.
+
+Currently, `mem_profile` only supports `mimalloc` as the memory allocator, since it relies on `mimalloc`'s API to collect memory statistics.
+
+Because it runs the compiled binary directly from the target directory, make sure your working directory is the top-level datafusion/ directory, where the target/ is also located. 


Here's more description about this utility and supported metrics.

ding-young · 2025-07-25T03:20:31Z

@2010YOUY01 This is ready for review :) I would love to hear your feedback.

ding-young force-pushed the memory-profiling branch from 62737d3 to aec3191 Compare July 18, 2025 07:34

ding-young mentioned this pull request Jul 25, 2025

Benchmark: Add micro-benchmark for Nested Loop Join operator #16819

Open

ding-young added 4 commits July 25, 2025 02:46

add benchmark utility to profile memory usage

5de82e5

get memory stats from mimalloc, not procfs

d9f45d0

support more benchmarks

3717b99

update benchmarks/README and refactor

5d8b177

ding-young force-pushed the memory-profiling branch from 52ffcdc to 5d8b177 Compare July 25, 2025 02:57

fix sort-tpch output format & taplo format

29cd822

ding-young commented Jul 25, 2025

View reviewed changes

ding-young marked this pull request as ready for review July 25, 2025 03:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add benchmark utility to profile peak memory usage #16814

Add benchmark utility to profile peak memory usage #16814

Uh oh!

ding-young commented Jul 18, 2025 •

edited

Loading

Uh oh!

ding-young Jul 25, 2025

Uh oh!

ding-young commented Jul 25, 2025

Uh oh!

Uh oh!

Add benchmark utility to profile peak memory usage #16814

Are you sure you want to change the base?

Add benchmark utility to profile peak memory usage #16814

Uh oh!

Conversation

ding-young commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

ding-young Jul 25, 2025

Choose a reason for hiding this comment

Uh oh!

ding-young commented Jul 25, 2025

Uh oh!

Uh oh!

ding-young commented Jul 18, 2025 •

edited

Loading