Skip to content

Commit eb125a5

Browse files
committed
Add ucTrace section
1 parent 947fbc5 commit eb125a5

File tree

2 files changed

+18
-0
lines changed

2 files changed

+18
-0
lines changed

docs/assets/numa.png

2.77 MB
Loading

docs/index.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -531,6 +531,7 @@ BeyondMoore Software Ecosystem
531531
**_Profiling Tools_**
532532
* [Snoopie](#SNOOPIE): A Multi-GPU Communication Profiler and Visualiser
533533
* [PES AMD vs Intel](#PRECISE-EVENT-SAMPLING): A Precise Event Sampling Benchmark Suite
534+
* [ucTrace](#UCTRACE): A Multi-Layer Profiling Tool for UCX-driven Communication
534535

535536
**_Algorithms and Applications_**
536537
* [aCG](#ACG): CPU- and GPU-initiated Communication Strategies for CG Methods
@@ -574,6 +575,23 @@ BeyondMoore Software Ecosystem
574575
</div>
575576
</div>
576577

578+
<div id="UCTRACE" class="h-auto bg-gray-100 rounded-s p-4 border-solid border-1 border-gray-200 flex flex-row justify-start items-start gap-5 transform transition-all hover:shadow-gray-100 hover:shadow-lg shadow-none">
579+
<div clas="flex flex-col justify-start">
580+
<div class="flex flex-row gap-2 justify-start items-center flex-shrink">
581+
<img width="32" src="./assets/git.webp" />
582+
<a href="https://github.com/ParCoreLab/ucTrace" class="text-xl font-semibold font-sans visited:text-teal-700">ucTrace: A Multi-Layer Profiling Tool for UCX-driven Communication</a>
583+
</div>
584+
<p class="text-lg">We introduce ucTrace, a novel profiler that exposes and visualizes UCX-driven communication in HPC environments. ucTrace provides insights into MPI workflows by profiling message passing at the UCX level, linking operations between hosts and devices (e.g., GPUs and NICs) directly to their originating MPI functions. Through interactive visualizations of process- and device-specific interactions, ucTrace helps system administrators, library and application developers optimize performance and debug communication patterns in large-scale workloads. We demonstrate ucTrace's features through a wide range of experiments including MPI point-to-point behavior under different UCX settings, Allreduce comparisons across MPI libraries, communication analysis of a linear solver, NUMA binding effects, and profiling of GROMACS MD simulations with GPU acceleration at scale.</p>
585+
<p>
586+
<a href="https://github.com/ParCoreLab/ucTrace" class="text-xl font-semibold font-sans visited:text-teal-700">More details and git repository of the project.</a>
587+
</p>
588+
</div>
589+
<div class="grid h-[100%] justify-center place-items-center">
590+
<img width="500px" src="./assets/numa.png" />
591+
</div>
592+
</div>
593+
594+
577595
<div id="MULTI-GPU-CALLBACKS" class="h-auto bg-gray-100 rounded-s p-4 border-solid border-1 border-gray-200 flex flex-row justify-start items-start gap-5 transform transition-all hover:shadow-gray-100 hover:shadow-lg shadow-none">
578596
<div clas="flex flex-col justify-start">
579597
<div class="flex flex-row gap-2 justify-start items-center flex-shrink">

0 commit comments

Comments
 (0)