Write only lower triangle in _JTDAJ_sparse by adenzler-nvidia · Pull Request #1275 · google-deepmind/mujoco_warp

adenzler-nvidia · 2026-04-02T11:25:49Z

Summary

_JTDAJ_sparse was writing both triangles of the H matrix (two atomic_add per off-diagonal pair), but the Cholesky factorization only reads the lower triangle
Write a single entry to the lower triangle instead, halving atomic adds for off-diagonal elements

Benchmark

three_humanoids (8192 worlds, nconmax=100, njmax=192, RTX PRO 6000 Blackwell):

	Run 1	Run 2	Run 3	Mean
Before	972K	901K	904K	925K steps/s
After	996K	1073K	1073K	1047K steps/s
Delta				+13%

Solver convergence is unchanged (mean 2.784 iters, p95=5, 8192/8192 converged).

Nsight Systems kernel-level profile confirms _JTDAJ_sparse drops from 1,248ms to 731ms per 1000 steps (-41%).

Test plan

solver_test.py (40/40 passed)
smooth_test.py (59/59 passed)
forward_test.py (60/60 passed)

The H matrix assembly in _JTDAJ_sparse was writing both triangles (two atomic adds per off-diagonal pair), but the Cholesky factorization only reads the lower triangle. Write a single entry to the lower triangle instead, halving the number of atomic adds for off-diagonal elements.

thowell

nice!

adenzler-nvidia requested a review from thowell April 2, 2026 11:34

use min/max for single atomic add instead of branch

32acc1f

thowell approved these changes Apr 2, 2026

View reviewed changes

adenzler-nvidia merged commit 5253652 into google-deepmind:main Apr 2, 2026
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Write only lower triangle in _JTDAJ_sparse#1275

Write only lower triangle in _JTDAJ_sparse#1275
adenzler-nvidia merged 2 commits intogoogle-deepmind:mainfrom
adenzler-nvidia:adenzler/jtdaj-sparse-single-triangle

adenzler-nvidia commented Apr 2, 2026

Uh oh!

thowell left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

adenzler-nvidia commented Apr 2, 2026

Summary

Benchmark

Test plan

Uh oh!

thowell left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants