GeoTransolver Guard 🛡️ #1544

Open
mnabian wants to merge 14 commits into main from GeoT_Guard

Conversation

Collaborator

@mnabian mnabian commented Apr 1, 2026

PhysicsNeMo Pull Request

Description

Problem

When users apply a pretrained GeoTransolver checkpoint to inputs outside the training distribution (e.g., running a DrivAerML-trained model on motorcycles or aircraft), the model silently produces unreliable predictions. There is no mechanism to detect or warn about out-of-distribution (OOD) inputs at inference time.

Solution

We add two lightweight OOD guards that integrate seamlessly into the existing training and inference workflow. Both guards are controlled by a single knob (guard_buffer_size) and require no additional scripts, calibration steps, or changes to the training loop.

  • During training: guards passively collect calibration data (zero impact on model predictions or gradients).
  • During inference: guards check inputs against the training distribution and emit warnings.warn() if OOD is detected.
  • At checkpoint save: the kNN threshold is automatically computed from collected data.
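To make the three hook points concrete, here is a minimal, hypothetical sketch of the lifecycle (plain Python, not the PhysicsNeMo API; the method names mirror `_guard_collect`, `_guard_check`, and the `state_dict()` override described under Implementation below, but the internals here are toy placeholders):

```python
import warnings

class GuardLifecycleSketch:
    """Illustrative sketch of when each guard hook runs. Toy 1-D logic only."""

    def __init__(self):
        self.training = True       # mirrors nn.Module.train()/eval()
        self.calibration = []      # stands in for the guard buffers
        self.threshold = None      # computed lazily at checkpoint save

    def train(self):
        self.training = True

    def eval(self):
        self.training = False

    def _guard_collect(self, x):
        # Passive calibration: record data, never alter the prediction.
        self.calibration.append(x)

    def _guard_check(self, x):
        # Warn-only check: the prediction is returned unchanged either way.
        if self.threshold is not None and abs(x) > self.threshold:
            warnings.warn("OOD Guard: input looks out-of-distribution")

    def forward(self, x):
        if self.training:
            self._guard_collect(x)   # during training
        else:
            self._guard_check(x)     # during inference
        return 2 * x                 # stand-in for the model's computation

    def state_dict(self):
        # At checkpoint save: derive the threshold from collected data.
        self.threshold = max(abs(v) for v in self.calibration)
        return {"threshold": self.threshold}
```

The key property this sketch illustrates is that the guard sits entirely on the side of `forward`: in `train()` mode it only records, in `eval()` mode it only warns, and the model output is identical in both cases.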

Guard 1: Global Parameters — Bounding Box

What it monitors: The raw global_embedding input tensor (e.g., air density, stream velocity for DrivAerML).

How it works:

  • During training, tracks per-dimension running min/max across all batches.
  • At inference, compares each dimension of the input against the stored bounds.
  • Emits a warning per dimension that falls outside the training range.

Why input space: Global parameters are low-dimensional scalars (2-3 dims). A bounding box is simpler, more interpretable, and more reliable than latent-space methods at this dimensionality.

Example warning:

OOD Guard: global_embedding dim 0 value 0.7500 below training min 0.9000
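The bounding-box logic above fits in a few lines. This is a numpy sketch under the stated design (running per-dimension min/max during training, one warning per out-of-range dimension at inference); function names and the toy 2-D data are illustrative, not the actual module:

```python
import warnings
import numpy as np

def update_bounds(gmin, gmax, batch):
    """Training side: fold a batch of global_embedding rows into running bounds."""
    return np.minimum(gmin, batch.min(axis=0)), np.maximum(gmax, batch.max(axis=0))

def check_bounds(gmin, gmax, x):
    """Inference side: emit one warning per dimension outside the training range."""
    for d, v in enumerate(x):
        if v < gmin[d]:
            warnings.warn(f"OOD Guard: global_embedding dim {d} value {v:.4f} "
                          f"below training min {gmin[d]:.4f}")
        elif v > gmax[d]:
            warnings.warn(f"OOD Guard: global_embedding dim {d} value {v:.4f} "
                          f"above training max {gmax[d]:.4f}")

# Calibrate on toy "training" batches of 2-D global parameters.
gmin = np.full(2, np.inf)
gmax = np.full(2, -np.inf)
for batch in (np.array([[0.9, 30.0], [1.1, 35.0]]),
              np.array([[1.0, 32.0], [1.05, 38.0]])):
    gmin, gmax = update_bounds(gmin, gmax, batch)

check_bounds(gmin, gmax, np.array([0.75, 33.0]))  # dim 0 below min -> warning
```

Starting the bounds at inf/-inf also matches the checkpoint-compatibility behavior described later: until training populates them, every finite input is trivially "in range" and the guard stays silent.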

Guard 2: Geometry Context — kNN Distance

The implementation is based on this paper: https://arxiv.org/pdf/2204.06507

What it monitors: The geometry context vector produced by GlobalContextBuilder.geometry_tokenizer -- a learned 32-dimensional representation of the input geometry, mean-pooled over attention heads and slices.

How it works:

  • During training, accumulates pooled geometry embeddings into a fixed-size FIFO rolling buffer.
  • At checkpoint save, computes a kNN distance threshold from the buffer (99th percentile of leave-one-out k-th nearest neighbor distances).
  • At inference, L2-normalizes the input embedding, computes its k-th nearest neighbor distance to the stored training embeddings, and warns if above threshold.
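A numpy stand-in for the two kNN steps (leave-one-out threshold calibration, then the inference-time check) may make the mechanics clearer. This is a sketch under the description above -- L2-normalized embeddings, Euclidean distance, 99th-percentile threshold -- not the torch implementation; function names are illustrative:

```python
import warnings
import numpy as np

def knn_threshold(embeddings, k=10, q=99.0):
    """q-th percentile of leave-one-out k-th-NN distances over the buffer."""
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    d = np.linalg.norm(e[:, None, :] - e[None, :, :], axis=-1)  # (n, n) pairwise
    np.fill_diagonal(d, np.inf)                                 # leave-one-out
    kth = np.sort(d, axis=1)[:, k - 1]                          # k-th NN per row
    return np.percentile(kth, q)

def knn_check(embeddings, x, threshold, k=10):
    """Warn if x's k-th-NN distance to the training embeddings exceeds threshold."""
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    xn = x / np.linalg.norm(x)
    kth = np.sort(np.linalg.norm(e - xn, axis=1))[k - 1]
    if kth > threshold:
        warnings.warn(f"OOD Guard: geometry kNN distance {kth:.4f} "
                      f"exceeds threshold {threshold:.4f}")
```

Because the embeddings are normalized onto the unit sphere, an in-distribution geometry lands inside the training cluster and its k-th-NN distance stays below the calibrated threshold, while a genuinely different geometry maps to a distant direction and triggers the warning.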

Why latent space: Geometry is a variable-size point cloud -- there is no fixed-dimensional input representation to bound. The post-ContextProjector embedding compresses geometry into a fixed 32-dim vector suitable for distance-based methods.

Why kNN:

  • Training set is small (~400 samples for DrivAerML). Covariance estimation (Mahalanobis) is unreliable.
  • No distributional assumption -- geometries may cluster multimodally.
  • No additional trainable components needed.

Why not monitor multi-scale local features:

  • Local features are derived from geometry via ball queries -- not an independent signal.
  • They dominate the context dimension (768/832 = 92%), drowning out other signals in a combined detector.
  • 768 dims with ~400 samples is a poor regime for kNN (curse of dimensionality).

Example warning:

[screenshot of the geometry kNN OOD warning]

Usage

Enabling guards

Add to model config (or pass to constructor):

guard_buffer_size: 500  # set to dataset size; null to disable
guard_knn_k: 10         # k for geometry kNN (optional, default 10)

Both guards are enabled when guard_buffer_size is set, and disabled when it is None.

Training

No changes to the training script. Guards collect data automatically during model.train() forward passes. The kNN threshold is computed automatically when the checkpoint is saved (via state_dict() override).

Inference

No changes to the inference script. Guards run automatically during model.eval() forward passes and emit Python warnings for OOD inputs.

Configuration

| Parameter | Default | Description |
|---|---|---|
| guard_buffer_size | None (disabled) | FIFO buffer size for geometry embeddings. Set to dataset size. |
| guard_knn_k | 10 | k for k-th nearest neighbor distance. Range 5-15 recommended. |

Threshold

The kNN threshold is set at the 99th percentile of training-set leave-one-out kNN distances. This targets a ~1% false-alarm rate on in-distribution data -- near-zero warnings on validation/test sets drawn from the training distribution.

Multi-GPU / DDP

  • Each rank maintains its own buffer independently -- no cross-rank communication.
  • The distributed sampler shuffles data each epoch, so after a few epochs each rank's FIFO buffer covers most of the training set.
  • Rank 0 saves the checkpoint; its buffer and threshold are what persist.
  • Recommendation: set guard_buffer_size >= dataset_size to ensure good coverage per rank.

Checkpoint Compatibility

  • Pre-guard checkpoint loaded into guard-enabled model: Guard buffers retain their initial values (inf/-inf). Guards remain inactive until training populates the buffers.
  • Guard checkpoint loaded into non-guard model: Extra guard_* keys in the state dict. Requires strict=False when loading.

Tests

This is currently tested on the Crash recipe with the bumper beam dataset. Inference on in-distribution test samples emits no OOD warnings. For new OOD samples -- with either OOD global parameters or geometry scaled by a small factor (1.05) -- the OOD warning is raised.

Implementation

Registered buffers

All guard state is stored as registered buffers (persistent, non-trainable):

| Buffer | Shape | Description |
|---|---|---|
| guard_global_min | (global_dim,) | Per-dimension training min |
| guard_global_max | (global_dim,) | Per-dimension training max |
| guard_geo_embeddings | (buffer_size, dim_head) | FIFO geometry embedding store |
| guard_geo_ptr | (1,) | FIFO write pointer |
| guard_knn_threshold | scalar | 99th percentile kNN distance |
| guard_knn_k | scalar | k for kNN |
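The FIFO write-pointer mechanics behind guard_geo_embeddings and guard_geo_ptr can be sketched with plain numpy arrays (illustrative only -- the real module keeps these as registered torch buffers so they persist in the state dict):

```python
import numpy as np

def fifo_append(buffer, ptr, batch):
    """Write a batch of embedding rows into a fixed-size rolling buffer.

    `buffer` is (buffer_size, dim); `ptr` is the total number of rows written
    so far. Once the buffer is full, new rows overwrite the oldest entries
    (FIFO wraparound via `ptr % buffer_size`).
    """
    n = len(buffer)
    for row in batch:
        buffer[ptr % n] = row
        ptr += 1
    return buffer, ptr

buf = np.zeros((4, 2))
ptr = 0
buf, ptr = fifo_append(buf, ptr, np.arange(6.0).reshape(3, 2))        # fills slots 0-2
buf, ptr = fifo_append(buf, ptr, np.arange(6.0, 12.0).reshape(3, 2))  # wraps to slot 0
```

This is also why the recommendation above is guard_buffer_size >= dataset_size: with a smaller buffer, wraparound means only the most recently seen samples remain at checkpoint time.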

Key methods

| Method | When called | Purpose |
|---|---|---|
| _guard_collect() | Every training forward pass | Update global bounds, append geometry embeddings to FIFO |
| _guard_check() | Every inference forward pass | Check bounds and kNN distance, emit warnings |
| compute_guard_threshold() | At state_dict() call (before save) | Compute 99th percentile kNN threshold from buffer |

Checklist

Dependencies

None

Contributor

greptile-apps Bot commented Apr 1, 2026

Greptile Summary

This PR adds two lightweight OOD guards to GeoTransolver — a per-dimension bounding box on global parameters and a kNN distance check on pooled geometry latents — that passively calibrate during training and warn at inference without changing model outputs or gradients. The OODGuard module itself is well-designed (FIFO buffer, lazy threshold, AMP dtype upcasting, torch.compiler.disable guards, shape validation), and the unit-test suite is thorough.

  • return_embedding_states is silently broken: geotransolver.py line 653 has an unconditional return x before the if return_embedding_states conditional (lines 655–657), making that branch unreachable dead code. Every caller passing return_embedding_states=True silently receives a plain tensor instead of the documented (output, embedding_states) tuple — this was a pre-existing feature that the refactor accidentally disabled.

Important Files Changed

| Filename | Overview |
|---|---|
| physicsnemo/experimental/guardrails/embedded/ood_guard.py | New OODGuard module — well-structured with correct FIFO, kNN threshold, shape validation, AMP upcasting, lazy threshold recomputation, and torch.compiler.disable guards. |
| physicsnemo/experimental/models/geotransolver/geotransolver.py | Guard integration is correct, but the forward method has dead code: the return_embedding_states branch is placed after an unconditional return x, making the documented embedding-states output feature completely non-functional. |
| physicsnemo/experimental/models/geotransolver/context_projector.py | Adds geometry_context_detached return from build_context for the OOD guard to consume without holding the backward graph; logic is correct. |
| test/experimental/guardrails/embedded/test_ood_guard.py | Comprehensive unit tests covering FIFO wrap, threshold calibration, OOD detection, AMP dtype handling, shape validation, sensitivity scaling, and state_dict round-trip. |
| test/models/geotransolver/test_geotransolver.py | Adds guard integration tests (attach/detach, training forward populates buffers, invalid config rejection); no test for return_embedding_states=True, which would have caught the dead-code bug. |

Reviews (2): Last reviewed commit: "Merge branch 'GeoT_Guard' of https://git..."

Collaborator Author

mnabian commented Apr 1, 2026

Note: Tests will be added after the initial review is done for the overall design and implementation of this guardrail.

@mnabian mnabian marked this pull request as draft April 7, 2026 17:37
@mnabian mnabian marked this pull request as ready for review April 21, 2026 22:46
Collaborator Author

mnabian commented Apr 21, 2026

/blossom-ci

Collaborator Author

mnabian commented Apr 21, 2026

/blossom-ci
