Fix lambdify cache-miss on evaluate() (closes #194) + replace flaky timing tests by lmoresi · Pull Request #193 · underworldcode/underworld3

lmoresi · 2026-05-18T12:36:15Z

Summary

Two coupled changes in the lambdify-evaluate caching area:

Fixes uw.function.evaluate() never hits the lambdify cache (fresh sympy.Dummy per call → +1 cache entry every call) #194 — uw.function.evaluate() never hit the lambdify cache. _expr_hash used sympy.srepr(expr), which embeds the volatile global dummy_index of any sympy.Dummy. The evaluate() coordinate-substitution path mints a fresh Dummy per call, so an otherwise-identical expression hashed differently every call → _lambdify_cache grew one entry per call (1,2,3,4,…) and sympy.lambdify recompiled every time on a hot path.

Fix: canonicalise Dummy → Symbol(d.name) in _expr_hash before srepr. This changes only the cache key; the real sympy.lambdify() call still uses the original expr/symbols, so numerics are unchanged. The cache key separately carries the symbol-name tuple, so name-keying is safe and deterministic.
Replaces the flaky wall-clock tests in TestPerformanceExpectations (the original motivation). test_lambdify_caching's assert time2 <= time1*2 was failing CI on unrelated PRs (e.g. Refined-submesh (coarse/fine companion) investigation #192) from pure runner jitter, and its loose tolerance was masking bug uw.function.evaluate() never hits the lambdify cache (fresh sympy.Dummy per call → +1 cache entry every call) #194.

Tests

test_lambdify_cache_hit — exercises get_cached_lambdified directly: identical request returns the same object, no new entry; distinct expression cached separately. Deterministic, timing-free.
test_evaluate_cache_stable_across_calls — uw.function.evaluate() never hits the lambdify cache (fresh sympy.Dummy per call → +1 cache entry every call) #194 regression guard: repeated identical evaluate() holds _lambdify_cache size flat ([1,1,1,…], was [1,2,3,…]) and results identical across calls. Aggregate cache behaviour, no wall-clock.

Validation

Cache size across 7 identical evaluate() calls: [1,1,1,1,1,1,1] (before fix: [1,2,3,4,5,6,7]).
tests/test_0720_lambdify_optimization_paths.py — 19 passed, 1 pre-existing unrelated xpass.
tests/test_0501_integrals.py smoke — 9 passed, 3 pre-existing xfail (unrelated CellWiseIntegral Perf: clone DM in Integral.evaluate() (3× speedup; closes #171 narrative) #172/Revert PR #172 (integrals regression) + xfail CellWiseIntegral parity tests #174). No numerical regression from the cache-key change.

Test plan

TestPerformanceExpectations — deterministic, no timing
Full test_0720 module — 19 passed
test_0501_integrals safety smoke — 9 passed
uw.function.evaluate() never hits the lambdify cache (fresh sympy.Dummy per call → +1 cache entry every call) #194 reproducer now flat
CI green on this branch

Closes #194.

Underworld development team with AI support from Claude Code

test_lambdify_caching and test_rbf_false_not_slow asserted one-off wall-clock timings (time2 <= time1*2, elapsed < 0.01s). These are inherently flaky on shared CI runners and test_lambdify_caching has been failing CI on unrelated PRs (0.0107s vs 0.0049s runner jitter). Replaced with deterministic, timing-free behaviour tests: - test_lambdify_cache_hit: exercises get_cached_lambdified directly -- identical request returns the SAME function object and adds no entry (true cache hit); a distinct expression is cached separately. This is the cache mechanism's actual contract. - test_rbf_modes_consistent: rbf=True and rbf=False must agree for a pure-sympy expression (meaningful, timing-free; replaces the "rbf=False not slow" wall-clock check). Note: writing the cache-hit test surfaced that the high-level uw.function.evaluate() path does NOT hit the lambdify cache on repeated identical calls -- _expr_hash(srepr) differs every call, so the cache grows by one entry per call and never returns a hit. The old loose timing tolerance was masking this. Not fixed here (evaluate/lambdify is a performance-critical hot path needing separate benchmarking); see PR description. Full module: 19 passed. Underworld development team with AI support from Claude Code

Copilot

Pull request overview

This PR replaces flaky wall-clock assertions in lambdify optimization tests with deterministic behavioral checks for cache reuse and evaluation consistency.

Changes:

Replaces timing-based lambdify cache test with a direct get_cached_lambdified cache-contract test.
Replaces rbf=False timing assertion with a consistency check between rbf=True and rbf=False.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

_expr_hash used sympy.srepr(expr), which embeds the volatile global dummy_index of any sympy.Dummy. The evaluate() coordinate-substitution path mints a fresh Dummy per call, so an otherwise-identical expression hashed differently every call and the lambdify cache never matched -- _lambdify_cache grew one entry per call (1,2,3,4,...) and sympy.lambdify recompiled every time on a hot path. Fix: canonicalise Dummy -> name-stable Symbol in _expr_hash before srepr. This changes only the cache *key*; the real sympy.lambdify() call still uses the original expr/symbols, so numerics are unchanged. The cache key separately carries the symbol-name tuple, so name-keying is safe and deterministic. Verified: repeated identical evaluate() now holds cache size flat ([1,1,1,...] vs [1,2,3,...] before). test_0720 module 19 passed; test_0501_integrals 9 passed / 3 pre-existing xfail (unrelated CellWiseIntegral #172/#174). test_evaluate_cache_stable_across_calls added as the #194 regression guard (aggregate cache behaviour + result-consistency, no wall-clock). Underworld development team with AI support from Claude Code

Copilot AI review requested due to automatic review settings May 18, 2026 12:36

Copilot started reviewing on behalf of lmoresi May 18, 2026 12:36 View session

Copilot AI reviewed May 18, 2026

View reviewed changes

lmoresi mentioned this pull request May 18, 2026

uw.function.evaluate() never hits the lambdify cache (fresh sympy.Dummy per call → +1 cache entry every call) #194

Open

lmoresi changed the title ~~Replace flaky wall-clock lambdify tests with cache-contract tests~~ Fix lambdify cache-miss on evaluate() (closes #194) + replace flaky timing tests May 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix lambdify cache-miss on evaluate() (closes #194) + replace flaky timing tests#193

Fix lambdify cache-miss on evaluate() (closes #194) + replace flaky timing tests#193
lmoresi wants to merge 2 commits into
developmentfrom
bugfix/lambdify-caching-cachehit

lmoresi commented May 18, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lmoresi commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tests

Validation

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lmoresi commented May 18, 2026 •

edited

Loading