WIP: Replacing Static Dataflow Analysis with Reactive Caches #13948

hubertp · 2025-09-05T16:42:23Z

Pull Request Description

Initial implementation that replaces cache invalidation logic and static dataflow analysis with Reactive caches.

A lot of code is commented out, as it either needs to be redone or is simply obsolete in the new implementation.
There will still be a lot of changes, but the basic functionality of loading and executing projects and executing visualizations works.
Invalidation of only the necessary nodes works, by replacing static dataflow analysis with runtime analysis that collects dependencies between UUIDs.
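
To illustrate the idea of collecting dependencies at runtime rather than deriving them statically, here is a minimal sketch. It is not the code in this PR: `DependencyTracker`, `evaluate`, `invalidate` and `isCached` are hypothetical names, and the real implementation works on Truffle nodes and instrumentation rather than plain suppliers. The sketch only assumes that every cached expression is keyed by a `UUID` and that a read of one expression during the evaluation of another counts as a dependency.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import java.util.UUID;
import java.util.function.Supplier;

/** Hypothetical sketch of runtime dependency tracking; not the PR's actual classes. */
final class DependencyTracker {
  // Cached values keyed by expression id.
  private final Map<UUID, Object> cache = new HashMap<>();
  // dependents.get(a) holds the ids whose evaluation read the value of a.
  private final Map<UUID, Set<UUID>> dependents = new HashMap<>();
  // Ids currently being evaluated, innermost on top.
  private final Deque<UUID> evaluating = new ArrayDeque<>();

  /** Returns the cached value of id, computing it on a miss, and records who read it. */
  Object evaluate(UUID id, Supplier<Object> compute) {
    UUID reader = evaluating.peek();
    if (reader != null) {
      // The expression on top of the stack depends on id.
      dependents.computeIfAbsent(id, k -> new HashSet<>()).add(reader);
    }
    if (cache.containsKey(id)) {
      return cache.get(id);
    }
    evaluating.push(id);
    try {
      Object value = compute.get();
      cache.put(id, value);
      return value;
    } finally {
      evaluating.pop();
    }
  }

  /** Drops id and everything that transitively read it; returns the dropped ids. */
  Set<UUID> invalidate(UUID id) {
    Set<UUID> dropped = new LinkedHashSet<>();
    Deque<UUID> work = new ArrayDeque<>();
    work.push(id);
    while (!work.isEmpty()) {
      UUID current = work.pop();
      if (dropped.add(current)) {
        cache.remove(current);
        Set<UUID> readers = dependents.remove(current);
        if (readers != null) {
          readers.forEach(work::push);
        }
      }
    }
    return dropped;
  }

  boolean isCached(UUID id) {
    return cache.containsKey(id);
  }
}
```

In this model, invalidating an expression drops only the entries that actually read it during execution, while unrelated cached values stay in place.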

Depends on #13907. Addresses #10525, renders #13219 obsolete.

Important Notes

Checklist

Please ensure that the following checklist has been satisfied before submitting the PR:

- The documentation has been updated, if necessary.
- Screenshots/screencasts have been attached, if there are any visual changes. For interactive or animated visual changes, a screencast is preferred.
- All code follows the Scala, Java, TypeScript, and Rust style guides. In case you are using a language not listed above, follow the Rust style guide.
- Unit tests have been written where possible.
- If meaningful changes were made to logic or tests affecting Enso Cloud integration in the libraries, or the Snowflake database integration, a run of the Extra Tests has been scheduled.
  - If applicable, it is suggested to paste a link to a successful run of the Extra Tests.

The remaining problems relate to the fact that we are caching all expressions. That conflicts with the current `enterables` logic for entering functions from a local call stack. The latter will need a more involved rewrite.

JaroslavTulach

Inception Comments

I am glad reactive caches are shaping up. My biggest wish is to separate the Observable & co. into its own project, provide a clear API and documentation, and test it in isolation.

engine/runtime-instrument-common/src/main/java/org/enso/interpreter/instrument/Observable.java

engine/runtime-compiler/src/main/java/org/enso/compiler/pass/analyse/FramePointer.java

engine/runtime/src/main/java/org/enso/interpreter/EnsoLanguage.java

engine/polyglot-api/src/main/java/org/enso/polyglot/debugger/IdExecutionService.java

In order to be able to track dependencies between nodes, even when no external `UUID` is available, I had to allow two types of Observables. Similarly, external and internal `UUID`s are now easily distinguishable thanks to the `RuntimeID` wrapper. Currently some tracking logic is inside the nodes themselves. That's obviously wrong: it should all be in the instrumentation. A follow-up change will move the logic. The current approach is mostly for achieving a minimal viable proof that the approach will work.
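
As an illustration of how a wrapper can make external and internal identifiers distinguishable, here is a small sketch. It is only a guess at the general shape: the type name `RuntimeIdSketch`, its `External` and `Internal` variants, and the `of` factory are all hypothetical, and the actual `RuntimeID` in this PR may look quite different. The sketch assumes Java 17 or later for sealed interfaces and records.

```java
import java.util.UUID;

/** Hypothetical sketch; the PR's actual RuntimeID may be shaped differently. */
sealed interface RuntimeIdSketch permits RuntimeIdSketch.External, RuntimeIdSketch.Internal {
  UUID uuid();

  /** Identifier that was assigned from outside the engine. */
  record External(UUID uuid) implements RuntimeIdSketch {}

  /** Identifier generated for an expression that has no external UUID. */
  record Internal(UUID uuid) implements RuntimeIdSketch {}

  /** Wraps an external UUID if present, otherwise generates an internal one. */
  static RuntimeIdSketch of(UUID externalOrNull) {
    return externalOrNull != null
        ? new External(externalOrNull)
        : new Internal(UUID.randomUUID());
  }

  default boolean isExternal() {
    return this instanceof External;
  }
}
```

Because the two variants are distinct record types, an external and an internal identifier never compare equal, even if they happen to wrap the same `UUID`.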

Mostly cosmetic changes due to the fact that we now invalidate less. But two prominent changes are included:

- invalidation logic when a Recompute request comes
- fixed tracking of method calls/closure execution

The latter uses enter/exit logic in `RuntimeAnalysis` in `InvokeCallableNode`, which is correct but undesirable. The information should be gathered in instrumentation instead. Explicit calls are easier to experiment with and will be transferred to instrumentation eventually.

Had to ensure that dependencies are being tracked correctly across function call boundaries. This is problematic due to the fact that local calls are being done with their own RuntimeCaches; any invalidation has to ensure that it is capable of crossing that boundary. Additionally, had to ensure that the self argument is properly tracked/invalidated, or changes within the functions would not be translated into Truffle nodes. The logic is still experimental, and elements of RuntimeAnalysis will need to move to instrumentation for performance reasons.

The exception was being propagated, unintentionally it seems, and showing up in logs.

Spurious method/self arg invalidation leading to large and unnecessary re-compilations.

Unresolved constructors are being resolved in synthetic constructs. Sadly, those constructs, when inside closures, share the original IDs and therefore create false dependencies between nodes. As we still need the original IDs for instrumentation and expression updates, I introduced a flag that disables runtime tracking for expressions.

Not yet complete but already found a few corner cases not covered in the new design.

Extra diagnostics were lost when computation failed at an early CompletionStage.
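
For context on why diagnostics can get lost: when a `CompletableFuture` stage fails, dependent stages and `join()` report the failure wrapped in a `CompletionException`, so the original exception is only reachable through `getCause()`. The following is a generic sketch of unwrapping, using only standard `java.util.concurrent` classes; the helper names `Unwrap`, `rootFailure` and `describeFailure` are hypothetical and not taken from this PR.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;
import java.util.concurrent.ExecutionException;

/** Hypothetical helper; illustrates the general technique only. */
final class Unwrap {
  private Unwrap() {}

  /** Strips CompletionException/ExecutionException wrappers to reach the original failure. */
  static Throwable rootFailure(Throwable t) {
    Throwable current = t;
    while ((current instanceof CompletionException || current instanceof ExecutionException)
        && current.getCause() != null) {
      current = current.getCause();
    }
    return current;
  }

  /** Fails at the first stage and reports the message of the original exception. */
  static String describeFailure() {
    CompletableFuture<Integer> failing =
        CompletableFuture.<Integer>failedFuture(new IllegalStateException("stage 1 failed"))
            .thenApply(x -> x + 1); // never runs; the failure is propagated wrapped
    try {
      failing.join();
      return "no failure";
    } catch (CompletionException e) {
      return rootFailure(e).getMessage();
    }
  }
}
```

Without the unwrapping step, a handler that inspects only the outer exception sees a `CompletionException` and misses the type and details of the underlying failure.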

Panics aren't cached but visualizations can still be run on them. Had to add another method to Observable to make it happen.

Have to keep track of arguments applied to the visualization and map them to synthetic assignment constructs in visualization code to get the right identifiers.

If a new external identifier is added within an expression that is already cached, the latter should be invalidated. Fixes a number of broken visualization tests.

When a text edit is performed on code that is being called in the visualization, a full re-evaluation is needed in order to generate updated Truffle nodes. Decided not to support that case, as it is very unlikely to be needed in the foreseeable future. A text change will invalidate the necessary observers, as one would expect, but it will not trigger re-evaluation. Instead, one has to explicitly make a `ModifyVisualization` request. This seems like a good compromise, as evaluations of visualization expressions are very costly (also due to locking).

Workarounds for Stackoverflow issues + IdMap updates causing full program invalidations. This change breaks 1 test in RuntimeVisualizationTests that will be addressed separately.

With runtime tracking we are able to precisely discover which cached values need to be invalidated. Unfortunately, excessive use of external UUIDs (and therefore caching) by the GUI revealed a flaw in that logic with respect to local variable access. Whenever a local variable is accessed, it reads a previously written value from a frame. But if the assignment has been cached, then the specific slot in the frame is not written during successive executions. This means that adding new expressions that make use of previously cached local variables would report uninitialized value errors. This change works around this by **never** caching assignments. Assignments should be fast if the RHS is already cached, and the GUI appears to always assign external UUIDs to the RHS. That way any read from a frame's slot will succeed. Also reverted to the old tree building in Changeset, as it appeared to be sufficient for our needs.
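
The frame-slot problem described above can be reproduced in a toy model. The sketch below is not Enso code: `FrameModel`, its string ids and its methods are hypothetical, and a plain `Object[]` stands in for a Truffle frame. It assumes that each execution gets a fresh frame while the cache survives between executions.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

/** Hypothetical model of a frame with slots plus an expression cache. */
final class FrameModel {
  private final Map<String, Object> cache = new HashMap<>();
  private Object[] frame = new Object[1];

  /** Starts a new execution: a fresh frame, but the cache survives. */
  void newExecution() {
    frame = new Object[frame.length];
  }

  private Object cached(String id, Supplier<Object> compute) {
    if (!cache.containsKey(id)) {
      cache.put(id, compute.get());
    }
    return cache.get(id);
  }

  /** Flawed variant: the whole assignment is cached, so a cache hit skips the slot write. */
  void assignCached(String assignmentId, int slot, Supplier<Object> rhs) {
    cached(assignmentId, () -> {
      Object value = rhs.get();
      frame[slot] = value;
      return value;
    });
  }

  /** Workaround: only the RHS is cached; the slot write always runs. */
  void assignUncached(String rhsId, int slot, Supplier<Object> rhs) {
    frame[slot] = cached(rhsId, rhs);
  }

  /** Reads a local variable, failing if its slot was never written in this execution. */
  Object readLocal(int slot) {
    if (frame[slot] == null) {
      throw new IllegalStateException("uninitialized value");
    }
    return frame[slot];
  }
}
```

With `assignCached`, a second execution hits the cache, never writes the slot, and a later `readLocal` fails. With `assignUncached`, the cheap slot write is repeated on every execution while the potentially expensive RHS still comes from the cache.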

hubertp mentioned this pull request Sep 8, 2025

Evaluate visualizations in parallel using reactive observers #13219

Closed

10 tasks

hubertp added the `CI: Clean build required` label Sep 8, 2025

Fix some issues when pushing context

c8bd2e4

JaroslavTulach reviewed Sep 15, 2025

enso-bot bot mentioned this pull request Sep 17, 2025

Moving >, >=, <, <= to types where such operators make sense #14017

Merged

5 tasks

hubertp added 17 commits September 22, 2025 13:20

Stop RejectedExecutionException from showing up

0ca0cec

Test illustrating issues in a larger project

6b60719

Skip tracking for synthetic node

894e71b

Fixing issues in RuntimeVisualizationsTest

964b850

Make sure CompletionExceptions are unwrapped

cd449ec

Show visualizations for Panics

3213150

Fix a number of RuntimeVisualizationTest cases

d364e2c

Ensure IdMap updates can trigger invalidation

1c59be0

No bug, just a broken test

0bcbeeb

Fix some regressions

9b52be1

Renaming

3aead5e
