PERF: delay preset sync until tab completion / attribute access, track file desync by tangkong · Pull Request #1317 · pcdshub/pcdsdevices

tangkong · 2025-01-11T00:33:35Z

Description

Delays preset file reading until tab-completion has been attempted. Stashes this information on the device instance itself. This is not the default behavior, and you must opt into this by supplying the optional argument defer_loading to setup_presets_paths

Also tracks the modification time of the preset file on sync, making the Preset instance the source of truth to see if the presets need to be synced.

Motivation and Context

Presets have taken a very long time to load recently, and the first place to look is often file i/o. This delays this as long as possible while attempting to preserve the behavior hutch scientists use: tab completion.

This has also been made optional. Both setup_presets_paths and Presets.sync() now take an optional defer_loading argument, that if true will skip the preset data loading step

In a nutshell:

the device's Preset instance tracks whether the preset is out of sync with the files. This is done by tracking file modification times
The device checks if a sync is necessary, and if so syncs with the files when:
- __dir__ is called (during tab-completion)
- __getattribute__ is called with a preset-related prefix (wm_, umv_, mv_)

Presets.sync() is also called on device init, since it's part of Presets.__init__. This happens both on device-creation and setup_presets_path, though on device-creation we don't do any file loading. This is a ~0.013s double-hit, split between the device load and presets load sections for FltMvInterface devices

How Has This Been Tested?

Tests pass

Tested interactively through hutch-python, loading some subset of 559 xcs presets

this PR: 7.53s (0.01345s / preset)
current master: 16.21s (0.0289s / preset)

There's some variation but I'm not going to do a battery of tests here

Why isn't this just 0 s?

There's still some path access going on in hutch-python, it's just stopping short of actually opening the presets file.

Where Has This Been Documented?

This PR, a stray comment

Pre-merge checklist

Code works interactively
Code contains descriptive docstrings, including context and API
New/changed functions and methods are covered in the test suite where possible
Test suite passes locally
Test suite passes on GitHub Actions
Ran docs/pre-release-notes.sh and created a pre-release documentation page
Pre-release docs include context, functional descriptions, and contributors as appropriate

tangkong · 2025-01-13T19:16:53Z

I did some more digging (manual profiling because I'm dumb) as to what causes the remaining add-methods to take so long, and I think it boils down to stat() calls having awful performance. I'll do some more digging but I do think this is a contribution to be considered separately

ZLLentz

I like this and have a lot of opinions about presets in general.

stat() calls having awful performance

I agree this is a problem on NFS/WEKA.
Probably the main way for us to mitigate this would be reworking presets to use fewer files, but that has its own annoyances.

pcdsdevices/interface.py

ZLLentz · 2025-01-13T19:26:15Z

pcdsdevices/interface.py

    def __dir__(self):
+        if not self._tab_initialized and hasattr(self, 'presets'):
+            self._tab_initialized = True
+            self.presets.sync()


Change/addition request: we need to identify other times where presets may need a late sync. For example, if someone is expecting presets to be available in a hutch-python function call they need a clear way to trigger this without tabbing (or we need to load them automatically).

The device.presets.sync() call is still available, and seems to be how we suggest people sync presets across sections. I imagine we haven't advertised this as thoroughly as necessary.

I guess we could overwrite __getattribute__ and catch attempts to access mv_*, umv_*, and wm_* methods, but are there any other access modes we can think of?

Is it possible that doing this on __getattribute__ is actually most correct to make sure the loaded values are always up-to-date?

My one qualm with this is that it does break the already fragile composition abstraction barrier we've set up with the Presets module. Maybe there's also a way around this.

Back to the mines!

pcdsdevices/interface.py

…ning if sync is needed, use Preset.sync_needed to determine if Presets should be updated

…tialization

tangkong · 2025-01-14T00:34:16Z

Updated with some rather significant changes, now we store the last file modification times with every sync() call, and use that to decide whether or not to update. Also added some tests to codify this

ZLLentz · 2025-01-14T00:41:36Z

I think at least one of your commits has not been pushed

tangkong · 2025-01-14T00:42:19Z

I blame github

tangkong · 2025-01-14T00:48:44Z

Fingers crossed that we don't see race conditions like in #1050 , and the resulting #1055

ZLLentz

I like how this is implemented and I like how it speeds up the startup preset loading by 2x

ZLLentz · 2025-01-14T01:03:53Z

pcdsdevices/tests/test_interface.py

+    # deferred_fast_motor_preset must come last,
+    # to clear cache after motor is created (and sync-ed at init)
+    assert fast_motor.presets.sync_needed
+    fast_motor.__dir__()  # mimic tab completion request


Nitpick: is dir(fast_motor) equivalent?

I believe so, do we prefer that to the dunder-access? I used __dir__ just to match the code implementation.

janeliu-slac · 2025-01-14T01:36:48Z

First time seeing updates to pcdsdevices, so probably don't have good feedback this time. Are presets for device settings? Why does preset file reading need to be delayed until tab completion has been attempted?

tangkong · 2025-01-14T16:12:52Z

Presets are basically position shortcuts that users can save/load. They're dynamically added to the devices attributes, and are backed by simple yaml files.
https://pcdshub.github.io/pcdsdevices/v8.7.0/presets.html

Because each device is backed by a file, we end up opening and reading a lot of files in some hutch-python sessions. So instead of loading them all at once, we defer loading until the user actually needs it

ZLLentz · 2025-01-14T18:21:20Z

pcdsdevices/interface.py

        return state

+    @property
+    def sync_needed(self) -> bool:


One last tiny tiny nitpick: this is a property that touches the filesystem, usually I prefer things that may take time to be function calls. Usually we expect attribute access to be fast, so if they are not fast we can accidentally write code with performance issues.

A good point. I'll do that

…ns, not properties

ZLLentz

I'm excited for the speedup

tangkong added 2 commits January 10, 2025 16:05

PERF: delay preset sync until tab completion is accessed

5acb0aa

MNT: make deferred loading of presets optional

61fe019

tangkong requested review from ZLLentz and janeliu-slac January 13, 2025 17:31

ZLLentz reviewed Jan 13, 2025

View reviewed changes

tangkong added 5 commits January 13, 2025 12:00

MNT: more explicit setup_preset_paths signature

3acf464

DDOC: fix up docstring for setup_preset_paths

6ef9ad1

MNT: move sync logic to FltMvInterface, also sync on appropriate getattr

fa3a633

ENH/REF: store file modification times on Preset instance for determi…

0c5173b

…ning if sync is needed, use Preset.sync_needed to determine if Presets should be updated

TST: add tests for Preset desync, tab-initialization, and getattr-ini…

c52e9ff

…tialization

tangkong requested a review from ZLLentz January 14, 2025 00:32

ZLLentz previously approved these changes Jan 14, 2025

View reviewed changes

tangkong changed the title ~~PERF: delay preset sync until tab completion is accessed~~ PERF: delay preset sync until tab completion / attribute access, track file desync Jan 14, 2025

DOC: pre-release notes

41b5439

tangkong mentioned this pull request Jan 14, 2025

PERF: delay preset loading for hutch-python pcdshub/hutch-python#394

Merged

7 tasks

tangkong dismissed ZLLentz’s stale review via 41b5439 January 14, 2025 16:13

tangkong requested a review from ZLLentz January 14, 2025 16:13

ZLLentz previously approved these changes Jan 14, 2025

View reviewed changes

MNT: methods that perform longer actions (file access) should functio…

f28490d

…ns, not properties

tangkong dismissed ZLLentz’s stale review via f28490d January 14, 2025 19:22

tangkong requested a review from ZLLentz January 14, 2025 19:22

ZLLentz approved these changes Jan 14, 2025

View reviewed changes

tangkong merged commit bec57ae into pcdshub:master Jan 14, 2025
11 checks passed

Conversation

tangkong commented Jan 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

How Has This Been Tested?

Why isn't this just 0 s?

Where Has This Been Documented?

Pre-merge checklist

Uh oh!

tangkong commented Jan 13, 2025

Uh oh!

ZLLentz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tangkong commented Jan 14, 2025

Uh oh!

ZLLentz commented Jan 14, 2025

Uh oh!

tangkong commented Jan 14, 2025

Uh oh!

tangkong commented Jan 14, 2025

Uh oh!

ZLLentz left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tangkong Jan 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

janeliu-slac commented Jan 14, 2025

Uh oh!

tangkong commented Jan 14, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ZLLentz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tangkong commented Jan 11, 2025 •

edited

Loading

tangkong Jan 14, 2025 •

edited

Loading