Improve `xarray` display of structural components for multivariate time series #555

AlexAndorra · 2025-07-31T22:15:30Z

Closes #545

Currently extract_components_from_idata returns square brackets notations for the state coord, making it hard to select components when working with multivariate series. Things like:

state = [
            "trend[level[gdp]]",
            "trend[trend[gdp]]",
            "trend[level[unemployment]]",
            "trend[trend[unemployment]]",
            "ar[gdp]",
            "ar[unemployment]",
]

This PR adds a restructure argument to extract_components_from_idata (default False for backwards compatibility). When True, it will restructure the state coordinates as a multi-index for easier component selection, thus enabling selections like idata.sel(component='level'), idata.sel(observed='gdp'), or even idata.sel(component='level', observed='gdp').
Again, this is especially useful for multivariate models with multiple observed states.

More precisely, the state dimension is broken down into two new ones, component and observed, whose coordinates will be [('level', 'gdp'), ('trend', 'gdp'), ('ar', 'gdp')], [('level', 'unemployment'), ('trend', 'unemployment'), ('ar', 'unemployment')] .

This also allows each observed state to have arbitrary model structure inside, which the current multivariate setup allows.

NB: This PR is a first pass, and in no way exhaustive -- we probably need to expand to more complex cases, that users will surface up. But at least it gets the ball rolling and should be self-sufficient to already merge.

jessegrabowski · 2025-08-01T06:06:53Z

I need to think a bit about this. As a v0 I guess it's fine, but I have the feeling that the whole extract_components should be refactored to just do the right thing from jump. Probably it could make better use of arviz in the first place. For example here we're casting everything to numpy then working with that. Seems dumb?

Some general comments:

I'm not wild about regex when dealing with nested structures, it has a lot of sharp edges, and the patterns are quite arcane.
I don't think we need to be backwards compatible. We're doing API breaks with every PR these days.

AlexAndorra · 2025-08-02T01:19:30Z

Yep, I agree with that. We do need that patch for the Berlin tutorial, but that can be just that -- a patch.
I'm all for making this better from the get-go, but have to say I won't have the bandwidth to work on such a high-stake PR. @OriolAbril will probably have some great points on whether we could and should rely more heavily on ArviZ

Improve idata display of structural components

4152d4a

AlexAndorra requested review from OriolAbril and jessegrabowski July 31, 2025 22:15

AlexAndorra self-assigned this Jul 31, 2025

AlexAndorra added enhancements New feature or request feature request statespace labels Jul 31, 2025

Merge branch 'main' into better-extract-component

6d6c74e

AlexAndorra added 3 commits August 2, 2025 12:46

Fix typo in docstring

a76593e

Merge branch 'main' into better-extract-component

5b171bc

Demo adding shared_state attribute

5500f53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve `xarray` display of structural components for multivariate time series #555

Improve `xarray` display of structural components for multivariate time series #555

AlexAndorra commented Jul 31, 2025

Uh oh!

jessegrabowski commented Aug 1, 2025

Uh oh!

AlexAndorra commented Aug 2, 2025

Uh oh!

Uh oh!

Improve xarray display of structural components for multivariate time series #555

Are you sure you want to change the base?

Improve xarray display of structural components for multivariate time series #555

Conversation

AlexAndorra commented Jul 31, 2025

Uh oh!

jessegrabowski commented Aug 1, 2025

Uh oh!

AlexAndorra commented Aug 2, 2025

Uh oh!

Uh oh!

Improve `xarray` display of structural components for multivariate time series #555

Improve `xarray` display of structural components for multivariate time series #555