Derive logprob for Split operation #7875
Conversation
Classic mypy failure; feel free to review ignoring that, I'll fix it.
Codecov Report ❌ Patch coverage is
Additional details and impacted files

@@            Coverage Diff            @@
##             main    #7875     +/-   ##
=========================================
+ Coverage   88.25%   92.94%   +4.69%
=========================================
  Files         116      116
  Lines       18845    18875      +30
=========================================
+ Hits        16631    17544     +913
+ Misses       2214     1331     -883
# If the axis is over a dimension that was reduced in the logp (multivariate logp),
# we cannot split it into distinct entries: the mapping between values and densities breaks.
# We return the logp weighted by the split sizes. This is as good a solution as any?
split_weights = splits / pt.sum(splits)
Is this legit?
I think so? In MarginalMixture we decided to set the whole logp on the first entry and zero for the others; I like this approach more.
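The size-weighting discussed above can be illustrated with a small NumPy sketch. This is not PyMC's actual graph rewrite; `total_logp` and `splits` are hypothetical stand-ins for the joint logp and split sizes:

```python
import numpy as np

# A multivariate logp collapses the core dimension to a single number, so a
# split along that dimension has no exact per-part density. One option is to
# weight the total logp by the relative split sizes.
splits = np.array([2, 1])        # hypothetical sizes of each split part
total_logp = -3.0                # hypothetical joint logp of the full vector

split_weights = splits / splits.sum()    # [2/3, 1/3]
logp_parts = split_weights * total_logp  # per-part weighted logps

# The weighted parts still sum to the original joint logp
assert np.isclose(logp_parts.sum(), total_logp)
```

The alternative mentioned in the thread (all logp on the first entry, zero on the rest) also preserves the sum; the weighting merely spreads it proportionally.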
# axis=-2 (i.e., axis 0, the batch dimension)
x_parts = pt.split(x, splits_size=[2, 1], n_splits=2, axis=-2)
x_parts_vv = [x_part.clone() for x_part in x_parts]
logp_parts = list(conditional_logp(dict(zip(x_parts, x_parts_vv))).values())
Do I understand this correctly that each part is conditioned on the values of all other parts?
Thinking about e.g. the MVN case, where if you split the vector and condition each split on the other, you get two new MVN distributions.
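The MVN intuition above can be checked numerically. A standalone NumPy sketch (not part of this PR) verifying that the joint logp equals the conditional logp of one part plus the marginal logp of the other, with arbitrary illustrative numbers:

```python
import numpy as np

def mvn_logpdf(x, mu, cov):
    """Log-density of a multivariate normal, NumPy only."""
    k = len(mu)
    diff = x - mu
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (k * np.log(2 * np.pi) + logdet + maha)

# A 3-dimensional MVN split into parts of size 2 and 1
mu = np.array([0.0, 1.0, -1.0])
cov = np.array([[2.0, 0.5, 0.3],
                [0.5, 1.5, 0.2],
                [0.3, 0.2, 1.0]])
x = np.array([0.1, 0.7, -0.5])

x1, x2 = x[:2], x[2:]
mu1, mu2 = mu[:2], mu[2:]
S11, S12 = cov[:2, :2], cov[:2, 2:]
S21, S22 = cov[2:, :2], cov[2:, 2:]

# Conditioning one split on the other gives a new MVN (Schur complement)
cond_mu = mu1 + S12 @ np.linalg.solve(S22, x2 - mu2)
cond_cov = S11 - S12 @ np.linalg.solve(S22, S21)

# Chain rule: log p(x) = log p(x1 | x2) + log p(x2)
joint = mvn_logpdf(x, mu, cov)
chained = mvn_logpdf(x1, cond_mu, cond_cov) + mvn_logpdf(x2, mu2, S22)
assert np.isclose(joint, chained)
```

This is why evaluating each part given the others is coherent: the conditional densities multiply back to the joint.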
There's no marginalization going on; you can't evaluate the logp of only one part without providing the remaining ones. The only thing we do is join the values, get the logp, and split it again. We could argue that we don't want to do this for multivariate variables split along the core dimension, since there's no way to split the logp (I did the weighting, but we can revert and raise NotImplementedError instead).
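The join-then-split data flow described above can be sketched in plain NumPy. The elementwise standard-normal logp and the value parts are hypothetical; the real rewrite operates on PyTensor graphs:

```python
import numpy as np

# Hypothetical iid standard-normal elementwise logp; stands in for the
# logp of the joined random variable.
def elemwise_logp(x):
    return -0.5 * (np.log(2 * np.pi) + x**2)

# Hypothetical value parts, matching split sizes [2, 1]
value_parts = [np.array([0.1, -0.2]), np.array([0.5])]

joined = np.concatenate(value_parts)      # 1. join the values
logp_joined = elemwise_logp(joined)       # 2. evaluate the logp of the joined value
sizes = [len(p) for p in value_parts]
cuts = np.cumsum(sizes)[:-1]
logp_parts = np.split(logp_joined, cuts)  # 3. split the logp back per part

# Each part's logp lines up with its own values, and nothing is lost
assert [len(lp) for lp in logp_parts] == sizes
```

Because the logp here is elementwise, splitting it is exact; it is only when a reduction has already collapsed the split axis (the multivariate case above) that the per-part mapping breaks.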
📚 Documentation preview 📚: https://pymc--7875.org.readthedocs.build/en/7875/