Lazy vtables #4373

belliottsmith · 2025-09-09T09:53:51Z

Thanks for sending a pull request! Here are some tips if you're new here:

Ensure you have added or run the appropriate tests for your PR.
Be sure to keep the PR description updated to reflect all changes.
Write your PR title to summarize what this PR proposes.
If possible, provide a concise example to reproduce the issue for a faster review.
Read our contributor guidelines
If you're making a documentation change, see our guide to documentation contribution

Commit messages should follow the following format:

<One sentence description, usually Jira title or CHANGES.txt summary>

<Optional lengthier description (context on patch)>

patch by <Authors>; reviewed by <Reviewers> for CASSANDRA-#####

Co-authored-by: Name1 <email1>
Co-authored-by: Name2 <email2>

The Cassandra Jira

iamaleksey

+1, with a few nits addressed in a feedback commit.

clohfink · 2025-09-11T13:25:01Z

Maybe a test or two? Looks like you're testing the accord table but not verifying the lazy vtable interface at all.

clohfink · 2025-09-11T13:38:43Z

src/java/org/apache/cassandra/db/virtual/AbstractLazyVirtualTable.java

+                O bound = translate.apply((I)column.type.compose(expression.getIndexValue()));
+                switch (expression.operator())
+                {
+                    default: throw new InvalidRequestException("Operator " + expression.operator() + " not supported for txn_id");


this should be generic not explicitly erroring for accord table

thanks, will address that

belliottsmith · 2025-09-11T13:52:56Z

Maybe a test or two? Looks like you're testing the accord table but not verifying the lazy vtable interface at all.

I do not see additional value in dedicated testing, when it is exercised by the accord virtual table tests. Those tables are arguably much more in need of expanded test coverage.

Either way, while it's not an unreasonable request, I also don't have time. So if you want this, I will have to leave the patch unmerged for the foreseeable future, and use it only internally.

clohfink · 2025-09-11T14:02:38Z

So if you want this, I will have to leave the patch unmerged for the foreseeable future, and use it only internally.

Ok I will be testing it out a little as was (trying) to migrate the PartitionKeyStatsTable to use it instead of manually

belliottsmith · 2025-09-11T14:21:50Z

Let me know if you need help figuring out porting. I hope it should be pretty intuitive, and the patch has diffs showing my migration of the Accord tables to it that hopefully provides some examples, but if there's something that's unclear let me know.

smiklosovic · 2025-09-11T14:49:03Z

src/java/org/apache/cassandra/db/virtual/AbstractLazyVirtualTable.java

+            public <V> ColumnsCollector add(String name, V input, Function<V, Object> f)
+            {
+                ColumnMetadata cm = columnLookup.get(name);
+                if (!columnFilter.fetches(cm))


hi @belliottsmith , I am looking at this patch and you have code like this in vtables:

.lazyAdd(columns -> { columns.add("ballot", toStringOrNull(txn.ballot()))

I just want to understand it better (havent looked into this in IDE, just here), so ballot is name (argument of add method here, and input is toStringOrNull(txn.ballot()).

While I understand that when ballot is not in column filter, it will return this, that still means that V input is "materialized", basically toStringOrNull(txn.ballot()) is called regardless of whether name is filtered or not.

Is this correct?

When it is, then could not be the materialisation itself postponed? I am asking because I can imagine that if there is some non-trivial transformation of value to that column, it might take quite a while / take some resources while we do not ultimately need it.

V input might be Callable<V> input and then

Object value = f.apply(input.call()) if (value == null) return this;

and called like columns.add("ballot", () -> toStringOrNull(txn.ballot()))

I do not know what is worse, to have callable object instantiated every time to postpone it or to call that method even it is useless.

Maybe this can be parameterized and this decision would be left for a developer to decide.

Hi @smiklosovic. So, there's two ways to be lazy here, and we already support what you suggest. There's two add methods in ColumnsCollector, one that accepts a simple value, and another that accepts an input value and a Function that transforms it to the output value. So the equivalent here would be calling columns.add("ballot", txn.ballot(), AccordDebugKeyspace::toStringOrNull). I simply didn't think it was worth the extra ceremony at this call-site.

The second laziness applies at the clustering key level. If the clustering can be excluded (because for instance we have enough results and the new value comes after the last result we have collected) then we don't even reach this point - the whole row is discarded without considering any columns.

We could also fairly easily expand this early filtering to apply RowFilter expressions to clusterings before we materialise columns.

So the equivalent here would be calling columns.add("ballot", txn.ballot(), AccordDebugKeyspace::toStringOrNull)

Aha right, I see ... basically the "callable" in my comment is conceptually the same as the transformation function itself. Yeah ...

While it might be theoretically expensive to even call txn.ballot() (replace with any other arbitrary object / method which needs to be passed / called), I can see that the current code is just fine for the common usage. These columns just hold simple objects / strings / numbers etc. so passing them is basically for free.

I leave your suggestion in the last sentence to your judgement.

You can handle your case easily by providing txn as the input, and mapping it with t -> toStringOrNull(t.ballot()). The important thing is that this structure does not allocate a capturing lambda.

I mostly noted the additional filtering as something we could do in future fairly easily. I’ve implemented all of the functionality I expect to need for the moment, though perhaps I will discover something missing when I roll it out.

To support recovering a node that has lost some of its local transaction log, introduce rebootstrap and unsafe bootstrap modes, where Accord ensures no responses are produced for transactions the node cannot be certain it had not previously answered. patch by Benedict and Alex Petrov for CASSANDRA-20908

Also Improve: - Searchable system_accord_debug.txn patch by Benedict; reviewed by Aleksey Yeschenko for CASSANDRA-20899

- Integrate txn_blocked_by deadline and depth filter with execution logic, to ensure we terminate promptly and get a best effort reply

ifesdjeen · 2025-09-18T15:02:28Z

I did not yet have a chance to fully review the patch, but since this patch mostly ensures that Accord tables are lazy, I think the amount of testing here is appropriate. I would personally not add extensive tests to, say, test whether or not we are seeing all coordinations, etc, since this will likely just lead to tightly coupling observability with internals (which we unfortunately have in too many places already.

I thought we were talking about remote vtables with @clohfink in MN, so I briefly looked at the other patch, too. And from what I could see, the tables that are remote, we do have test coverage.

I think testing for laziness itself is probably also a bit excessive, but it also does not seem that it was requested here, so just mentioning this for completeness. We never verify laziness property in iterators, either. Nor do we verify eagerness of the existing vtables. I think this is just an implementation detail. Main thing they work as expected.

tl;dr it does seem that we have enough tests to verify that tables that are implemented as lazy, which are AccordDebugKeyspace tables. Moreover, these tables are hidden behind a feature flag that disables them by default (even two feature flags, one is accord, and second one accord debug tables themselves). Maybe I am missing something, so if you could elaborate what kind of test is expected, I could try to maybe address this.

That said, if we want to have exhaustive coverage for both lazy and distributed tables machinery, I think we should also do that separately: we could generate different random virtual tables, populate them with different test data, and make sure same thing comes out from both ways. Which, I think, should probably have been done when virtual tables first introduced. But I would definitely not enforce it on either of these patches. Could be a fun, separate, introductory ticket to Cassandra.

belliottsmith · 2025-09-18T15:10:37Z

@clohfink If you consider the integration tests inadequate for a general purpose facility, I will make this class Accord-only for now so that it is not a concern. If you or @smiklosovic want to use this facility for other use cases, you can decide between yourselves what API tests you consider appropriate.

iamaleksey self-requested a review September 11, 2025 10:11

iamaleksey approved these changes Sep 11, 2025

View reviewed changes

belliottsmith force-pushed the lazy-vtables branch 4 times, most recently from 8c2c424 to 8357790 Compare September 11, 2025 12:53

clohfink reviewed Sep 11, 2025

View reviewed changes

belliottsmith force-pushed the lazy-vtables branch from 8357790 to b01b101 Compare September 11, 2025 13:54

belliottsmith force-pushed the lazy-vtables branch from b01b101 to 9802141 Compare September 11, 2025 14:27

smiklosovic reviewed Sep 11, 2025

View reviewed changes

iamaleksey self-requested a review September 12, 2025 09:39

iamaleksey approved these changes Sep 12, 2025

View reviewed changes

belliottsmith force-pushed the lazy-vtables branch 4 times, most recently from c585205 to d3875bc Compare September 18, 2025 11:03

belliottsmith force-pushed the trunk branch from 4ac0487 to 0b20128 Compare September 18, 2025 11:31

ifesdjeen and others added 3 commits September 18, 2025 15:49

Lazy Virtual Tables

add004b

Also Improve: - Searchable system_accord_debug.txn patch by Benedict; reviewed by Aleksey Yeschenko for CASSANDRA-20899

- Support mutable lazy vtables and port remaining accord tables

1ccec48

- Integrate txn_blocked_by deadline and depth filter with execution logic, to ensure we terminate promptly and get a best effort reply

belliottsmith force-pushed the lazy-vtables branch from d3875bc to 1ccec48 Compare September 18, 2025 14:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Lazy vtables #4373

Lazy vtables #4373

Uh oh!

belliottsmith commented Sep 9, 2025

Uh oh!

iamaleksey left a comment

Uh oh!

clohfink commented Sep 11, 2025

Uh oh!

clohfink Sep 11, 2025

Uh oh!

belliottsmith Sep 11, 2025

Uh oh!

belliottsmith commented Sep 11, 2025

Uh oh!

clohfink commented Sep 11, 2025

Uh oh!

belliottsmith commented Sep 11, 2025

Uh oh!

smiklosovic Sep 11, 2025 •

edited

Loading

Uh oh!

belliottsmith Sep 11, 2025

Uh oh!

smiklosovic Sep 11, 2025 •

edited

Loading

Uh oh!

belliottsmith Sep 11, 2025

Uh oh!

ifesdjeen commented Sep 18, 2025 •

edited

Loading

Uh oh!

belliottsmith commented Sep 18, 2025

Uh oh!

Uh oh!

Lazy vtables #4373

Are you sure you want to change the base?

Lazy vtables #4373

Uh oh!

Conversation

belliottsmith commented Sep 9, 2025

Uh oh!

iamaleksey left a comment

Choose a reason for hiding this comment

Uh oh!

clohfink commented Sep 11, 2025

Uh oh!

clohfink Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

belliottsmith Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

belliottsmith commented Sep 11, 2025

Uh oh!

clohfink commented Sep 11, 2025

Uh oh!

belliottsmith commented Sep 11, 2025

Uh oh!

smiklosovic Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

belliottsmith Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

smiklosovic Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

belliottsmith Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

ifesdjeen commented Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

belliottsmith commented Sep 18, 2025

Uh oh!

Uh oh!

smiklosovic Sep 11, 2025 •

edited

Loading

smiklosovic Sep 11, 2025 •

edited

Loading

ifesdjeen commented Sep 18, 2025 •

edited

Loading