Add a regression test for Catalog Federation #2286

poojanilangekar · 2025-08-06T22:31:21Z

This change adds a PR to test the end-to-end catalog federation functionality.

Steps involved:

Create a new-user account.
Create an INTERNAL catalog, called test-catalog-local.
Grant the following privileges on test-catalog-local :TABLE_WRITE_DATA -> catalog_admin -> service_admin -> new-user.
Create an EXTERNAL catalog called test-catalog-external that points to test-catalog-local and uses the client-id and client-secret credentials for the new_user account.
Grant the following privileges on test-catalog-external :TABLE_WRITE_DATA -> catalog_admin -> service_admin.
Connect to test-catalog-local and use it to create namespaces/tables and insert some data.
Connect to test-catalog-external and select the existing data and insert new rows.
Re-connect to test-catalog-local and verify that the changes made in test-catalog-external are visible.

This change disables URL overlap check since we are federating to a single Polaris instance.

poojanilangekar · 2025-08-07T00:01:56Z

CC @dennishuo @eric-maynard

regtests/README.md

eric-maynard · 2025-08-08T17:12:00Z

regtests/t_catalog_federation/src/catalog_federation.sh

+EOF
+
+echo ""
+echo "=== Verifying federation via LOCAL catalog ==="


Can we do some verification of RBAC as well?

I will update this test once #2223 is merged. For now, we can only do catalog-level RBACs for federated catalogs.

Co-authored-by: Eric Maynard <[email protected]>

dimas-b

The same test seems possible to do under the JUnit5 framework (as an intTest).

That approach is preferable from my POV because it is more easily discoverable in a java IDE and would show regressions at part of the usual gradle validation tasks.

WDYT?

poojanilangekar · 2025-08-11T17:52:22Z

This is a regression test (which is still run by GitHub CI). Can I send out a separate PR for an integration test if that works for you? At that point we can remove this test if necessary. Currently we have no tests for catalog federation, so it might be a good idea to keep this around while I work on the IT test as you suggested.

For some context, I won't be able to dedicate as much time to the project going forward so I expect that to take some time, in the meantime it is a good idea to have some test that check that federation still works and flag PRs that potentially break federation.

Please let me know what you think.

dimas-b · 2025-08-11T18:48:43Z

From my POV running docker-based tests is more cumbersome than running JUnit5-based tests. So, if the same functionality can be tested under the JUnit framework, I'd prefer that approach.

I do not oppose merging this PR, but I believe it too heavy for the job it performs.

eric-maynard · 2025-08-11T21:22:59Z

@dimas-b I agree that should add more integration tests on top of this , but we'll need docker-based regression tests for federation in general (imagine testing federation to Hive, you need to actually run HMS somewhere). Accordingly I think it's okay or even prudent to have a regression test for IRC federation for the sake of parity.

snazy · 2025-08-12T11:46:38Z

Having tests for more complex setups is totally legit. I just don't think that containers are are a requirement for that.

This is where for example apache/polaris-tools#18 comes into play - it runs a Polaris server (actually "anything Quarkus") from the "local" code base or a specific distributed/released version, which you can then use in an integration test to test the federation use case.

From a machine resource usage it's even more lightweight as only the processes are run, but no container infra.

And you'd be able to run it from your IDE and, with some option tweaking, can debug the processes.

dennishuo

LGTM

I agree we'll want both flavors longer-term (java tests and black-box regtests) because long-term they'll be able to test different scopes of functionality. In general having this black-box end-to-end is still needed for building confidence that it really works from the perspective of an external caller without JVM-local "tricks" that might creep into a JUnit test, especially when the nature of the feature (such as Federation) innately deals with how things interact beyond the single JVM boundary.

Re: heavyweight containers, it doesn't look like this PR actually requires anything related to Docker -- the sh test just extends the pre-existing patterns we already have for other sh tests and AFAICT is equally happy to run against a direct ./gradlew run or IDE-run or java -jar local deployment vs having Polaris run in the Docker container. So the Docker parts are just making sure the pre-existing Docker-based runs such as for precommit/CI are also able to run the new test successfully.

snazy · 2025-08-13T09:23:26Z

I'm not against this PR in principle, however it was mentioned that "we'll need docker-based regression tests". I do think that the state of the regression tests as of today. do nothing that could not be done in a Java based integration test.

OTOH, playing devil's advocate here, do we have good enough JUnit (integration) test coverage for federation?
Another devil: are the regression tests actually really testing a "full black box end to end test" including all the things that end users do (real database, external IdP, secured Polaris setup, external x/y/z)?

dennishuo · 2025-08-14T18:36:13Z

Yeah, I agree the "Docker" discussion is worth expanding in another thread, even though it's orthogonal to this PR. I'm definitely also in favor of supporting the non-Docker scenarios like running the server in an IDE or anywhere else and still being able to run tests against those, and we can make sure the structure of regtests continues supporting that pattern.

Also +1 to expanding the black-box testing scope to really exercise the things a real user will do; having this PR as a starting point will hopefully reduce the barrier of entry for folks interested in exercising those complex scenarios.

Since it sounds like no one is opposed to merging this PR, I'll go ahead and merge it to unblock expanding test coverage.

Add a regression test for Catalog Federation

166d750

github-project-automation bot added this to Basic Kanban Board Aug 6, 2025

github-project-automation bot moved this to PRs In Progress in Basic Kanban Board Aug 6, 2025

poojanilangekar added 2 commits August 6, 2025 16:20

Install jq dependency

a8fd1b5

Fix token issues

3e54902

poojanilangekar mentioned this pull request Aug 7, 2025

Modularize calls to federated catalogs #2301

Closed

eric-maynard reviewed Aug 8, 2025

View reviewed changes

poojanilangekar and others added 2 commits August 8, 2025 11:25

Update regtests/README.md

3c3dcfb

Co-authored-by: Eric Maynard <[email protected]>

Update README.md

f811c45

eric-maynard approved these changes Aug 11, 2025

View reviewed changes

github-project-automation bot moved this from PRs In Progress to Ready to merge in Basic Kanban Board Aug 11, 2025

dimas-b reviewed Aug 11, 2025

View reviewed changes

dennishuo approved these changes Aug 12, 2025

View reviewed changes

poojanilangekar mentioned this pull request Aug 12, 2025

Modularize federation (Option 2) #2332

Merged

dennishuo merged commit 22e4c68 into apache:main Aug 14, 2025
12 checks passed

github-project-automation bot moved this from Ready to merge to Done in Basic Kanban Board Aug 14, 2025

eric-maynard mentioned this pull request Aug 14, 2025

Support HMS Federation #2355

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a regression test for Catalog Federation #2286

Add a regression test for Catalog Federation #2286

Uh oh!

poojanilangekar commented Aug 6, 2025

Uh oh!

poojanilangekar commented Aug 7, 2025

Uh oh!

Uh oh!

eric-maynard Aug 8, 2025

Uh oh!

poojanilangekar Aug 8, 2025

Uh oh!

dimas-b left a comment •

edited

Loading

Uh oh!

poojanilangekar commented Aug 11, 2025

Uh oh!

dimas-b commented Aug 11, 2025

Uh oh!

eric-maynard commented Aug 11, 2025

Uh oh!

snazy commented Aug 12, 2025

Uh oh!

dennishuo left a comment

Uh oh!

snazy commented Aug 13, 2025

Uh oh!

dennishuo commented Aug 14, 2025

Uh oh!

Uh oh!

Uh oh!

Add a regression test for Catalog Federation #2286

Add a regression test for Catalog Federation #2286

Uh oh!

Conversation

poojanilangekar commented Aug 6, 2025

Uh oh!

poojanilangekar commented Aug 7, 2025

Uh oh!

Uh oh!

eric-maynard Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

poojanilangekar Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

dimas-b left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

poojanilangekar commented Aug 11, 2025

Uh oh!

dimas-b commented Aug 11, 2025

Uh oh!

eric-maynard commented Aug 11, 2025

Uh oh!

snazy commented Aug 12, 2025

Uh oh!

dennishuo left a comment

Choose a reason for hiding this comment

Uh oh!

snazy commented Aug 13, 2025

Uh oh!

dennishuo commented Aug 14, 2025

Uh oh!

Uh oh!

Uh oh!

dimas-b left a comment •

edited

Loading