Skip to content

sync with open source how#118

Draft
lesterhaynes wants to merge 8718 commits intolinkedin:li_trunkfrom
apache:master
Draft

sync with open source how#118
lesterhaynes wants to merge 8718 commits intolinkedin:li_trunkfrom
apache:master

Conversation

@lesterhaynes
Copy link
Copy Markdown

Please add a meaningful description for your change here


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI.

Abacn and others added 10 commits March 18, 2026 11:17
…enchmark

Fix vLLM Gemma benchmark and PyTorch language modeling tests
Bumps [google.golang.org/grpc](https://github.com/grpc/grpc-go) from 1.78.0 to 1.79.3.
- [Release notes](https://github.com/grpc/grpc-go/releases)
- [Commits](grpc/grpc-go@v1.78.0...v1.79.3)

---
updated-dependencies:
- dependency-name: google.golang.org/grpc
  dependency-version: 1.79.3
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…l.py (#37857)

Replace root logger calls (logging.info, logging.warning, etc.) with a
module-level named logger (_LOGGER = logging.getLogger(__name__)) in
apache_beam.transforms.util. This allows sdk_harness_log_level_overrides
to properly control log levels for this module.
* Adds bundleFinalizer support to non-portable worker.
* Removes check preventing stateful DoFn's with bundle finalizers from running on Dataflow streaming non-portable worker when using Streaming Engine
* Handle some cases during infer schema from dataclass

* For backward compatibility, only infer schema for frozen dataclasses
  when it's registered with row coder

* Make sure Beam schema ID does not inherit

* Fix IndexOutofBoundError trying to infer type from custom Iterable
  without type hint

* Fix #37862: fixed named tuple and effectively fails dataclass inside union typehint

* Allow non-frozen dataclass register with other coders as a backup for backward compatibility; add tests

* Add upgrade compatibility check for potential coder change

* Update CHANGES.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* Test case for update_compatibility_version

---------

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
…#37758)

* addressed all EqualsGetClass warnings

* Fix GetterBasedSchemaProvider.equals

* supress equalsGetClass in ProtoCoder and DynamicProtoCoder.
jrmccluskey and others added 30 commits April 16, 2026 13:41
* Swap out pylint and flake8 with ruff

* time the mypy run

* consolidate mypy checks into linting

* Migrate pre-commit hook, pin versions in setup.py

* fix new breakages

* lint/format adjustments

* review comments, fix SKILL configs
Bumps [protobufjs](https://github.com/protobufjs/protobuf.js) to 7.5.5 and updates ancestor dependencies [protobufjs](https://github.com/protobufjs/protobuf.js) and [google-gax](https://github.com/googleapis/google-cloud-node-core/tree/HEAD/packages/gax). These dependencies need to be updated together.


Updates `protobufjs` from 7.5.4 to 7.5.5
- [Release notes](https://github.com/protobufjs/protobuf.js/releases)
- [Changelog](https://github.com/protobufjs/protobuf.js/blob/master/CHANGELOG.md)
- [Commits](protobufjs/protobuf.js@protobufjs-v7.5.4...protobufjs-v7.5.5)

Updates `protobufjs` from 6.11.4 to 8.0.1
- [Release notes](https://github.com/protobufjs/protobuf.js/releases)
- [Changelog](https://github.com/protobufjs/protobuf.js/blob/master/CHANGELOG.md)
- [Commits](protobufjs/protobuf.js@protobufjs-v7.5.4...protobufjs-v7.5.5)

Updates `google-gax` from 4.0.4 to 4.6.1
- [Release notes](https://github.com/googleapis/google-cloud-node-core/releases)
- [Changelog](https://github.com/googleapis/google-cloud-node-core/blob/main/packages/gax/CHANGELOG.md)
- [Commits](https://github.com/googleapis/google-cloud-node-core/commits/HEAD/packages/gax)

---
updated-dependencies:
- dependency-name: protobufjs
  dependency-version: 7.5.5
  dependency-type: indirect
- dependency-name: protobufjs
  dependency-version: 8.0.1
  dependency-type: direct:production
- dependency-name: google-gax
  dependency-version: 4.6.1
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Enable UndefinedEquals check

* Suppress UndefinedEquals warning in WindowedValues.java
* product name is deprecated, changing to client name

* create a share method + append external client name

* add unittests and rename original test file to IT

* reuse the same properties

* add user agent logic to the generated documentation

* fix query log targeted column

* add integration test to verify user agent is correct

* make sure integration tests are included

* move IT out of test task
* apply name mapping at commit time

* trigger ITs
* Exclude 3.14 ML containers from push314

* feedback
)

Bumps [nltk](https://github.com/nltk/nltk) from 3.9.3 to 3.9.4.
- [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog)
- [Commits](nltk/nltk@3.9.3...3.9.4)

---
updated-dependencies:
- dependency-name: nltk
  dependency-version: 3.9.4
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* [python] Expand SDF in PortableRunner default optimization

Enable translations.expand_sdf in PortableRunner's default pre-optimization path so Python Read transforms are expanded for portable runners like Spark. Also add optimizer coverage for default SDF expansion, explicit pre_optimize=expand_sdf, and bounded Read expansion.\n\nRefs #24422.

* [python] Fix formatting for PortableRunner SDF optimization

* Make expand_sdf portable pre_optimize opt-in

* chore: retrigger CI
Bumps [lodash](https://github.com/lodash/lodash) from 4.17.23 to 4.18.1.
- [Release notes](https://github.com/lodash/lodash/releases)
- [Commits](lodash/lodash@4.17.23...4.18.1)

---
updated-dependencies:
- dependency-name: lodash
  dependency-version: 4.18.1
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Bumps [flatted](https://github.com/WebReflection/flatted) from 3.4.1 to 3.4.2.
- [Commits](WebReflection/flatted@v3.4.1...v3.4.2)

---
updated-dependencies:
- dependency-name: flatted
  dependency-version: 3.4.2
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…38101)

Bumps [github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream](https://github.com/aws/aws-sdk-go-v2) from 1.7.4 to 1.7.8.
- [Release notes](https://github.com/aws/aws-sdk-go-v2/releases)
- [Commits](aws/aws-sdk-go-v2@service/m2/v1.7.4...service/m2/v1.7.8)

---
updated-dependencies:
- dependency-name: github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream
  dependency-version: 1.7.8
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…#37940)

Bumps [github.com/nats-io/nats-server/v2](https://github.com/nats-io/nats-server) from 2.12.4 to 2.12.6.
- [Release notes](https://github.com/nats-io/nats-server/releases)
- [Changelog](https://github.com/nats-io/nats-server/blob/main/RELEASES.md)
- [Commits](nats-io/nats-server@v2.12.4...v2.12.6)

---
updated-dependencies:
- dependency-name: github.com/nats-io/nats-server/v2
  dependency-version: 2.12.6
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Fix IcebergIO connection pool crash by moving FileIO lifecycle to @teardown

* trigger build

* move catalog init to @setup, make table cache per-DoFn

* fix spotless and import

* use static shared catalog

* trigger integration tests

* trigger integration test
* Allow Python user type pass through Beam SQL

* Complete pythonsdk_any logical type representation def. Otherwise
  Java side SchemaTranslation for this logical type would fail

* Handle PassthroughLogicalType in Beam SQL.
  Allow Beam SQL treat PassthroughLogicalType as its base type

* Fix nested bytes in Beam SQL

* Introduce a schema option for compact encoding for static non-null
  schema

* Preserve original logical type in Beam->Calcite->Beam trip
…8273)

The four portable-execution test methods in FlinkRequiresStableInputTest
were declared with `@Test(timeout = 30_000)`, which is consistently
exceeded under CI load and currently causes
testParDoRequiresStableInputPortable to fail
TestTimedOutException at FlinkRequiresStableInputTest.java:190 (the
1-second poll loop in executePipeline) on every recent
beam_PreCommit_Java run on master.

All four sibling portable-pipeline tests in the same package
(PortableExecutionTest, PortableTimersExecutionTest,
PortableStateExecutionTest, ReadSourcePortableTest) already use
`@Test(timeout = 120_000)`. This test is doing strictly more work
(deliberate failure -> checkpoint -> restore -> re-execute -> DONE) yet
had a 4x tighter budget, which matches the "Jenkins under high load"
diagnosis from #21333.

Aligning the timeout with the rest of the portable Flink test suite.
* Add connectivity type tracking to endpoint consumption logic

* Addressing review comments

* Removing redundant version check
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.