Skip to content

Fix UTF-16 supplementary character encoding#659

Merged
swebb2066 merged 1 commit into
apache:masterfrom
metsw24-max:utf16-supplementary-encoding
May 12, 2026
Merged

Fix UTF-16 supplementary character encoding#659
swebb2066 merged 1 commit into
apache:masterfrom
metsw24-max:utf16-supplementary-encoding

Conversation

@metsw24-max
Copy link
Copy Markdown
Contributor

This fixes incorrect UTF-16 encoding for supplementary-plane Unicode characters in the UTF-16BE and UTF-16LE transcoder paths.

Changes:

  • Fixed low surrogate byte construction in transcoder.cpp
  • Added regression tests for:
    • UTF-16BE BMP encoding
    • UTF-16BE supplementary-plane encoding
    • UTF-16LE supplementary-plane encoding

Testing

Added regression coverage in:

 src/test/cpp/helpers/transcodertestcase.cpp

@swebb2066 swebb2066 merged commit 46037d0 into apache:master May 12, 2026
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants