Fix signedness/type mismatches when marshaling stat-related structs #24772

kleisauke · 2025-07-23T13:06:24Z

No description provided.

sbc100 · 2025-07-23T15:22:48Z

lgtm, but I'm fine with #24769 landing first too.

sbc100 · 2025-07-23T15:31:33Z

src/lib/libsyscall.js

      {{{ makeSetValue('buf', C_STRUCTS.stat.st_ctim.tv_sec, 'Math.floor(ctime / 1000)', 'i64') }}};
      {{{ makeSetValue('buf', C_STRUCTS.stat.st_ctim.tv_nsec, '(ctime % 1000) * 1000 * 1000', SIZE_TYPE) }}};
-      {{{ makeSetValue('buf', C_STRUCTS.stat.st_ino, 'stat.ino', 'i64') }}};
+      {{{ makeSetValue('buf', C_STRUCTS.stat.st_ino, 'stat.ino', 'u64') }}};


IIRC the signed-ness of the type only really matter for makeGetValue.. is that right? i.e. its when reading the bits that its important to interpret them.

Supporting evidence for this is that the makeSetValue code doesn't even have code paths for handling u64 and u53 here:

emscripten/src/parseTools.mjs

Lines 453 to 466 in 90250aa

if (type == 'i64' && !WASM_BIGINT) {

// If we lack BigInt support we must fall back to an reading a pair of I32

// values.

// prettier-ignore

return '(tempI64 = [' + splitI64(value) + '], ' +

makeSetValueImpl(ptr, pos, 'tempI64[0]', 'i32') + ',' +

makeSetValueImpl(ptr, getFastValue(pos, '+', getNativeTypeSize('i32')), 'tempI64[1]', 'i32') + ')';

}

const offset = calcFastOffset(ptr, pos);

if (type === 'i53') {

return `writeI53ToI64(${offset}, ${value})`;

}

I think maybe just always use the signed array views works for setting values?

I'm not sure what happens if we write a negative number to an unsigned typed array, but I guess its undefined behaviour from our POV and we should assert in debug build.

According to the failing tests, there's seems to be indeed support missing for -sWASM_BIGINT=0 and makeSetValue(x, y, z, 'u64'). Commit 5511235 reverts that part.

I'm not sure if this signedness mismatch only affects makeGetValue(), since getHeapForType() is also used in makeSetValue() (e.g. HEAP32 versus HEAPU32).

I'm not sure if this signedness mismatch only affects makeGetValue(), since getHeapForType() is also used in makeSetValue() (e.g. HEAP32 versus HEAPU32).

I'm not sure either, but I would say that seems like signedness is critical when reading values (since the bytes need to be interpreted), but maybe not critical when writing them.

This is an automatic change generated by tools/maint/rebaseline_tests.py. The following (1) test expectation files were updated by running the tests with `--rebaseline`: ``` code_size/test_codesize_hello_dylink.json: 45583 => 45582 [-1 bytes / -0.00%] Average change: -0.00% (-0.00% - -0.00%) ```

This is an automatic change generated by tools/maint/rebaseline_tests.py. The following (3) test expectation files were updated by running the tests with `--rebaseline`: ``` code_size/test_codesize_cxx_wasmfs.json: 176935 => 176938 [+3 bytes / +0.00%] code_size/test_codesize_files_wasmfs.json: 55778 => 55781 [+3 bytes / +0.01%] code_size/test_codesize_hello_dylink_all.json: 844699 => 844697 [-2 bytes / -0.00%] Average change: +0.00% (-0.00% - +0.01%) ```

kleisauke · 2025-07-31T17:12:55Z

The codesize tests appear to be quite sensitive, especially other.test_codesize_hello_dylink_all. IIRC, we allowed some slop/slack factor in the past, but it seems that's no longer configurable.

sbc100 · 2025-07-31T17:16:15Z

Yes, I apologize for that. I'm not sure if we should remove that test again.. maybe? Or we should have it auto-update as part of the review process.

kleisauke · 2025-07-31T19:21:35Z

I'm not certain what the ideal solution is, but one possibility is to create a bot that automatically updates test expectations whenever a specific label (perhaps reuse code size or introduce "needs-rebaseline"?) is applied to a PR. Once the bot completes the update, it would remove the label.

Using labels helps prevent potential abuse, since applying them requires triage access or higher (this idea is also somewhat inspired by LLVM's convenient /cherry-pick command, which also requires proper permissions, such as the ability to edit the milestone).

kleisauke · 2025-07-31T19:23:12Z

... ah, I forgot that this only works when the "Allow edits by maintainers" option is enabled, though I believe it's enabled by default.

sbc100 · 2025-07-31T20:15:51Z

I'm not certain what the ideal solution is, but one possibility is to create a bot that automatically updates test expectations whenever a specific label (perhaps reuse code size or introduce "needs-rebaseline"?) is applied to a PR. Once the bot completes the update, it would remove the label.

Using labels helps prevent potential abuse, since applying them requires triage access or higher (this idea is also somewhat inspired by LLVM's convenient /cherry-pick command, which also requires proper permissions, such as the ability to edit the milestone).

Yes, we already have an action that will create rebasline CLs for main: https://github.com/emscripten-core/emscripten/actions/workflows/rebaseline-tests.yml

Figuring out how to make this work PRs in flight is what I'd like to do.

I've not get written a bot that responds to labels of commands but it seems possible.

Fix signedness/type mismatches when marshaling stat-related structs

289d02e

kleisauke mentioned this pull request Jul 23, 2025

Use 64-bit fields for fsblkcnt_t and fsfilcnt_t #24769

Merged

sbc100 reviewed Jul 23, 2025

View reviewed changes

kleisauke added 2 commits July 23, 2025 19:12

Fix -sWASM_BIGINT=0 tests

5511235

Merge branch 'main' into stat-syscall-correct-types

d78cf85

sbc100 approved these changes Jul 24, 2025

View reviewed changes

kleisauke added 2 commits July 25, 2025 11:36

Merge branch 'main' into stat-syscall-correct-types

897e0d1

sbc100 approved these changes Jul 25, 2025

View reviewed changes

kleisauke added 2 commits July 31, 2025 18:36

Merge branch 'main' into stat-syscall-correct-types

96255d9

Merge branch 'main' into stat-syscall-correct-types

655ac2f

sbc100 merged commit 14da0a2 into emscripten-core:main Aug 1, 2025
28 of 30 checks passed

kleisauke deleted the stat-syscall-correct-types branch August 1, 2025 18:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix signedness/type mismatches when marshaling stat-related structs #24772

Fix signedness/type mismatches when marshaling stat-related structs #24772

Uh oh!

kleisauke commented Jul 23, 2025

Uh oh!

sbc100 commented Jul 23, 2025

Uh oh!

sbc100 Jul 23, 2025

Uh oh!

kleisauke Jul 23, 2025

Uh oh!

sbc100 Jul 23, 2025

Uh oh!

kleisauke commented Jul 31, 2025 •

edited

Loading

Uh oh!

sbc100 commented Jul 31, 2025

Uh oh!

kleisauke commented Jul 31, 2025

Uh oh!

kleisauke commented Jul 31, 2025

Uh oh!

sbc100 commented Jul 31, 2025

Uh oh!

Uh oh!

Uh oh!

	if (type == 'i64' && !WASM_BIGINT) {
	// If we lack BigInt support we must fall back to an reading a pair of I32
	// values.
	// prettier-ignore
	return '(tempI64 = [' + splitI64(value) + '], ' +
	makeSetValueImpl(ptr, pos, 'tempI64[0]', 'i32') + ',' +
	makeSetValueImpl(ptr, getFastValue(pos, '+', getNativeTypeSize('i32')), 'tempI64[1]', 'i32') + ')';
	}

	const offset = calcFastOffset(ptr, pos);

	if (type === 'i53') {
	return `writeI53ToI64(${offset}, ${value})`;
	}

Fix signedness/type mismatches when marshaling stat-related structs #24772

Fix signedness/type mismatches when marshaling stat-related structs #24772

Uh oh!

Conversation

kleisauke commented Jul 23, 2025

Uh oh!

sbc100 commented Jul 23, 2025

Uh oh!

sbc100 Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

kleisauke Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

sbc100 Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

kleisauke commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbc100 commented Jul 31, 2025

Uh oh!

kleisauke commented Jul 31, 2025

Uh oh!

kleisauke commented Jul 31, 2025

Uh oh!

sbc100 commented Jul 31, 2025

Uh oh!

Uh oh!

Uh oh!

kleisauke commented Jul 31, 2025 •

edited

Loading