[io] Properly abort when buffer size overflows max integer or size > maxBufferSize #19606


Open · wants to merge 4 commits into master

Conversation

ferdymercury (Collaborator)

This pull request fixes #14770 and is a first step towards #6734.

Checklist:

  • tested changes locally
  • updated the docs (if necessary)

ferdymercury changed the title from "[io] Properly abort when buffer size overflows max integer" to "[io] Properly abort when buffer size overflows max integer or size > maxBufferSize" on Aug 11, 2025
ferdymercury added the clean build (Ask CI to do non-incremental build on PR) label on Aug 11, 2025
ferdymercury removed the clean build (Ask CI to do non-incremental build on PR) label on Aug 11, 2025
pcanal (Member) commented Aug 11, 2025

Can you make a separate PR with only the variable-name change (bufsize), to reduce the noise in this more challenging PR?

github-actions bot commented Aug 12, 2025

Test Results

20 files · 20 suites · 3d 6h 24m 9s ⏱️
3 376 tests: 3 374 ✅  0 💤  2 ❌
65 866 runs: 65 860 ✅  0 💤  6 ❌

For more details on these failures, see this check.

Results for commit eb0dc82.

♻️ This comment has been updated with latest results.

jblomer (Contributor) left a comment

Cool, many thanks!

I think generally we want to use std::size_t for buffer sizes instead of Long64_t.

We should probably also update TBuffer::[Read|Write]Buf, TBuffer::ReadString, TBuffer::MapObject, TBuffer::[Check|Set]ByteCount, TBuffer::SetBufferDisplacement.

Maybe also TBuffer::[Read|Write]Clones.

Sometimes we are checking for kMaxBufferSize, sometimes for kMaxInt. Shouldn't we always check for kMaxBufferSize?

Regarding commit messages, I'd suggest

[NFC] remove unused headers

and

[io] accept and check long buffer size params

with an explanation of why we (at this point) allow for long buffer sizes but then abort when they are actually used.

@pcanal: do we have an indication that the optimization of initializing the buffer size to the average buffer size seen so far in the file is actually useful? There are certainly write patterns where it hurts rather than helps. Removing this optimization would get us a fair amount of simplification in a number of read/write APIs.
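
Picking up the kMaxBufferSize point above, a minimal sketch of what a single, consistently used bounds check might look like. The helper name `CheckBufferSize` and the constant's value are assumptions for illustration, not ROOT's actual definitions:

```cpp
// Hypothetical helper: every size-taking entry point funnels through one
// check against kMaxBufferSize (value illustrative only).
#include <cstdio>
#include <cstdlib>

using Long64_t = long long; // stand-in for ROOT's typedef

constexpr Long64_t kMaxBufferSize = 0x7FFFFFFE; // illustrative limit

inline void CheckBufferSize(Long64_t bufsize, const char *where)
{
   if (bufsize < 0 || bufsize > kMaxBufferSize) {
      std::fprintf(stderr, "Fatal in <%s>: buffer size %lld exceeds kMaxBufferSize (%lld)\n",
                   where, bufsize, kMaxBufferSize);
      std::abort(); // abort properly instead of silently truncating to Int_t
   }
}
```

Routing every check through one helper would also prevent the kMaxInt and kMaxBufferSize checks from drifting apart.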

Comment on lines +407 to +408
+   static void SetFileReadCalls(Long64_t readcalls = 0);
+   static void SetReadaheadSize(Long64_t bytes = 256000);
Contributor

Maybe put those in another commit/PR.

ferdymercury (Collaborator, Author)

> I think generally we want to use std::size_t for buffer sizes instead of Long64_t.

Even if that changes the data type from signed to unsigned?

Also, wouldn't it be better to have ULong64_t instead of std::size_t?

Since size_t is unsigned int on 32-bit targets and unsigned long long or unsigned long on 64-bit targets, depending on the C++ implementation and the compilation target, whereas ULong64_t is always unsigned long long.

jblomer (Contributor) commented Aug 14, 2025

> Even if that changes the data type from signed to unsigned?
>
> Also, wouldn't it be better to have ULong64_t instead of std::size_t?
>
> Since size_t is unsigned int on 32-bit targets and unsigned long long or unsigned long on 64-bit targets, depending on the C++ implementation and the compilation target, whereas ULong64_t is always unsigned long long.

In my opinion, that's the point: std::size_t is the integer type that describes the length of something in memory, and it is platform-dependent. On a 32-bit platform, for example, there will never be a buffer > 4 GB.
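
A tiny demonstration of that platform dependence (an illustration only, not part of the PR):

```cpp
// std::size_t spans exactly the range of in-memory object sizes the
// target can address; ULong64_t is 64 bits everywhere.
#include <cstddef>
#include <cstdio>
#include <limits>

int main()
{
   std::printf("size_t: %zu bits, max = %zu\n",
               sizeof(std::size_t) * 8, std::numeric_limits<std::size_t>::max());
   // 32-bit target: 32 bits, max just under 4 GB. 64-bit target: 64 bits.
}
```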

ferdymercury (Collaborator, Author) commented Aug 14, 2025

> In my opinion, that's the point: std::size_t is the integer type that describes the length of something in memory, and it is platform-dependent. On a 32-bit platform, for example, there will never be a buffer > 4 GB.

Sounds reasonable. What if a TTree in the future contains a big entry over 4 GB? Does that mean it won't be readable on 32-bit platforms? How do we error out then, or is it silently cropped? Or is it going to be emulated as several buffers one after the other?
Using a data type larger than the actual type (ULong64_t vs size_t) leaves room for detecting those kinds of situations.
But maybe these are all corner cases and it's not worth it?

jblomer (Contributor) commented Aug 14, 2025

>> In my opinion, that's the point: std::size_t is the integer type that describes the length of something in memory, and it is platform-dependent. On a 32-bit platform, for example, there will never be a buffer > 4 GB.
>
> Sounds reasonable. What if a TTree in the future contains a big entry over 4 GB? Does that mean it won't be readable on 32-bit platforms? How do we error out then, or is it silently cropped? Or is it going to be emulated as several buffers one after the other? Using a data type larger than the actual type (ULong64_t vs size_t) leaves room for detecting those kinds of situations. But maybe these are all corner cases and it's not worth it?

I think generally we have to distinguish between the in-memory buffer and what's serialized to disk. If a big atomic object (e.g., a histogram) is serialized to disk, it can't be read back on 32bit platforms. I think that's fine and unavoidable. The 32bit machine is simply not capable enough. On disk, of course, we will need to represent the size of objects in a platform-independent way. I think that the deserialization of the object length will be the proper point to throw errors.

Regarding the concrete on-disk representation, the plan is to chunk large objects in multiple keys to keep the changes to the TFile on-disk format minimal.
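
A hedged sketch of such a deserialization-time check (the function name and the 64-bit on-disk length are assumptions for illustration, not ROOT's actual TFile format or API):

```cpp
// Hypothetical: reject on read any object length that this platform
// cannot represent in memory, rather than silently cropping it.
#include <cstddef>
#include <cstdint>
#include <limits>
#include <stdexcept>

std::size_t CheckedObjectLength(std::uint64_t onDiskLength)
{
   // On a 32-bit build size_t tops out just below 4 GB, so a larger
   // object must fail loudly here instead of being truncated.
   if (onDiskLength > std::numeric_limits<std::size_t>::max())
      throw std::length_error("serialized object too large for this platform");
   return static_cast<std::size_t>(onDiskLength);
}
```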

pcanal (Member) commented Aug 14, 2025

> do we have an indication that the optimization of initializing the buffer size to the average buffer size seen so far in the file is actually useful?

This is hard to measure for sure, as it is of course very dependent on the actual workload. When this was introduced, it was in direct reaction to issues with not only memory fragmentation (growth of the process's virtual size due to the inability to reuse memory blocks that are just a tad too small) but also thread scaling (by reducing the number of memory allocations, which in most cases require the system to take a global lock).

pcanal (Member) commented Aug 14, 2025

> On a 32-bit platform, for example, there will never be a buffer > 4 GB.

On the other hand, we also need to make sure we properly error out when there is a request for one ...

pcanal (Member) commented Aug 14, 2025

> What if a TTree in the future contains a big entry over 4 GB? ... Or is it going to be emulated as several buffers one after the other?

That is the current plan.

@@ -170,8 +170,8 @@ class TObject {
    virtual void SetDrawOption(Option_t *option=""); // *MENU*
    virtual void SetUniqueID(UInt_t uid);
    virtual void UseCurrentStyle();
-   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Int_t bufsize = 0);
-   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Int_t bufsize = 0) const;
+   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Long64_t bufsize = 0);
Member

That might be necessary, but it is a serious problem. This function is overridden a lot, both in our code and very possibly in user code. Unless those users have upgraded their code to use the override keyword (which is unlikely in my opinion), their code will compile correctly but do the wrong thing (revert to the default behavior rather than their customization ...).
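
A minimal illustration of that hazard (class names are hypothetical; the signatures mirror the diff above):

```cpp
using Int_t = int;
using Long64_t = long long;

struct TObjectLike { // stand-in for TObject after the signature change
   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Long64_t bufsize = 0)
   { return 0; } // default behavior
   virtual ~TObjectLike() = default;
};

struct MyObject : TObjectLike { // user code written against the OLD signature
   // No `override` keyword: this no longer overrides the base function,
   // it merely hides it -- and it still compiles without error.
   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Int_t bufsize = 0)
   { return 1; } // customization, silently unreachable via a base pointer
};
```

A call through a TObjectLike* now resolves to the base default rather than the user's customization, with no diagnostic unless a warning such as -Woverloaded-virtual is enabled.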

ferdymercury (Collaborator, Author) commented Aug 14, 2025

What if we implement a dummy

Int_t Write(const char *name, Int_t option, Int_t bufsize) const final;

to trigger a compilation error, or at least a warning, and avoid that silent wrong behavior?
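
A sketch of how the dummy overload could catch that at compile time, continuing the hypothetical names from the example above (the const qualifier from the suggestion is dropped here for brevity):

```cpp
using Int_t = int;
using Long64_t = long long;

struct TObjectLike { // stand-in for TObject
   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Long64_t bufsize = 0)
   { return 0; }
   // Dummy overload with the old Int_t signature, declared final: it
   // forwards to the new overload, and any derived class still declaring
   // the old-style virtual now tries to override a final function.
   virtual Int_t Write(const char *name, Int_t option, Int_t bufsize) final
   { return Write(name, option, static_cast<Long64_t>(bufsize)); }
   virtual ~TObjectLike() = default;
};

#if 0 // enabling this fails to compile, which is exactly the point:
struct OldStyleUser : TObjectLike {
   // error: virtual function 'Write' overrides a 'final' function
   virtual Int_t Write(const char *name = nullptr, Int_t option = 0, Int_t bufsize = 0)
   { return 1; }
};
#endif
```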

Member

That would indeed provoke a compilation error ....

Successfully merging this pull request may close these issues.

TBuffer* classes should abort in case the 1GB limit is being hit