perf(decode::tree): preallocate Vec based on worst-case length #2462
datdenkikniet wants to merge 1 commit into GitoxideLabs:main
Conversation
We can make a (terrible) guess at how many elements our tree contains. This lets us allocate at least as many entries as we will need, allowing the function to never re-allocate.
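To see why such a guess is safe (a minimal sketch for illustration — max_possible_entries is a hypothetical helper, not the PR's code): every raw tree entry consists of the mode, a space, the name, a NUL, and a 20-byte SHA-1 hash, so the space, NUL, and hash alone occupy 22 bytes, and mode and name only add to that. Dividing the buffer length by 22 therefore never undercounts the entries.

```rust
/// Upper bound on how many entries a raw tree buffer can contain.
/// (Hypothetical helper for illustration, not the PR's function.)
fn max_possible_entries(raw_tree: &[u8]) -> usize {
    const HASH_LEN: usize = 20; // SHA-1; a SHA-256 repo would use 32
    // space + trailing NUL + hash = 22 bytes minimum; mode and name only add.
    raw_tree.len() / (2 + HASH_LEN)
}

fn main() {
    // "100644 a\0" (9 bytes) + 20 hash bytes = 29 bytes per real entry.
    let entry = [b"100644 a\0".as_slice(), &[0u8; 20]].concat();
    let buf = entry.repeat(4); // a buffer holding exactly 4 entries
    assert!(max_possible_entries(&buf) >= 4); // 116 / 22 = 5, never undercounts
}
```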
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 888feb71c7
    const HASH_LEN_FIXME: usize = 20;
    let lower_bound_single_entry = 2 + HASH_LEN_FIXME; // 2 = space + trailing zero
    let upper_bound_entries = i.len() / lower_bound_single_entry;
    let mut out = Vec::with_capacity(upper_bound_entries);
Avoid preallocating from unvalidated tree byte length
tree() now calls Vec::with_capacity(i.len() / 22) before validating a single entry, so malformed or attacker-controlled tree payloads can force a large allocation and OOM even though parsing immediately returns an error. Previously allocations only grew with successfully decoded entries, so this commit introduces a memory-amplification path for invalid inputs (e.g., corrupted objects received from untrusted repos).
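One common way to address this kind of concern (a sketch of a general mitigation, not something the PR or gitoxide implements — the helper and cap value are illustrative assumptions): clamp the speculative bound to a fixed ceiling, so untrusted input can only force a bounded up-front allocation while well-formed trees below the ceiling still avoid reallocation entirely.

```rust
/// Cap the speculative preallocation so a corrupt or hostile buffer can only
/// force a bounded up-front allocation.
/// NOTE: this helper and the 64 Ki cap are illustrative assumptions, not code
/// from the PR or from gitoxide.
const MAX_SPECULATIVE_ENTRIES: usize = 64 * 1024;

fn preallocate<T>(upper_bound_entries: usize) -> Vec<T> {
    Vec::with_capacity(upper_bound_entries.min(MAX_SPECULATIVE_ENTRIES))
}

fn main() {
    // A malformed 1 GiB buffer would claim ~48.8 million entries; the cap
    // keeps the actual reservation at 64 Ki entries.
    let v: Vec<u8> = preallocate((1usize << 30) / 22);
    assert!(v.capacity() >= MAX_SPECULATIVE_ENTRIES);
}
```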
A micro-optimization that is likely only really beneficial for the specific benchmark that it affects (TreeIter). Growing a Vec takes amortized constant time per push, but because the cost of decoding each entry is so low (on the order of nanoseconds), the reallocations account for a meaningful share of the total work, and avoiding them outright provides a pretty serious speedup (see the illustrative comparison below).
I have no clue what the project's stance on over-allocation is: if it should be avoided, and/or if this specific optimization is unnecessary, let's close this PR (but remember that it exists).
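For intuition on the amortized-cost argument (an illustrative micro-comparison, not the PR's benchmark):

```rust
use std::time::Instant;

fn main() {
    let n = 10_000_000;

    // Growing organically: pushes are amortized O(1), but every capacity
    // doubling reallocates and copies the whole buffer.
    let t = Instant::now();
    let mut grown = Vec::new();
    for i in 0..n {
        grown.push(i);
    }
    let grow_time = t.elapsed();

    // Preallocated: no reallocation ever happens; pushes are pure writes.
    let t = Instant::now();
    let mut reserved = Vec::with_capacity(n);
    for i in 0..n {
        reserved.push(i);
    }
    let reserve_time = t.elapsed();

    println!("grow: {grow_time:?}, preallocate: {reserve_time:?}");
    assert_eq!(grown, reserved);
}
```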
Benchmark (cargo bench --bench decode-objects -- TreeRef, diff against main):