[tstate] inline byte to string conversions #174

hexfusion · 2023-05-15T15:17:05Z

I explored a few alternative approaches and found that this PR provides small gains that can be considered an incremental performance improvement. During research I found that using a 64-bit hash such as xxhash64 for the map key was a large performance improvement; this is the approach taken by, for example, fastcache [1]. But since we also need to handle hash collisions, the added complexity reduced performance until it was not worth the effort at small scale.

The inline conversion used in this PR is documented here: https://pthevenet.com/posts/programming/go/bytesliceindexedmaps/#optimization. Local results from the benchmark confirm it:

goos: linux
goarch: amd64
pkg: fastcache
cpu: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz
BenchmarkArrayKeyed-12              	18654550	        64.44 ns/op	       0 B/op	       0 allocs/op
BenchmarkOptimizedStringKeyed-12    	14765403	        86.89 ns/op	      16 B/op	       1 allocs/op
BenchmarkStringKeyed-12             	10360284	       125.4 ns/op	      48 B/op	       3 allocs/op
BenchmarkHexKeyed-12                	 6014667	       206.9 ns/op	      96 B/op	       4 allocs/op
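
For readers unfamiliar with the optimization, here is a minimal sketch of what "inline" means. The function names are hypothetical and are not taken from hypersdk. The Go compiler special-cases a map read of the form `m[string(b)]` and does not allocate a temporary string for the key. Converting to a variable first gives up that special case.

```go
package main

// lookupInline indexes the map with an inline string(k) conversion.
// The compiler recognizes this form for map reads and does not
// allocate a temporary string for the key.
func lookupInline(m map[string]int, k []byte) (int, bool) {
	v, ok := m[string(k)]
	return v, ok
}

// lookupViaVariable converts first and indexes afterwards. The conversion
// is no longer part of the index expression, so the key bytes are copied
// and the copy may be heap-allocated for larger keys.
func lookupViaVariable(m map[string]int, k []byte) (int, bool) {
	s := string(k)
	v, ok := m[s]
	return v, ok
}

// store shows that writes still need a real string, because the map
// retains the key after the call returns.
func store(m map[string]int, k []byte, v int) {
	m[string(k)] = v
}
```

Both lookup functions return the same result; they differ only in allocation behavior, which is what the `allocs/op` column above measures.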

before

goos: linux
goarch: amd64
pkg: github.com/ava-labs/hypersdk/tstate
cpu: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz
BenchmarkFetchAndSetScope
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys-12                   885832              1440 ns/op             640 B/op          7 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys-12                   403532              3230 ns/op            1537 B/op         12 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys-12                  356056              5458 ns/op            3439 B/op         21 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys-12                  173084              6636 ns/op            7181 B/op         39 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys-12                   69446             19742 ns/op           15388 B/op         74 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys-12                  48106             44673 ns/op           31211 B/op        140 allocs/op

after

goos: linux
goarch: amd64
pkg: github.com/ava-labs/hypersdk/tstate
cpu: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz
BenchmarkFetchAndSetScope
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys-12                   851434              1400 ns/op             800 B/op          7 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys-12                   516002              2178 ns/op            1825 B/op         12 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys-12                  295729              3874 ns/op            3983 B/op         21 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys-12                  160959              7256 ns/op            8235 B/op         39 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys-12                   73342             14859 ns/op           17470 B/op         74 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys-12                  43680             27489 ns/op           35332 B/op        140 allocs/op

benchvset-after.cpu.txt

[1] https://github.com/VictoriaMetrics/fastcache
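
For reference, the hashed-key alternative that was explored and set aside can be sketched roughly as follows. This is an illustration only: `hashedMap`, `entry`, and `hashKey` are hypothetical names, and FNV-1a from the standard library stands in for xxhash64 so the sketch has no external dependencies. It is not how fastcache [1] or hypersdk implement their maps.

```go
package main

import (
	"bytes"
	"hash/fnv"
)

// entry keeps the full key next to the value so that two keys sharing
// a 64-bit hash can still be told apart.
type entry struct {
	key   []byte
	value []byte
}

// hashedMap is keyed by a 64-bit hash; each bucket holds every entry
// whose key hashes to that value.
type hashedMap struct {
	buckets map[uint64][]entry
}

func newHashedMap() *hashedMap {
	return &hashedMap{buckets: make(map[uint64][]entry)}
}

// hashKey uses FNV-1a as a stand-in for xxhash64 (assumption of this sketch).
func hashKey(k []byte) uint64 {
	h := fnv.New64a()
	h.Write(k) // Write on a hash.Hash never returns an error
	return h.Sum64()
}

// get compares full keys inside the bucket; this is the collision
// handling that adds complexity on top of the plain hash lookup.
func (m *hashedMap) get(k []byte) ([]byte, bool) {
	for _, e := range m.buckets[hashKey(k)] {
		if bytes.Equal(e.key, k) {
			return e.value, true
		}
	}
	return nil, false
}

// set overwrites an existing entry or appends a new one to the bucket.
func (m *hashedMap) set(k, v []byte) {
	h := hashKey(k)
	bucket := m.buckets[h]
	for i := range bucket {
		if bytes.Equal(bucket[i].key, k) {
			bucket[i].value = v
			return
		}
	}
	// Copy the key so later mutation by the caller cannot corrupt the map.
	m.buckets[h] = append(bucket, entry{key: append([]byte(nil), k...), value: v})
}
```

The extra bucket slice and the `bytes.Equal` check on every access are the kind of overhead that made this approach not worth the effort at small scale.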

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion · 2023-05-17T12:58:21Z

tstate/tstate.go

 // checkScope returns whether [k] is in ts.readScope.
 func (ts *TState) checkScope(_ context.Context, k []byte) bool {
-	for _, s := range ts.scope {
-		// TODO: benchmark and see if creating map is worth overhead


This very much depends on the size of the set. Since lookup is O(n), it's reasonable to use a map if we might have 8+ keys. I added the change for reference, but we can revert the commit if we feel the set size will remain similar to tokenvm (4).

Although lookup is much faster with a map, it increases allocations in FetchAndSetScope by a fair amount. As this set grows into possibly hundreds of keys, the time gained on lookup will probably be overshadowed by memory bloat and CPU time.

before

goos: linux
goarch: amd64
pkg: github.com/ava-labs/hypersdk/tstate
cpu: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys_with_length_65-12                   894080              1408 ns/op             800 B/op          7 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys_with_length_65-12                   687463              2001 ns/op            1473 B/op         11 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys_with_length_65-12                  372249              3031 ns/op            2833 B/op         19 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys_with_length_65-12                  243076              5262 ns/op            5424 B/op         35 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys_with_length_65-12                  110935             14345 ns/op           11416 B/op         68 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys_with_length_65-12                  42150             26776 ns/op           22680 B/op        132 allocs/op

after

goos: linux
goarch: amd64
pkg: github.com/ava-labs/hypersdk/tstate
cpu: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz
BenchmarkFetchAndSetScope/fetch_and_set_scope_4_keys_with_length_65-12                   716866              1802 ns/op            1392 B/op         14 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_8_keys_with_length_65-12                   404772              2923 ns/op            2529 B/op         22 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_16_keys_with_length_65-12                  282474              4318 ns/op            4824 B/op         38 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_32_keys_with_length_65-12                  157590              7921 ns/op            9284 B/op         70 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_64_keys_with_length_65-12                   83341             15155 ns/op           19376 B/op        136 allocs/op
BenchmarkFetchAndSetScope/fetch_and_set_scope_128_keys_with_length_65-12                  42186             40291 ns/op           38448 B/op        264 allocs/op

Reverting for now.

You can see the cost of the linear lookup grow as the set size increases.

BenchmarkCheckScope/set_scope_4_keys_with_length_65-12                 55952859                21.66 ns/op            0 B/op          0 allocs/op
BenchmarkCheckScope/set_scope_8_keys_with_length_65-12                 30695019                39.45 ns/op            0 B/op          0 allocs/op
BenchmarkCheckScope/set_scope_16_keys_with_length_65-12                14245750                84.93 ns/op            0 B/op          0 allocs/op
BenchmarkCheckScope/set_scope_32_keys_with_length_65-12                 7016282               160.0 ns/op             0 B/op          0 allocs/op
BenchmarkCheckScope/set_scope_64_keys_with_length_65-12                 3868280               286.7 ns/op             0 B/op          0 allocs/op
BenchmarkCheckScope/set_scope_128_keys_with_length_65-12                2166471               547.1 ns/op             0 B/op          0 allocs/op
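
The trade-off discussed in this comment can be sketched as follows. The function names are hypothetical and the code is a simplified stand-in, not the actual `checkScope` or `FetchAndSetScope` implementation: a slice scan costs nothing to set up but grows linearly with the set, while a map-backed set answers lookups in roughly constant time at the price of building the map and copying each key into a string.

```go
package main

import "bytes"

// inScopeSlice is the linear scan: no setup cost, O(n) per lookup.
func inScopeSlice(scope [][]byte, k []byte) bool {
	for _, s := range scope {
		if bytes.Equal(s, k) {
			return true
		}
	}
	return false
}

// buildScopeSet pays the up-front cost: one map allocation plus one
// string copy per key. This is where the extra allocs/op come from.
func buildScopeSet(scope [][]byte) map[string]struct{} {
	set := make(map[string]struct{}, len(scope))
	for _, s := range scope {
		set[string(s)] = struct{}{}
	}
	return set
}

// inScopeSet looks up with an inline string(k) conversion, which the
// compiler handles without allocating a temporary string.
func inScopeSet(set map[string]struct{}, k []byte) bool {
	_, ok := set[string(k)]
	return ok
}
```

Whether the map pays off depends on how many lookups follow each build: with a scope of about 4 keys, as in tokenvm, the scan is already cheap and the setup cost dominates.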

Signed-off-by: Sam Batschelet <[email protected]>

patrick-ogrady · 2023-05-19T01:51:36Z

We should do the same for: https://github.com/ava-labs/hypersdk/blob/main/chain/processor.go

Signed-off-by: Sam Batschelet <[email protected]>

github-actions · 2023-07-26T00:11:45Z

This PR has become stale because it has been open for 30 days with no activity. Adding the lifecycle/frozen label will exempt this PR from future lifecycle events.

hexfusion added 2 commits May 14, 2023 11:57

add bench

81d0cb9

Signed-off-by: Sam Batschelet <[email protected]>

Inline all byte to string conversions

3002fd4

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion temporarily deployed to long-ci May 15, 2023 15:17 — with GitHub Actions Inactive

hexfusion changed the title ~~Hexfusion/optimized string~~ Inline byte to string conversions May 15, 2023

hexfusion temporarily deployed to long-ci May 15, 2023 15:43 — with GitHub Actions Inactive

hexfusion force-pushed the hexfusion/optimized-string branch from fe7f852 to 3002fd4 Compare May 15, 2023 15:53

hexfusion temporarily deployed to long-ci May 15, 2023 15:53 — with GitHub Actions Inactive

nits

06ebd7f

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion temporarily deployed to long-ci May 15, 2023 15:54 — with GitHub Actions Inactive

hexfusion self-assigned this May 15, 2023

reduce allocations

e5fe915

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion temporarily deployed to long-ci May 16, 2023 00:59 — with GitHub Actions Inactive

hexfusion temporarily deployed to long-ci May 17, 2023 12:51 — with GitHub Actions Inactive

hexfusion marked this pull request as ready for review May 17, 2023 12:56

hexfusion requested a review from patrick-ogrady as a code owner May 17, 2023 12:56

hexfusion commented May 17, 2023


hexfusion force-pushed the hexfusion/optimized-string branch from 8f89b1a to f687764 Compare May 17, 2023 16:07

hexfusion temporarily deployed to long-ci May 17, 2023 16:07 — with GitHub Actions Inactive

hexfusion added 2 commits May 17, 2023 12:11

Fix typos

487c2b0

Signed-off-by: Sam Batschelet <[email protected]>

Add checkScope bench

a7628fc

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion temporarily deployed to long-ci May 17, 2023 17:36 — with GitHub Actions Inactive

patrick-ogrady changed the title ~~Inline byte to string conversions~~ [tstate] inline byte to string conversions May 19, 2023

[processor] inline all byte to string conversions

0c7bef5

Signed-off-by: Sam Batschelet <[email protected]>

hexfusion temporarily deployed to long-ci May 19, 2023 02:01 — with GitHub Actions Inactive

github-actions bot added the lifecycle/stale label Jul 26, 2023
