Add wyhash2 and metro64/128 modern hashing #2293

NotsoanoNimus · 2025-07-11T21:21:47Z

Updating the C3 hash kit with some more modern hashes that are performant, random, and work reliably according to their known test vectors.

For this work, I'd like to give a special thanks to Dr. Timofey Prodanov for this article and the related benchmarks spreadsheet. Both of these items are important, interesting, and relevant for this change.

Typical benchmark results:

---------------------- BENCHMARKS -----------------------
Benchmarking non_crypto_benchmarks::fnv64a_1 ....... [COMPLETE] 12.88 ns, 25.75 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_1 ....... [COMPLETE] 10.29 ns, 20.57 CPU's clocks
Benchmarking non_crypto_benchmarks::wyhash2_1 ...... [COMPLETE] 12.70 ns, 25.40 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_1 ...... [COMPLETE] 86.14 ns, 172.27 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_1 ..... [COMPLETE] 100.94 ns, 201.86 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_1 ....... [COMPLETE] 13.87 ns, 27.74 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_1 ......... [COMPLETE] 22.43 ns, 44.84 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_4 ....... [COMPLETE] 29.84 ns, 59.66 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_4 ....... [COMPLETE] 26.93 ns, 53.84 CPU's clocks
Benchmarking non_crypto_benchmarks::wyhash2_4 ...... [COMPLETE] 13.33 ns, 26.64 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_4 ...... [COMPLETE] 55.46 ns, 110.90 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_4 ..... [COMPLETE] 65.28 ns, 130.54 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_4 ....... [COMPLETE] 15.75 ns, 31.50 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_4 ......... [COMPLETE] 26.86 ns, 53.70 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_8 ....... [COMPLETE] 54.15 ns, 108.29 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_8 ....... [COMPLETE] 47.62 ns, 95.22 CPU's clocks
Benchmarking non_crypto_benchmarks::wyhash2_8 ...... [COMPLETE] 11.48 ns, 22.97 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_8 ...... [COMPLETE] 55.05 ns, 110.08 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_8 ..... [COMPLETE] 65.42 ns, 130.82 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_8 ....... [COMPLETE] 15.82 ns, 31.63 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_8 ......... [COMPLETE] 31.26 ns, 62.50 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_16 ...... [COMPLETE] 102.64 ns, 205.26 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_16 ...... [COMPLETE] 89.49 ns, 178.97 CPU's clocks
Benchmarking non_crypto_benchmarks::wyhash2_16 ..... [COMPLETE] 11.16 ns, 22.31 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_16 ..... [COMPLETE] 59.85 ns, 119.67 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_16 .... [COMPLETE] 96.86 ns, 193.69 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_16 ...... [COMPLETE] 35.02 ns, 70.02 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_16 ........ [COMPLETE] 46.51 ns, 91.35 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_32 ...... [COMPLETE] 216.51 ns, 433.00 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_32 ...... [COMPLETE] 175.75 ns, 351.48 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_32 ..... [COMPLETE] 69.13 ns, 138.23 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_32 .... [COMPLETE] 75.88 ns, 151.74 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_32 ...... [COMPLETE] 19.27 ns, 38.54 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_32 ........ [COMPLETE] 34.71 ns, 69.39 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_64 ...... [COMPLETE] 395.49 ns, 790.95 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_64 ...... [COMPLETE] 341.06 ns, 682.08 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_64 ..... [COMPLETE] 83.42 ns, 166.83 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_64 .... [COMPLETE] 92.86 ns, 185.70 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_64 ...... [COMPLETE] 30.62 ns, 61.24 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_64 ........ [COMPLETE] 49.37 ns, 98.73 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_128 ..... [COMPLETE] 754.96 ns, 1509.88 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_128 ..... [COMPLETE] 714.43 ns, 1428.83 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_128 .... [COMPLETE] 110.52 ns, 221.02 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_128 ... [COMPLETE] 146.85 ns, 293.68 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_128 ..... [COMPLETE] 54.44 ns, 108.86 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_128 ....... [COMPLETE] 88.81 ns, 177.59 CPU's clocks

Benchmarking non_crypto_benchmarks::fnv64a_1024 .... [COMPLETE] 6638.73 ns, 13277.26 CPU's clocks
Benchmarking non_crypto_benchmarks::fnv32a_1024 .... [COMPLETE] 5644.38 ns, 11288.57 CPU's clocks
Benchmarking non_crypto_benchmarks::metro64_1024 ... [COMPLETE] 446.42 ns, 892.78 CPU's clocks
Benchmarking non_crypto_benchmarks::metro128_1024 .. [COMPLETE] 553.51 ns, 1106.99 CPU's clocks
Benchmarking non_crypto_benchmarks::a5hash_1024 .... [COMPLETE] 390.97 ns, 781.91 CPU's clocks
Benchmarking non_crypto_benchmarks::komi_1024 ...... [COMPLETE] 442.33 ns, 884.62 CPU's clocks

52 benchmarks run.

@lerno Given the benchmarking results when run and Dr. Prodanov's tests, as well as the poor randomness and speed of FNV hashing, I would like to propose a change - in a different pull request - for how types are hashed by default in stdlib. ~~If we use wyhash2 for types up to and including 16 bytes in size and metro64 for vectors, arrays, etc. beyond that,~~ If we reevaluate which hashes in this PR are best at particular input sizes, based on speed and randomness, C3 can have much faster and more random hashing performance (which would hopefully speed a lot of things up)!

The wyhash2 speed changes are DISGUSTING - so are Metro's at a higher data size!

Anyway, just a thought. Let me know what you think.

… hash functions

benchmarks/stdlib/hash/non_crypto_shootout.c3

data-man · 2025-07-12T16:28:19Z

modern hashes

Just for info:
https://github.com/avaneev/a5hash
https://github.com/avaneev/komihash
https://github.com/Nicoshev/rapidhash

NotsoanoNimus · 2025-07-13T20:36:11Z

modern hashes

Just for info: https://github.com/avaneev/a5hash https://github.com/avaneev/komihash https://github.com/Nicoshev/rapidhash

Now that's what I call modern!

Spent some time today and yesterday implementing a5hash and komihash, so there are even more options to choose from (sorry to the reviewers 😄).

lib/std/hash/komi.c3

…tervals

…ests

NotsoanoNimus · 2025-07-15T12:25:17Z

That about wraps up the initial code honing and few passes.

I won't add on top of this PR any further now, given no more CI issues or review feedback pop up. Looks ready to go!

lerno · 2025-07-18T18:58:51Z

lib/std/hash/metro128.c3

+
+	for (; data.len >= 32; data = data[32:^32])
+	{
+		self.state[0] += ((ulong*)data.ptr)[0] * K[0]; self.state[0] = self.state[0].rotr(29) + self.state[2];


This is unaligned, so you need @unaligned_load

lerno · 2025-07-18T19:00:03Z

lib/std/hash/metro128.c3

+
+	if (final_data.len >= 16)
+	{
+		self.state[0] += ((ulong*)final_data.ptr)[0] * K[2]; self.state[0] = self.state[0].rotr(33) * K[3];


Not sure, but maybe this needs unaligned_load

lerno · 2025-07-18T19:00:29Z

lib/std/hash/metro128.c3

+
+	if (final_data.len >= 8)
+	{
+		self.state[0] += ((ulong*)final_data.ptr)[0] * K[2]; self.state[0] = self.state[0].rotr(33) * K[3];


unaligned_load again..

lerno · 2025-07-18T19:01:32Z

I found even more examples of unaligned load, so try to look through and fix those.

lerno · 2025-07-18T23:45:45Z

A good way to test this is to pass in data that is unaligned, for example:

char[] data_aligned = "XThe data";
char[] data = data_aligned[1..]; // This is pretty sure to be unaligned.

# Conflicts: # releasenotes.md

lerno · 2025-07-19T22:05:34Z

Please check the benchmarks now that things actually use unaligned access properly! I think I fixed all the bugs, so it's time to merge this!

NotsoanoNimus added 2 commits July 11, 2025 21:02

add wyhash2, metro64, and metro128 hashes; best performing non-crypto…

24e4c34

… hash functions

add some extra benchmarking stops

5befb86

Book-reader reviewed Jul 12, 2025

View reviewed changes

benchmarks/stdlib/hash/non_crypto_shootout.c3 Show resolved Hide resolved

change benchmark constants to a much better init syntax

4886a46

NotsoanoNimus added 9 commits July 13, 2025 16:31

add superfast 64-bit a5hash; not streamed, no 128-bit impl

c8ca8e6

change a5hash macro visibility to local

3ceb115

a5hash code -> tabs

d7fa5d9

fix a few silly logic errors in wyhash and metro64/128

314c657

remove ternary expressions, per Lerno's suggestion

cfbf898

add komihash and associated tests/benchmarks

35bfc06

update version/license inclusions w/ Yi's and Vaneev's hash types

6d4adaf

final pedantic updates

a204b8e

spaces -> tabs (again)

aeff076

NotsoanoNimus added 2 commits July 13, 2025 16:41

Merge branch 'master' into stdlib-metrohash

dafcba3

Merge branch 'master' into stdlib-metrohash

9ed1e7c

lerno reviewed Jul 14, 2025

View reviewed changes

lib/std/hash/komi.c3 Outdated Show resolved Hide resolved

NotsoanoNimus added 3 commits July 15, 2025 02:05

ensure expected bulk result lengths are always equal to the set of in…

10af2d1

…tervals

attempt to fix bad logic in komihash: stop going out of bounds; bad dog

a554d99

more iterative improvements; add missing sweep "branch exploration" t…

1812092

…ests

lerno reviewed Jul 18, 2025

View reviewed changes

lerno added 2 commits July 19, 2025 23:28

Merge branch 'master' into fork/NotsoanoNimus/stdlib-metrohash

f99287e

# Conflicts: # releasenotes.md

Fix alignment issues.

caebdef

lerno merged commit ed92476 into c3lang:master Jul 19, 2025
41 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add wyhash2 and metro64/128 modern hashing #2293

Add wyhash2 and metro64/128 modern hashing #2293

Uh oh!

NotsoanoNimus commented Jul 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

data-man commented Jul 12, 2025

Uh oh!

NotsoanoNimus commented Jul 13, 2025

Uh oh!

Uh oh!

NotsoanoNimus commented Jul 15, 2025

Uh oh!

lerno Jul 18, 2025

Uh oh!

lerno Jul 18, 2025

Uh oh!

lerno Jul 18, 2025

Uh oh!

lerno commented Jul 18, 2025

Uh oh!

lerno commented Jul 18, 2025

Uh oh!

lerno commented Jul 19, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add wyhash2 and metro64/128 modern hashing #2293

Add wyhash2 and metro64/128 modern hashing #2293

Uh oh!

Conversation

NotsoanoNimus commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

data-man commented Jul 12, 2025

Uh oh!

NotsoanoNimus commented Jul 13, 2025

Uh oh!

Uh oh!

NotsoanoNimus commented Jul 15, 2025

Uh oh!

lerno Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

lerno Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

lerno Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

lerno commented Jul 18, 2025

Uh oh!

lerno commented Jul 18, 2025

Uh oh!

lerno commented Jul 19, 2025

Uh oh!

Uh oh!

Uh oh!

NotsoanoNimus commented Jul 11, 2025 •

edited

Loading