perf: increase interpolator3 speed and remove large minestom generator allocations #512

Bloeckchengrafik · 2025-06-29T21:31:34Z

Pull Request

Description

Makes Interpolator3, a previous hotspot, run faster.

Changelog

Checklist

Mandatory checks

The base branch of this PR is an unreleased version branch (that has a ver/ prefix)
or is a branch that is intended to be merged into a version branch.
There are no already existing PRs that provide the same changes.
The PR is within the scope of Terra (i.e. is something a configurable terrain generator should be doing).
Changes follow the code style for this project.
I have read the CONTRIBUTING.md
document in the root of the git repository.

Types of changes

Compatibility

Introduces a breaking change
Introduces new functionality in a backwards compatible way.
Introduces bug fixes

Documentation

My change requires a change to the documentation.
I have updated the documentation accordingly.

Testing

I have added tests to cover my changes.
All new and existing tests passed.

(Do benchmarks count here?)

Licensing

[x ] I am the original author of this code, and I am willing to
release it under GPLv3.
I am not the original author of this code, but it is in public domain or
released under GPLv3 or a compatible license.

solonovamax · 2025-06-30T22:26:37Z

platforms/minestom/src/main/java/com/dfsek/terra/minestom/chunk/GeneratedChunkCache.java

+        return cache.get(pack(x, z));
+    }
+
+    private long pack(final int x, final int z) {


I believe we already have a function somewhere which packs two integers into a long

I think this is from like the 3.0 days, I wrote it.

solonovamax · 2025-06-30T22:31:47Z

platforms/minestom/src/main/java/com/dfsek/terra/minestom/chunk/GeneratedChunkCache.java

            .maximumSize(128)
            .recordStats()
-            .build((Pair<Integer, Integer> key) -> generateChunk(key.getLeft(), key.getRight()));
+            .build((Long key) -> generateChunk(unpackX(key), unpackZ(key)));


I wonder if it would be worthwhile to have our own kind of cache which uses is backed by a fastutil Long2ObjectMaps and avoids boxing the primitives.

I have something in the works for this for layered, out of scope for this PR but good idea

...n/java/com/dfsek/terra/addons/chunkgenerator/generation/math/interpolation/Interpolator.java

solonovamax · 2025-06-30T22:33:04Z

buildSrc/src/main/kotlin/BenchmarkingConfig.kt

+import org.gradle.api.Project
+import org.gradle.kotlin.dsl.apply
+
+fun Project.configureBenchmarking() {
+    apply(plugin = "me.champeau.jmh")
+}


I mentioned it a while ago that it might be good to add some jmh benchmarks, I made a gradle config for that, I forget if it was just identical to this or if I had smth else as well.

solonovamax · 2025-06-30T22:37:48Z

.../dfsek/terra/addons/chunkgenerator/generation/math/interpolation/Interpolator3Benchmark.java

+public class Interpolator3Benchmark {
+    private final Interpolator3 interpolator = new Interpolator3(0, 1, 0, 1, 0, 1, 0, 1);
+
+    @Benchmark
+    public void benchmarkInterpolator3(Blackhole blackhole) {
+        blackhole.consume(interpolator.trilerp(0.5, 0.75, 0.5));
+    }
+}


this benchmark should probably use random values to stop the jvm from

perhaps pre-populate an array with random values and step through that for each invocation of the benchmark to avoid the overhead of the random number generator, since this is rather low level.

which architectures & jvms has this been tested on? low level optimizations like this are extremely finicky.

solonovamax · 2025-06-30T22:38:52Z

.../dfsek/terra/addons/chunkgenerator/generation/math/interpolation/Interpolator3Benchmark.java

+@Warmup(iterations = 2, time = 1)
+@Measurement(iterations = 2, time = 5)


probably good to do more iterations for a bit longer than that.

I usually see 1 warmup iteration for 5 seconds & 5 measurement iterations for 5 seconds, for a total of 60 seconds.

solonovamax · 2025-06-30T22:39:35Z

.../java/com/dfsek/terra/addons/chunkgenerator/generation/math/interpolation/Interpolator3.java

+        double b = ArithmeticFunctions.fma(2, y, -1);
+        double g = ArithmeticFunctions.fma(2, z, -1);
+
+        // using explicit fma here somehow makes this slower


can you share the benchmarks for this?

in fact, it would be good if you could share all the benchmarks you did for different changes.

+1, that honestly should not be the case as the jvm should just optimize fma into a*b + c on non supported platforms

Bloeckchengrafik added 2 commits June 28, 2025 13:03

feat: increase speed of Interpolator3

913b3e1

feat: reduce allocations in the generated chunk cache

f87edb6

Bloeckchengrafik requested review from dfsek, solonovamax, duplexsystem, astrsh and justaureus as code owners June 29, 2025 21:31

perf: increase interpolator3 speed even more by inlining

0e068e5

solonovamax requested changes Jun 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: increase interpolator3 speed and remove large minestom generator allocations #512

perf: increase interpolator3 speed and remove large minestom generator allocations #512

Uh oh!

Bloeckchengrafik commented Jun 29, 2025

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

solonovamax Jun 30, 2025

Uh oh!

duplexsystem Jul 9, 2025

Uh oh!

Uh oh!

		@Warmup(iterations = 2, time = 1)
		@Measurement(iterations = 2, time = 5)

perf: increase interpolator3 speed and remove large minestom generator allocations #512

Are you sure you want to change the base?

perf: increase interpolator3 speed and remove large minestom generator allocations #512

Uh oh!

Conversation

Bloeckchengrafik commented Jun 29, 2025

Pull Request

Description

Changelog

Checklist

Mandatory checks

Types of changes

Compatibility

Documentation

Testing

Licensing

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!