Skip to content

Adds multi-tier partitioning to randM vector track#1093

Open
ah89 wants to merge 2 commits intoelastic:masterfrom
ah89:improve-random-vector-multi-tenant
Open

Adds multi-tier partitioning to randM vector track#1093
ah89 wants to merge 2 commits intoelastic:masterfrom
ah89:improve-random-vector-multi-tenant

Conversation

@ah89
Copy link

@ah89 ah89 commented Mar 18, 2026

This updates the random_vector track to model a more realistic multi-partition workload with variable-sized small, medium, and large partitions. Documents are assigned to partitions using deterministic weighted sampling, routed by partition_id, and benchmarked with separate search phases for each partition tier so latency and QPS can be measured independently.

The change also renames the public configuration and task surface from tenant to partition, adds tier-specific search operations and conditional challenge phases, and documents the new parameters in the README. It also fixes the index template and query generation needed for the new flow by enabling custom routing for the data stream, requiring routing in mappings, and only sending rescore_vector when oversampling is greater than 0.

Validated with Rally test mode on Elasticsearch 9.3.1, which is the latest installable darwin-aarch64 release available in this environment. The track also passed a sparse configuration with small_partitions:3,medium_partitions:0,large_partitions:0, completing successfully with 0% error rate; empty partition tiers are now skipped instead of causing the benchmark to fail.

Introduces a realistic multi-partition model with small, medium, and
large partition tiers, supporting configurable counts and reproducible
sizing via seeded RNG. Updates benchmarking to separately measure search
performance per partition tier, improving analysis of index filtering
and routing efficiency. Enhances documentation and parameterization for
clarity and reproducible nightly runs.
@ah89 ah89 changed the title Adds multi-tier partitioning to randM vector trk Adds multi-tier partitioning to randM vector track Mar 18, 2026
@ah89 ah89 requested a review from benwtrent March 18, 2026 15:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant