balancer/randomsubsetting: Extend the unit tests in the randomsubsetting package. by marek-szews · Pull Request #8781 · grpc/grpc-go

marek-szews · 2025-12-19T11:46:39Z

Add more unit tests to increase test coverage of 'randomsubsetting' package.

RELEASE NOTES:

balancer/randomsubsetting: Implementation of additional UT.

codecov · 2025-12-19T11:50:24Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.40%. Comparing base (2d51986) to head (345055c).
⚠️ Report is 78 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #8781      +/-   ##
==========================================
- Coverage   83.47%   81.40%   -2.08%     
==========================================
  Files         419      416       -3     
  Lines       32595    33429     +834     
==========================================
+ Hits        27208    27212       +4     
- Misses       4017     4661     +644     
- Partials     1370     1556     +186

see 90 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Pranjali-2501 · 2026-01-23T09:20:00Z

balancer/randomsubsetting/randomsubsetting_ext_test.go

+	_ "google.golang.org/grpc/balancer/roundrobin" // For round_robin LB policy in tests
+)
+
+func (s) TestSubsettingEndpointsDomain(t *testing.T) {


This test is similar to the existing test TestCalculateSubset_Simple.
You can remove this and add more test cases in TestCalculateSubset_Simple if you think of any.

+1

Yes, the only possible case that could be added to TestCalculateSubset_Simple is the case where the number of endpoints is strictly greater than the subset size.

Indeed it is only one test case missing, i had renamed function and added to test set.

Pranjali-2501 · 2026-01-23T09:20:55Z

balancer/randomsubsetting/randomsubsetting_ext_test.go

+	}
+}
+
+func (s) TestUniformDistributionOfEndpoints(t *testing.T) {


Add a descriptive comment for this test.

Move this test to the existing test file randomsubsetting_test.go.

+1 to both the above comments. Please add comments about why the math is the way it is in the test.

Done. Appropriate commentary added.

Pranjali-2501 · 2026-01-23T09:44:41Z

balancer/randomsubsetting/randomsubsetting_ext_test.go

+	}
+}
+
+func (s) TestUniformDistributionOfEndpoints(t *testing.T) {


Can you run the test ~10k times on your local to verify that the test is not flaky.

How often does this flake currently? We do not want to add tests that can flake. We want a test that is guaranteed to pass every time that it runs, and when it fails, it should really mean that there is a bug in the code.

I have added two examples of tests for which the uniform distribution verification is negative.

Pranjali-2501 · 2026-01-23T10:01:12Z

balancer/randomsubsetting/randomsubsetting_ext_test.go

+
+	endpoints := makeEndpoints(16)
+	expected := iteration / len(endpoints) * subsetSize
+	diff = expected / 7 // allow ~14% difference


Can you explain the reasoning behind 14% error-rate.

Can you look at this and modify the test to calculate how many iterations are needed for statistical significance, instead of arbitrarily adding huge error rate to satisfy small range of iterations.

I carefully analyzed the test acceptance criteria and came to the conclusion that it was too naive. Finally I use the Chi-square goodness-of-fit test, standard statistical method used to validate whether a dataset follows a uniform distribution. It assesses if the observed frequencies in your data align with the expected frequencies of a theoretical distribution where every outcome has an equal chance of occurring.

marek-szews

All comments were addressed. Additionally, each test of distribution concludes with a short report.
=== NAME Test/UniformDistributionOfEndpoints
randomsubsetting_test.go:404: Test Case: Endpoints=16, SubsetSize=4, Iterations=10
randomsubsetting_test.go:409: Distribution check passed:
Endpoint | ExpValue | Diff from E | Status
---------------------------------------------------------------------------
endpoint-5 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-9 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-12 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-13 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-0 | 2.50 | 2.50 | ! > E ± σ (Noticeable)
endpoint-4 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-10 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-11 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-1 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-6 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-2 | 2.50 | 2.50 | ! > E ± σ (Noticeable)
endpoint-7 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-8 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-14 | 2.50 | 0.50 | E ± σ (Normal)
endpoint-15 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
endpoint-3 | 2.50 | 1.50 | ! > E ± σ (Noticeable)
---------------------------------------------------------------------------
Distribution is uniform (χ²=12.00 <= critical value=24.99)

emil10001 · 2026-02-06T17:18:41Z

/gemini review

gemini-code-assist

Code Review

This pull request adds valuable unit tests for the randomsubsetting balancer, significantly increasing test coverage. The new tests include simple subset size validation and a statistical test for uniform endpoint distribution. My review focuses on improving the implementation of these tests for better clarity, correctness, and adherence to idiomatic Go practices. The suggestions include simplifying comparison logic, correcting the use of t.Run in a statistical test loop, refactoring complex boolean logic, and using idiomatic function signatures for maps.

balancer/randomsubsetting/randomsubsetting_test.go

marek-szews

All comments left by Gemini Code Assist have been resolved. Test execution time has been reduced from 3 seconds to 0.1 seconds. The test output has also become much clearer.

marek-szews

Please check the current shape of implementation.

marek-szews

Could you verify my rework

Pranjali-2501 · 2026-02-24T14:23:05Z

Apologies for the delay, I'll take a look at it today.

Pranjali-2501 · 2026-02-25T16:29:16Z

balancer/randomsubsetting/randomsubsetting_test.go

 	}
 }
+
+func (s) TestSubsettingEndpointsSimply(t *testing.T) {


Test TestSubsettingEndpointsSimply and existing test TestCalculateSubset_Simple are doing the same think.

As mentioned here we can get rid of TestSubsettingEndpointsSimply completely and added a testcase in TestCalculateSubset_Simple in which the endpoints are strictly greater than subset size.

Something like this

func (s) TestCalculateSubset_Simple(t *testing.T) { tests := []struct { name string endpoints []resolver.Endpoint subsetSize uint32 want []resolver.Endpoint }{ // existing code { name: "SubsetSizeLessThanNumberOfEndpoints", endpoints: makeEndpoints(15), subsetSize: 5, want: makeEndpoints(5), }, } // existing code }

Not exactly like that, function makeEndpoints(15) will produce the series of endpoints with a arithmetic order {endpoint_0, endpoint_1, ..., endpoint_14}. Same makeEndpoints(5) give us {endpoint_0, endpoint_1,.., endpoint_4}. But the result of LB.calculateSubset(eps) is unpredictable. One time we can received subset {endpoint_4, endpoint_5, endpoint_8, endpoint_11, endpoint_14} another time completely different. Validation of members of subset 99% will failed. I have left those two tests separated, because the first of them, check the content of set (boundary condition is meet so method always return the unchanged set - with a origin order) while the second test due to randomisation of content of subset, validate only the cardinality of the set (number of elements).

Pranjali-2501 · 2026-02-25T16:29:23Z

balancer/randomsubsetting/randomsubsetting_test.go

+}
+
+func (s) TestUniformDistributionOfEndpoints(t *testing.T) {
+


Remove this extra line.

Pranjali-2501 · 2026-02-25T16:29:34Z

balancer/randomsubsetting/randomsubsetting_test.go

+
+	for _, tc := range testCases {
+		endpoints := makeEndpoints(tc.eps)
+		// From a set of N numbers, we randomly select K-times a subset of L numbers,


Instead of keeping it as an inline comment inside the test loop, we should move it to the official Godoc for TestUniformDistributionOfEndpoints

Something like this in the beginning:

Suggested change

// From a set of N numbers, we randomly select K-times a subset of L numbers,

// TestUniformDistributionOfEndpoints verifies that the random subsetting

// policy achieves a uniform distribution across backends. From a set of N

// numbers, it randomly selects K-times a subset of L numbers, where L < N.

// Then it calculates how many times each number belonging to set N appears,

// compute the variance and standard deviation, and use a Chi-Square test to

// check whether the distribution is uniform.

Done and move to the beginning of function declaration.

Pranjali-2501 · 2026-02-25T16:29:39Z

balancer/randomsubsetting/randomsubsetting_test.go

+		sigma := math.Sqrt(variance)         // Standard Deviation σ(N) = sqrt(σ²(N))
+
+		EndpointCount := make(map[string]int, N)
+		initEndpointCount(EndpointCount, endpoints)


Since initEndpointCount() is a very simple helper used only once within TestUniformDistributionOfEndpoints, we could consider inlining this logic directly into the main test function.

Pranjali-2501 · 2026-02-25T16:29:42Z

balancer/randomsubsetting/randomsubsetting_test.go

+// ChiSquareCriticalValue calculates the critical value for alpha (e.g., 0.05)
+// and degrees of freedom (df).
+func chiSquareCriticalValue(alpha float64, df float64) float64 {
+	// 1. Find the Z-score for the given alpha.


Please remove 1./2. from the comments.

Pranjali-2501 · 2026-02-25T16:30:07Z

balancer/randomsubsetting/randomsubsetting_test.go

+func chiSquareCriticalValue(alpha float64, df float64) float64 {
+	// 1. Find the Z-score for the given alpha.
+	// For alpha = 0.05 (95% confidence), Z is approx 1.64485
+	z := getZScore(1 - alpha)


Same here.
getZScore is only used once within chiSquareCriticalValue and serves as a simple lookup table, we can inline this logic too.

Pranjali-2501

@marek-szews I have added some comments, please take a look at it.

Pranjali-2501 · 2026-02-25T16:35:10Z

@arjan-bal and @easwars, could you please take a look at the statistical verification logic introduced in TestUniformDistributionOfEndpoints.

Thanks

easwars · 2026-02-25T20:55:32Z

TestUniformDistributionOfEndpoints

We already have an implementation of this statistical verification logic here:

grpc-go/internal/testutils/roundrobin/roundrobin.go

Line 253 in 7136e99

    
           func pearsonsChiSquareTest(t *testing.T, observedCounts, expectedCounts map[string]float64) error {

@arjan-bal knows this better than anyone else.

But we should try and see if this can reused here instead of reimplementing it.

marek-szews

All comments have been taken into account, please check again.

marek-szews · 2026-02-26T13:12:56Z

balancer/randomsubsetting/randomsubsetting_test.go

 	}
 }
+
+func (s) TestSubsettingEndpointsSimply(t *testing.T) {


Not exactly like that, function makeEndpoints(15) will produce the series of endpoints with a arithmetic order {endpoint_0, endpoint_1, ..., endpoint_14}. Same makeEndpoints(5) give us {endpoint_0, endpoint_1,.., endpoint_4}. But the result of LB.calculateSubset(eps) is unpredictable. One time we can received subset {endpoint_4, endpoint_5, endpoint_8, endpoint_11, endpoint_14} another time completely different. Validation of members of subset 99% will failed. I have left those two tests separated, because the first of them, check the content of set (boundary condition is meet so method always return the unchanged set - with a origin order) while the second test due to randomisation of content of subset, validate only the cardinality of the set (number of elements).

marek-szews · 2026-02-26T13:19:10Z

balancer/randomsubsetting/randomsubsetting_test.go

+}
+
+func (s) TestUniformDistributionOfEndpoints(t *testing.T) {
+


marek-szews · 2026-02-26T15:08:47Z

balancer/randomsubsetting/randomsubsetting_test.go

+
+	for _, tc := range testCases {
+		endpoints := makeEndpoints(tc.eps)
+		// From a set of N numbers, we randomly select K-times a subset of L numbers,


Done and move to the beginning of function declaration.

marek-szews · 2026-02-26T15:12:25Z

balancer/randomsubsetting/randomsubsetting_test.go

+		sigma := math.Sqrt(variance)         // Standard Deviation σ(N) = sqrt(σ²(N))
+
+		EndpointCount := make(map[string]int, N)
+		initEndpointCount(EndpointCount, endpoints)


marek-szews · 2026-02-26T16:06:53Z

balancer/randomsubsetting/randomsubsetting_test.go

+
+	for _, tc := range testCases {
+		endpoints := makeEndpoints(tc.eps)
+		// From a set of N numbers, we randomly select K-times a subset of L numbers,


Done and move to the beginning of function declaration.

marek-szews · 2026-02-26T16:07:10Z

balancer/randomsubsetting/randomsubsetting_test.go

+		sigma := math.Sqrt(variance)         // Standard Deviation σ(N) = sqrt(σ²(N))
+
+		EndpointCount := make(map[string]int, N)
+		initEndpointCount(EndpointCount, endpoints)


marek-szews · 2026-02-26T16:07:39Z

balancer/randomsubsetting/randomsubsetting_test.go

+// ChiSquareCriticalValue calculates the critical value for alpha (e.g., 0.05)
+// and degrees of freedom (df).
+func chiSquareCriticalValue(alpha float64, df float64) float64 {
+	// 1. Find the Z-score for the given alpha.


marek-szews · 2026-02-26T16:09:10Z

balancer/randomsubsetting/randomsubsetting_test.go

+func chiSquareCriticalValue(alpha float64, df float64) float64 {
+	// 1. Find the Z-score for the given alpha.
+	// For alpha = 0.05 (95% confidence), Z is approx 1.64485
+	z := getZScore(1 - alpha)


Extend the test coverage of randomsubsetting package.

04b9193

marek-szews changed the title ~~balancer/randomsubsetting: Extend the test coverage of randomsubsetting package.~~ balancer/randomsubsetting: Extend the unit tests in the randomsubsetting package. Dec 19, 2025

Fix the CI checks failures.

a2f51bc

Pranjali-2501 reviewed Jan 23, 2026

View reviewed changes

Pranjali-2501 added the Type: Testing label Jan 23, 2026

Pranjali-2501 assigned marek-szews Jan 23, 2026

Pranjali-2501 added this to the 1.80 Release milestone Jan 23, 2026

easwars assigned easwars and unassigned easwars Jan 27, 2026

easwars added the Status: Requires Reporter Clarification label Jan 27, 2026

marek-szews added 5 commits January 30, 2026 14:54

Rework of PR. Add validation of uniform distribution.

7638565

Rework of PR. Add negative scenarios.

b79c675

Rework of PR. Change of name of function.

26c52b6

Rework of PR. Change of name of function.

7a18d55

Rework of PR. Fix of static analysis error.

ae6f169

marek-szews commented Feb 6, 2026

View reviewed changes

gemini-code-assist bot reviewed Feb 6, 2026

View reviewed changes

All comments left by Gemini Code Assist have been resolved.

345055c

marek-szews commented Feb 12, 2026

View reviewed changes

easwars removed the Status: Requires Reporter Clarification label Feb 12, 2026

easwars assigned Pranjali-2501 and unassigned marek-szews Feb 12, 2026

marek-szews commented Feb 20, 2026

View reviewed changes

marek-szews commented Feb 24, 2026

View reviewed changes

Pranjali-2501 reviewed Feb 25, 2026

View reviewed changes

Pranjali-2501 assigned easwars, arjan-bal and marek-szews and unassigned Pranjali-2501 Feb 25, 2026

Pranjali-2501 requested review from arjan-bal and easwars February 25, 2026 16:37

easwars unassigned easwars and arjan-bal Feb 25, 2026

easwars added the Status: Requires Reporter Clarification label Feb 25, 2026

marek-szews commented Feb 26, 2026

View reviewed changes

Pranjali-2501 removed the Status: Requires Reporter Clarification label Feb 28, 2026

		}

		func (s) TestUniformDistributionOfEndpoints(t *testing.T) {

-		// From a set of N numbers, we randomly select K-times a subset of L numbers,
+// TestUniformDistributionOfEndpoints verifies that the random subsetting
+// policy achieves a uniform distribution across backends. From a set of N
+// numbers, it randomly selects K-times a subset of L numbers, where L < N.
+// Then it calculates how many times each number belonging to set N appears,
+// compute the variance and standard deviation, and use a Chi-Square test to
+// check whether the distribution is uniform.

Conversation

marek-szews commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

marek-szews Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

marek-szews left a comment

Choose a reason for hiding this comment

Uh oh!

emil10001 commented Feb 6, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marek-szews left a comment

Choose a reason for hiding this comment

Uh oh!

marek-szews left a comment

Choose a reason for hiding this comment

Uh oh!

marek-szews left a comment

Choose a reason for hiding this comment

Uh oh!

Pranjali-2501 commented Feb 24, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

marek-szews commented Dec 19, 2025 •

edited

Loading

codecov bot commented Dec 19, 2025 •

edited

Loading

marek-szews Feb 3, 2026 •

edited

Loading

Pranjali-2501 left a comment •

edited

Loading

Pranjali-2501 commented Feb 25, 2026 •

edited

Loading