Skip to content

Fix flakereport#9399

Open
stephanos wants to merge 1 commit intomainfrom
fix-flakereport
Open

Fix flakereport#9399
stephanos wants to merge 1 commit intomainfrom
fix-flakereport

Conversation

@stephanos
Copy link
Contributor

@stephanos stephanos commented Feb 25, 2026

What changed?

Fix issues with flakereport; see comments for details.

How did you test it?

  • built
  • run locally and tested manually
  • covered by existing tests
  • added new unit test(s)
  • added new functional test(s)
Example

Flaky Tests Report - 2026-02-25 11:25:50

Overall Statistics

  • CI Success Rate: 4/8 (50.00%)
  • Total Test Runs: 261830
  • Total Failures: 404
  • Overall Failure Rate: 1.5 per 1000 tests

Failure Categories Summary

Category Unique Tests
CI Breakers 1
Crashes 1
Timeouts 0
Flaky Tests 129

CI Breakers (Failed All Retries)

Test CI Runs Broken Total Failures Last Failure Links
TestVersionWorkflowSuite/v2/Test_DeleteVersion_QueryAfterDeletion 1 1 21h ago 1

Crashes

Test Flake Rate Last Failure Links
unit-test 100.0% (2/2) 21h ago 1 2

Flaky Tests

Test Flake Rate Last Failure Links
TestClientDataConverterTestSuite 100.0% (1/1) 44h ago 1
PANIC: runtime error: index out of range [0] with length 0 [recovered, repanicked] — in go.temporal.io/server/tests.TestWorkflowUpdateSuite.func21.2 100.0% (1/1) 44h ago 1
PANIC: Fail in goroutine after TestVersioning3FunctionalSuiteV2/TestWorkflowWithPinnedOverride_NoSticky has completed — in TestVersioning3FunctionalSuiteV0/TestActivityRetryAutoUpgradeDuringBackoff 100.0% (1/1) 44h ago 1
PANIC: Fail in goroutine after TestVersioning3FunctionalSuiteV0/TestPinnedQuery_RollbackDrainedVersion/ForceTaskForwardNoPollForwardForceAsync has completed — in TestVersioning3FunctionalSuiteV2/TestActivityRetryAutoUpgradeDuringBackoff 100.0% (1/1) 60h ago 1
DATA RACE: Data race detected 100.0% (1/1) 23h ago 1
PANIC: runtime error: invalid memory address or nil pointer dereference — in go.temporal.io/server/tests.(*DeploymentVersionSuite).TestForceCAN_WithOverrideState.func1 100.0% (1/1) 38h ago 1
TestVersioning3FunctionalSuiteV0/TestTransitionDuringTransientTask_WithSignal 48.9% (45/92) 1h ago 1 2 3
TestVersioning3FunctionalSuiteV2/TestTransitionDuringTransientTask_WithoutSignal 48.9% (45/92) 1h ago 1 2 3
TestVersioning3FunctionalSuiteV2/TestTransitionDuringTransientTask_WithSignal 48.9% (45/92) 1h ago 1 2 3
TestVersioning3FunctionalSuiteV0/TestTransitionDuringTransientTask_WithoutSignal 43.4% (36/83) 1h ago 1 2 3
TestVersionWorkflowSuite/v2/Test_DeleteVersion_QueryAfterDeletion 42.9% (3/7) 21h ago 1 2 3
TestUserData_FetchesUpTree 16.7% (1/6) 23h ago 1
TestNewServerWithOTEL/with_OTEL_Collector_running 16.7% (1/6) 21h ago 1
TestInterleavedWeightedRoundRobinSchedulerSuite/TestInactiveChannelDeletionRace 16.7% (1/6) 60h ago 1
TestUserData_FetchesActivityToWorkflow 16.7% (1/6) 23h ago 1
TestDeploymentVersionSuiteV0/TestReactivationSignalCache_Deduplication_StartWorkflow 16.1% (9/56) 1h ago 1 2 3
TestNewServer 15.4% (2/13) 23h ago 1 2
TestDeploymentVersionSuiteV0/TestReactivationSignalCache_Deduplication_SignalWithStart 11.3% (6/53) 21h ago 1 2 3
TestDeploymentVersionSuiteV0/TestStartWorkflowExecution_ReactivateVersionOnPinned_WithConflictPolicy 9.6% (5/52) 21h ago 1 2 3
TestDeploymentVersionSuiteV2/TestVersionScavenger_DeleteOnAdd 9.6% (5/52) 1h ago 1 2 3
TestDeploymentVersionSuiteV0/TestStartWorkflowExecution_ReactivateVersionOnPinned 9.3% (5/54) 21h ago 1 2 3
TestDeploymentVersionSuiteV0/TestSignalWithStartWorkflowExecution_ReactivateVersionOnPinned 7.8% (4/51) 21h ago 1 2 3
TestFairnessAutoEnableSuite/Test_Activity_Basic 7.7% (4/52) 1h ago 1 2 3
TestDeploymentVersionSuiteV2/TestReactivationSignalCache_Deduplication_UpdateOptions 7.7% (4/52) 21h ago 1 2 3
TestActivityClientTestSuite/TestActivityScheduleToClose_FiredDuringBackoff 7.5% (4/53) 1h ago 1 2 3
TestDeploymentVersionSuiteV0/TestReactivationSignalCache_Deduplication_Reset 6.0% (3/50) 38h ago 1 2 3
TestDeploymentVersionSuiteV0/TestResetWorkflowExecution_ReactivateVersionOnPinned 6.0% (3/50) 21h ago 1 2 3
TestDeploymentVersionSuiteV2/TestVersionMissingTaskQueues_ValidSetRampingVersion 6.0% (3/50) 21h ago 1 2 3
TestTaskQueueStats_Pri_Suite/TestRampingAndCurrentAbsorbUnversionedBacklog/ForceTaskForwardForcePollForwardAllowSync 5.9% (3/51) 21h ago 1 2 3
TestWorkerDeploymentSuiteV0/TestSetCurrentVersion_Concurrent_DifferentVersions_NoUnexpectedErrors 5.9% (3/51) 21h ago 1 2 3
TestDeploymentVersionSuiteV0/TestUpdateWorkflowExecutionOptions_ReactivateVersionOnPinned 5.9% (3/51) 1h ago 1 2 3
TestCronTestClientSuite/TestCronWorkflowCompletionStates 5.8% (3/52) 21h ago 1 2 3
TestCallbacksSuiteCHASM/TestWorkflowNexusCallbacks_CarriedOver/ContinueAsNew 5.8% (3/52) 1h ago 1 2 3
TestStandaloneActivityTestSuite/TestHeartbeat/HeartbeatKeepsActivityAlive 5.8% (3/52) 21h ago 1 2 3
TestCallbacksSuiteCHASM/TestWorkflowNexusCallbacks_CarriedOver/WorkflowRunTimeout 5.8% (3/52) 1h ago 1 2 3
TestWorkerDeploymentSuiteV0/TestDescribeWorkerDeployment_SetCurrentVersion 5.8% (3/52) 21h ago 1 2 3
TestDeploymentVersionSuiteV0/TestReactivationSignalCache_Deduplication_UpdateOptions 4.1% (2/49) 1h ago 1 2
TestTaskQueueStats_Classic_Suite/TestAddMultipleTasks_ValidateStats_Cached 4.0% (2/50) 21h ago 1 2
TestTaskQueueStats_Pri_Suite/TestInactiveVersionDoesNotAbsorbUnversionedBacklog/NoTaskForwardNoPollForwardAllowSync 4.0% (2/50) 1h ago 1 2
TestTaskQueueStats_Pri_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardForcePollForwardAllowSync 4.0% (2/50) 21h ago 1 2
TestWorkerDeploymentSuiteV0/TestSetRampingVersion_AfterDrained 4.0% (2/50) 38h ago 1 2
TestWorkerDeploymentSuiteV2/TestDrainRollbackedVersion 4.0% (2/50) 38h ago 1 2
TestFuncClustersTestSuite/EnableTransitionHistory/TestForceMigration_ResetWorkflow 4.0% (2/50) 1h ago 1 2
TestDeploymentVersionSuiteV0/TestVersionScavenger_DeleteOnAdd 4.0% (2/50) 38h ago 1 2
TestWorkerDeploymentSuiteV2/TestSetCurrentVersion_Concurrent_DifferentVersions_NoUnexpectedErrors 4.0% (2/50) 21h ago 1 2
TestPrioritySuite/TestSubqueue_Migration 3.9% (2/51) 21h ago 1 2
TestCallbacksSuiteCHASM/TestWorkflowNexusCallbacks_CarriedOver/WorkflowFailureRetry 3.9% (2/51) 1h ago 1 2
TestWorkerDeploymentSuiteV0/TestDrainRollbackedVersion 3.9% (2/51) 23h ago 1 2
TestVersioning3FunctionalSuiteV0/TestPinnedTask_NoProperPoller/NoTaskForwardNoPollForwardForceAsync 2.1% (1/48) 38h ago 1
TestVersioning3FunctionalSuiteV2/TestPinnedCaN_UpgradeOnCaN_NormalWFT_WithSuggest/ForceTaskForwardNoPollForwardForceAsync 2.1% (1/48) 60h ago 1
TestDeploymentVersionSuiteV0/TestUpdateWorkflowExecutionOptions_SetImpliedPinnedSuccess 2.1% (1/48) 21h ago 1
TestGetHistoryFunctionalSuite/DisableTransitionHistory/TestGetWorkflowExecutionHistory_All 2.1% (1/48) 23h ago 1
TestVersioning3FunctionalSuiteV2/TestUnpinnedCaN 2.1% (1/48) 23h ago 1
TestVersioningFunctionalSuite/TestWorkflowTaskRedirectInRetryNonFirstTask/NoTaskForwardForcePollForwardAllowSync 2.1% (1/48) 1h ago 1
TestWorkerDeploymentSuiteV0/TestListWorkerDeployments_TwoVersions_SameDeployment_OneCurrent_OneRamping 2.1% (1/48) 23h ago 1
TestTaskQueueStats_Classic_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardNoPollForwardForceAsync 2.1% (1/48) 1h ago 1
TestVersioning3FunctionalSuiteV2/TestWorkflowWithPinnedOverride_NoSticky/ForceTaskForwardForcePollForwardForceAsync 2.1% (1/48) 44h ago 1
TestAdvancedVisibilitySuiteLegacy/TestListWorkflow_StringQuery 2.1% (1/48) 23h ago 1
TestVersioning3FunctionalSuiteV0/TestChildWorkflowInheritance_ParentPinnedByOverride 2.1% (1/48) 23h ago 1
TestStandaloneActivityTestSuite/TestRequestCancel/MismatchedTokenComponentRef 2.1% (1/48) 1h ago 1
TestDeploymentVersionSuiteV2/TestVersionMissingTaskQueues_ValidSetCurrentVersion 2.1% (1/48) 21h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedQuery_DrainedVersion_PollersAbsent/ForceTaskForwardNoPollForwardAllowSync 2.1% (1/48) 23h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedWorkflowWithLateActivityPoller/ForceTaskForwardNoPollForwardAllowSync 2.1% (1/48) 60h ago 1
TestTaskQueueStats_Classic_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardForcePollForwardAllowSync 2.1% (1/48) 21h ago 1
TestDeploymentVersionSuiteV2/TestStartWorkflowExecution_ReactivateVersionOnPinned_WithConflictPolicy 2.1% (1/48) 1h ago 1
TestDeploymentVersionSuiteV2/TestSignalWithStartWorkflowExecution_ReactivateVersionOnPinned 2.1% (1/48) 23h ago 1
TestVersioningFunctionalSuite/TestDispatchQueryOld/ForceTaskForwardNoPollForwardAllowSync 2.1% (1/48) 23h ago 1
TestVersioningFunctionalSuite/TestWorkflowTaskRedirectInRetryFirstTask/ForceTaskForwardForcePollForwardForceAsync 2.1% (1/48) 44h ago 1
TestTaskQueueStats_Pri_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardForcePollForwardForceAsync 2.1% (1/48) 38h ago 1
TestWorkerDeploymentSuiteV0/TestDescribeWorkerDeployment_MultipleVersions_Sorted 2.0% (1/49) 38h ago 1
TestDeploymentVersionSuiteV2/TestDrainageStatus_SetCurrentVersion_YesOpenWFs 2.0% (1/49) 38h ago 1
TestVersioning3FunctionalSuiteV0/TestAutoUpgradeCaN_UpgradeOnCaN/ForceTaskForwardNoPollForwardAllowSync 2.0% (1/49) 38h ago 1
TestWorkerDeploymentSuiteV2/TestSetCurrentVersion_Batching 2.0% (1/49) 38h ago 1
TestTaskQueueStats_Classic_Suite/TestCurrentAbsorbsUnversionedBacklog_WhenRampingToUnversioned/NoTaskForwardForcePollForwardForceAsync 2.0% (1/49) 44h ago 1
TestTaskQueueSuite/TestTaskQueueRateLimit_UpdateFromWorkerConfigAndAPI 2.0% (1/49) 1h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedQuery_RollbackDrainedVersion/ForceTaskForwardNoPollForwardForceAsync 2.0% (1/49) 60h ago 1
TestWorkflowUpdateSuite/StickySpeculativeWorkflowTask_AcceptComplete_StickyWorkerUnavailable 2.0% (1/49) 44h ago 1
TestWorkerDeploymentSuiteV2/TestResourceExhaustedErrors_Converted_To_ReadableMessage 2.0% (1/49) 60h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedCaN_UpgradeOnCaN_TransientWFT_WithSuggest/ForceTaskForwardForcePollForwardForceAsync 2.0% (1/49) 1h ago 1
TestVersioning3FunctionalSuiteV2/TestPinnedCaN_UpgradeOnCaN_NormalWFT_PinnedOverride_WithSuggest/NoTaskForwardForcePollForwardAllowSync 2.0% (1/49) 38h ago 1
TestTaskQueueStats_Pri_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardNoPollForwardForceAsync 2.0% (1/49) 44h ago 1
TestChildWorkflowSuite/TestCronChildWorkflowExecution 2.0% (1/49) 60h ago 1
TestNexusStateReplicationTestSuite/DisableTransitionHistory/TestNexusCallbackReplicated 2.0% (1/49) 1h ago 1
TestWorkerDeploymentSuiteV0/TestSetWorkerDeploymentRampingVersion_WithCurrent_Unset_Ramp 2.0% (1/49) 60h ago 1
TestFuncClustersTestSuite/EnableTransitionHistory/TestForceMigration_ClosedWorkflow 2.0% (1/49) 23h ago 1
TestTaskQueueStats_Classic_Suite/TestCurrentVersionAbsorbsUnversionedBacklog_NoRamping/ForceTaskForwardNoPollForwardForceAsync 2.0% (1/49) 60h ago 1
TestWorkflowUpdateSuite/SpeculativeWorkflowTask_ScheduleToStartTimeoutOnNormalTaskQueue 2.0% (1/49) 44h ago 1
TestStandaloneActivityTestSuite/TestRequestCancel/StaleAttemptToken 2.0% (1/49) 38h ago 1
TestTaskQueueStats_Classic_Suite/TestCurrentVersionAbsorbsUnversionedBacklog_NoRamping/NoTaskForwardForcePollForwardAllowSync 2.0% (1/49) 23h ago 1
TestVersioning3FunctionalSuiteV2/TestUnpinnedQuery_NoSticky/ForceTaskForwardForcePollForwardForceAsync 2.0% (1/49) 21h ago 1
TestTaskQueueStats_Classic_Suite/TestRampingAbsorbsUnversionedBacklog_WhenCurrentIsUnversioned/NoTaskForwardForcePollForwardForceAsync 2.0% (1/49) 60h ago 1
TestTaskQueueStats_Pri_Suite/TestCurrentVersionAbsorbsUnversionedBacklog_NoRamping/NoTaskForwardNoPollForwardAllowSync 2.0% (1/49) 1h ago 1
TestTaskQueueStats_Pri_Suite/TestRampingAndCurrentAbsorbUnversionedBacklog/NoTaskForwardNoPollForwardAllowSync 2.0% (1/49) 44h ago 1
TestFuncClustersTestSuite/DisableTransitionHistory/TestForceMigration_ClosedWorkflow 2.0% (1/49) 60h ago 1
TestWorkerDeploymentSuiteV0/TestSetRampingVersion_Concurrent_DifferentVersions_NoUnexpectedErrors 2.0% (1/49) 21h ago 1
TestTaskQueueStats_Classic_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/NoTaskForwardNoPollForwardAllowSync 2.0% (1/49) 38h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedQuery_RollbackDrainedVersion/NoTaskForwardForcePollForwardAllowSync 2.0% (1/49) 60h ago 1
TestVersioning3FunctionalSuiteV2/TestUnpinnedQuery_NoSticky/NoTaskForwardForcePollForwardAllowSync 2.0% (1/49) 38h ago 1
TestDeploymentVersionSuiteV0/TestForceCAN_WithOverrideState 2.0% (1/49) 38h ago 1
TestVersioningFunctionalSuite/TestDispatchActivityFailCrossTq/NoTaskForwardForcePollForwardAllowSync 2.0% (1/49) 38h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedCaN_UpgradeOnCaN_TransientWFT_WithSuggest/ForceTaskForwardNoPollForwardForceAsync 2.0% (1/49) 44h ago 1
TestWorkerDeploymentSuiteV0/TestSetWorkerDeploymentRampingVersion_Batching 2.0% (1/49) 60h ago 1
TestDeploymentVersionSuiteV0/TestDrainageStatus_SetCurrentVersion_NoOpenWFs 2.0% (1/50) 38h ago 1
TestScheduleFunctionalSuite/TestNextTimeCache 2.0% (1/50) 21h ago 1
TestVersioning3FunctionalSuiteV2/TestUnpinnedQuery_NoSticky/ForceTaskForwardNoPollForwardAllowSync 2.0% (1/50) 60h ago 1
TestPollerScalingFunctionalSuite/TestPollerScalingDecisionsAreSeenProbabilistically 2.0% (1/50) 44h ago 1
TestFairnessSuite/TestMigration_FromFair 2.0% (1/50) 60h ago 1
TestFairnessAutoEnableSuite/TestMigration_FromClassic 2.0% (1/50) 44h ago 1
TestTaskQueueStats_Pri_Suite/TestMultipleTasks_WithMatchingBehavior_ValidateStats/ForceTaskForwardNoPollForwardAllowSync 2.0% (1/50) 23h ago 1
TestWorkflowTaskTestSuite/TestWorkflowTaskHeartbeatingWithEmptyResult 2.0% (1/50) 60h ago 1
TestAdvancedVisibilitySuite/TestWorkerTaskReachability_Unversioned_InTaskQueue 2.0% (1/50) 60h ago 1
TestDeploymentVersionSuiteV2/TestDeleteVersion_DeleteRampedVersion 2.0% (1/50) 1h ago 1
TestWorkerDeploymentSuiteV0/TestDeploymentVersionLimits 2.0% (1/50) 60h ago 1
TestVersioning3FunctionalSuiteV2/TestPinnedQuery_RollbackDrainedVersion/ForceTaskForwardForcePollForwardForceAsync 2.0% (1/50) 1h ago 1
TestVersioning3FunctionalSuiteV2/TestUnpinnedQuery_NoSticky/NoTaskForwardNoPollForwardAllowSync 2.0% (1/50) 38h ago 1
TestDeploymentVersionSuiteV2/TestDeleteVersion_Drained_But_Pollers_Exist 2.0% (1/50) 1h ago 1
TestVersioning3FunctionalSuiteV2/TestQueryWithPinnedOverride_Sticky/ForceTaskForwardForcePollForwardAllowSync 2.0% (1/50) 38h ago 1
TestFairnessAutoEnableSuite/TestMigration_FromFair 2.0% (1/50) 1h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedCaN_UpgradeOnCaN_SpeculativeWFT_NoSuggest/NoTaskForwardNoPollForwardAllowSync 2.0% (1/50) 38h ago 1
TestWorkerDeploymentSuiteV0/TestDescribeWorkerDeployment_TwoVersions_Sorted 2.0% (1/50) 38h ago 1
TestDeploymentVersionSuiteV2/TestDrainageStatus_SetCurrentVersion_NoOpenWFs 2.0% (1/50) 1h ago 1
TestDeploymentVersionSuiteV2/TestDeleteVersion_ValidDelete 2.0% (1/50) 21h ago 1
TestScheduleFunctionalSuite/TestListSchedulesReturnsWorkflowStatus 2.0% (1/50) 38h ago 1
TestVersioningFunctionalSuite/TestWorkflowTaskRedirectInRetryFirstTask/ForceTaskForwardForcePollForwardAllowSync 2.0% (1/50) 60h ago 1
TestCronTestSuite/TestCronWorkflow 2.0% (1/50) 44h ago 1
TestVersioning3FunctionalSuiteV0/TestPinnedCaN_UpgradeOnCaN_NormalWFT_WithSuggest/ForceTaskForwardNoPollForwardForceAsync 2.0% (1/50) 38h ago 1
TestVersioning3FunctionalSuiteV2/TestAutoUpgradeCaN_UpgradeOnCaN/ForceTaskForwardForcePollForwardAllowSync 2.0% (1/50) 38h ago 1
TestDeploymentVersionSuiteV2/TestReactivationSignalCache_Deduplication_SignalWithStart 2.0% (1/50) 60h ago 1
TestVersioning3FunctionalSuiteV2/TestPinnedQuery_DrainedVersion_PollersAbsent/ForceTaskForwardNoPollForwardAllowSync 2.0% (1/50) 21h ago 1

Flaky Suites

Suite Flake Rate Last Failure
TestAcquireShard_DeadlineExceededErrorSuite 2.1% (1/47) 38h ago
TestAcquireShard_OwnershipLostErrorSuite 2.1% (1/48) 44h ago
TestActivityClientTestSuite 8.3% (4/48) 1h ago
TestActivityTestSuite 2.1% (1/48) 44h ago
TestAdminTestSuite 2.1% (1/48) 44h ago
TestAdvancedVisibilitySuite 4.2% (2/48) 44h ago
TestAdvancedVisibilitySuiteLegacy 2.1% (1/48) 23h ago
TestCallbacksSuiteCHASM 8.3% (4/48) 1h ago
TestChasmTestSuiteLegacy 2.1% (1/48) 44h ago
TestChildWorkflowSuite 2.1% (1/48) 60h ago
TestClientDataConverterTestSuite 100.0% (1/1) 44h ago
TestClientMiscTestSuite 2.1% (1/48) 44h ago
TestCronTestClientSuite 6.2% (3/48) 21h ago
TestCronTestSuite 2.1% (1/48) 44h ago
TestDLQSuite 4.2% (2/48) 44h ago
TestDeploymentVersionSuiteV0 62.5% (30/48) 1h ago
TestDeploymentVersionSuiteV2 22.9% (11/48) 1h ago
TestFairnessAutoEnableSuite 8.3% (4/48) 1h ago
TestFairnessSuite 2.1% (1/48) 60h ago
TestFuncClustersTestSuite 8.3% (4/48) 1h ago
TestGetHistoryFunctionalSuite 2.1% (1/48) 23h ago
TestNamespaceInterceptorTestSuite 6.2% (3/48) 38h ago
TestNamespaceSuite 1.9% (1/53) 44h ago
TestNexusApiTestSuiteWithTemporalFailures 2.1% (1/48) 44h ago
TestNexusStateReplicationTestSuite 2.1% (1/48) 1h ago
TestNexusWorkflowTestSuite 2.1% (1/48) 44h ago
TestPollerScalingFunctionalSuite 2.1% (1/48) 44h ago
TestPrioritySuite 4.2% (2/48) 21h ago
TestPurgeDLQTasksSuite 2.1% (1/48) 44h ago
TestQueryWorkflowSuite 2.1% (1/48) 38h ago
TestRawHistoryClientSuite 2.1% (1/48) 44h ago
TestRelayTaskTestSuite 2.1% (1/47) 44h ago
TestResetWorkflowTestSuite 2.1% (1/48) 44h ago
TestScheduleFunctionalSuite 6.2% (3/48) 21h ago
TestStandaloneActivityTestSuite 10.4% (5/48) 1h ago
TestTLSFunctionalSuite 2.1% (1/48) 44h ago
TestTaskQueueStats_Classic_Suite 18.8% (9/48) 1h ago
TestTaskQueueStats_Pri_Suite 25.0% (12/48) 1h ago
TestTaskQueueSuite 4.2% (2/48) 1h ago
TestTransientTaskSuite 2.1% (1/48) 44h ago
TestUpdateWorkflowSdkSuite 2.1% (1/48) 44h ago
TestUserMetadataSuite 2.1% (1/47) 44h ago
TestUserTimersTestSuite 2.1% (1/47) 44h ago
TestVersionWorkflowSuite 20.0% (1/5) 21h ago
TestVersioning3FunctionalSuiteV0 100.0% (48/48) 1h ago
TestVersioning3FunctionalSuiteV2 100.0% (48/48) 1h ago
TestVersioningFunctionalSuite 10.4% (5/48) 1h ago
TestWorkerDeploymentSuiteV0 16.7% (8/48) 21h ago
TestWorkerDeploymentSuiteV2 12.5% (6/48) 21h ago
TestWorkflowMemoTestSuite 2.1% (1/48) 44h ago
TestWorkflowTaskTestSuite 4.2% (2/48) 38h ago
TestWorkflowUpdateSuite 2.1% (1/48) 44h ago

if: ${{ !cancelled() }}
with:
name: junit-xml--${{github.run_id}}--${{github.run_attempt}}--unit-test
name: junit-xml--${{ github.run_id }}--${{ steps.get_job_id.outputs.job_id }}--${{ github.run_attempt }}--unit-test
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Aligning artifact name with functional tests to have consistent naming scheme for parsing.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this break backwards? so the next week won't have all of the reports but going forward it will?

Copy link
Contributor Author

@stephanos stephanos Feb 26, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's worth it since unit tests and integration tests have been flying under the radar anyway so far

// Group failures by test name
// Group failures by test name, then remove parent entries whose subtests were observed.
grouped := groupFailuresByTest(allFailures)
filterParentTests(grouped, testRunCounts)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will remove parent suites from the reports when they have subtests; removes misleading/noisy entries.

@stephanos stephanos marked this pull request as ready for review February 25, 2026 18:53
@stephanos stephanos requested review from a team as code owners February 25, 2026 18:53
@stephanos stephanos requested a review from spkane31 February 25, 2026 18:54
// generateSuiteReports creates per-suite flake breakdown from all failures and test runs.
// Suite flake rate = % of workflow runs where the suite had at least one non-retry failure.
// Suite flake rate = % of job executions where the suite had at least one non-retry failure.
func generateSuiteReports(allFailures []TestFailure, allTestRuns []TestRun) []SuiteReport {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Grouping of suites was too aggressive, now shows all suite runs.

@stephanos stephanos force-pushed the fix-flakereport branch 10 times, most recently from b499b6e to 9143911 Compare February 25, 2026 19:21
)

// hoursAgo formats a timestamp as "Xh ago" relative to now.
func hoursAgo(t time.Time) string {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is easier as it uses a consistent time unit.

@stephanos stephanos force-pushed the fix-flakereport branch 3 times, most recently from 63c61b7 to 95153bc Compare February 25, 2026 19:25
sr.SuiteName, sr.FlakeRate, sr.FailedRuns, sr.TotalRuns, lastFailure))
rate := fmt.Sprintf("%.1f%% (%d/%d)", sr.FlakeRate, sr.FailedRuns, sr.TotalRuns)
if sr.FlakeRate > 5 {
rate = "**" + rate + "**"
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Little bit of emphasis.

@stephanos stephanos force-pushed the fix-flakereport branch 2 times, most recently from c060df7 to b809a21 Compare February 25, 2026 19:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants