
Conversation

@Scut-Corgis (Contributor) commented Nov 16, 2025

Problem

When executing FLUSHALL ASYNC on a writable replica that has
a large number of keys with expirations written directly to it, the main
thread blocks for an extended period while synchronously releasing the
replicaKeysWithExpire dictionary.

Root Cause

FLUSHALL ASYNC is designed to free the core data structures lazily, but
the release of replicaKeysWithExpire (a dictionary tracking keys that have
an expire set on a writable replica) still happens synchronously on the
main thread. This synchronous release becomes a bottleneck at large key
volumes because it is never offloaded to the lazyfree background thread.

This PR addresses the issue by moving the release of replicaKeysWithExpire
to the lazyfree background thread, aligning it with the asynchronous design
of FLUSHALL ASYNC and eliminating main thread blocking.

User scenarios

In day-to-day operations, people often need to perform primary-replica
switches. One goal is to avoid noticeable impact on the business, such as
key loss or reduced availability (e.g., write failures).

Here is the process: first, temporarily switch traffic to writable replicas.
Then wait for the primary's pending replication data to be fully synced
(so the primary and replicas are in sync) before finishing the switch.
The flush is not strictly required in this case, but it is an optimization
that can be done.

codecov bot commented Nov 19, 2025

Codecov Report

❌ Patch coverage is 17.64706% with 14 lines in your changes missing coverage. Please review.
✅ Project coverage is 72.43%. Comparing base (86db609) to head (2a63009).
⚠️ Report is 12 commits behind head on unstable.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/lazyfree.c | 0.00% | 11 Missing ⚠️ |
| src/expire.c | 25.00% | 3 Missing ⚠️ |
Additional details and impacted files
@@             Coverage Diff              @@
##           unstable    #2849      +/-   ##
============================================
+ Coverage     72.26%   72.43%   +0.17%     
============================================
  Files           128      128              
  Lines         70370    70428      +58     
============================================
+ Hits          50851    51017     +166     
+ Misses        19519    19411     -108     
| Files with missing lines | Coverage Δ |
|---|---|
| src/db.c | 93.15% <100.00%> (ø) |
| src/expire.c | 97.29% <25.00%> (-0.53%) ⬇️ |
| src/lazyfree.c | 85.41% <0.00%> (-7.07%) ⬇️ |

... and 23 files with indirect coverage changes


@Scut-Corgis Scut-Corgis force-pushed the lazyfree-replicaKeysWithExpire branch from a428436 to 7fa9355 on November 19, 2025 12:06
@enjoy-binbin (Member) commented Nov 24, 2025

> We generally discourage the use of writable replicas. What user scenarios would you use it in?

The async-path code is pretty simple, so I guess there is no harm in using it. @zuiderkwast WDYT?

@Scut-Corgis (Contributor, Author):

> We generally discourage the use of writable replicas. What user scenarios would you use it in?

In our daily operations, we often need to do master-slave switches. Our goal is to avoid noticeable impact on the business—like key loss or reduced availability (e.g., write failures).
Here’s our process: First, we temporarily switch traffic to writable replicas. Then we wait for the master’s pending replication data to be fully synced (so master and replicas are in sync), before finishing the full master-slave switch.

Co-authored-by: Binbin <[email protected]>
Signed-off-by: jiegang0219 <[email protected]>
@zuiderkwast (Contributor):

> We generally discourage the use of writable replicas. What user scenarios would you use it in?
>
> In our daily operations, we often need to do master-slave switches. Our goal is to avoid noticeable impact on the business—like key loss or reduced availability (e.g., write failures). Here’s our process: First, we temporarily switch traffic to writable replicas. Then we wait for the master’s pending replication data to be fully synced (so master and replicas are in sync), before finishing the full master-slave switch.

Yeah, this method is described here: https://valkey.io/topics/admin/#upgrading-or-restarting-a-valkey-instance-without-downtime. Is this where you found the recommendation?

I think there is no need to use writeable replicas. We should instead recommend the FAILOVER command, which does a coordinated failover between the primary and the replica. Is there any benefit in using writeable replicas during this switch-over or is the documentation simply outdated?

@Scut-Corgis (Contributor, Author):

> Yeah, this method is described here: https://valkey.io/topics/admin/#upgrading-or-restarting-a-valkey-instance-without-downtime. Is this where you found the recommendation?
>
> I think there is no need to use writeable replicas. We should instead recommend the FAILOVER command, which does a coordinated failover between the primary and the replica. Is there any benefit in using writeable replicas during this switch-over or is the documentation simply outdated?

The method in the linked document aligns with what I described — we use writeable replicas for primary-replica switching.

The FAILOVER command has two modes: either it makes the primary lose some written data, or it disables writes for a period to align their offsets before switching. Both modes are actually business-perceptible for us. @zuiderkwast

@zuiderkwast zuiderkwast (Contributor) left a comment

> The FAILOVER command has two modes: either it makes the primary lose some written data, or it disables writes for a period to align their offsets before switching. Both modes are actually business-perceptible for us.

Interesting. So if you (and other users) really need writable replicas, then we can't deprecate them. The implementation is very simple so I want to accept it. @enjoy-binbin Do you agree?

The reason we are skeptical of writable replicas in general is that they can cause data inconsistency. If some data is written directly to the replica (such as SET k v) and some data is replicated from the primary (such as HSET k f v), replication can fail when the key is of a different type than the replicated command expects (for example, replication of HSET would fail if the key is not a hash). It is a theoretical problem. In practice, it's easy to avoid, but it's good to be aware of it.

@enjoy-binbin enjoy-binbin changed the title Implement Lazyfree for replicaKeysWithExpire to Avoid Main Thread Blocking in FLUSHALL ASYNC Add support for asynchronous release to replicaKeysWithExpire on writable replica Nov 25, 2025
@enjoy-binbin enjoy-binbin merged commit dd2827a into valkey-io:unstable Nov 25, 2025
55 checks passed
@enjoy-binbin enjoy-binbin added the release-notes This issue should get a line item in the release notes label Nov 25, 2025