`storage.BatchSeriesReferencer` to create all series in a single call #860

colega · 2025-03-26T17:12:00Z

This adds an experimental BatchSeriesRefs method in the appender that should be used to create all series from a scrape/write request in a single call, reducing the mutex contention on MemPostings.Add for customers with a lot of churn and for Mimir ingesters that have recently started (or went out of read-only mode) and are receiving avalanches of new series.

`storage.BatchAppender` adds the BatchSeriesRefs method to the Appender(). This method can be used to make sure that all the series from the incoming scrape or write request already exist (not wired here, as I'm implementing it for Mimir, but can be used for Prometheus as well). The reason to add this is to take advantage of a single operation to reduce locking time on MemPostings among others: right now during Mimir startup we have to call `Add()` millions of times within a few seconds, which causes lots of lock contention, but most of the time is actually spent locking & unlocking rather than adding series. This method does not implement the optimization yet: it only focuses on the exposed interface. I couldn't make Head.Appender() return a BatchAppender because it returns the initAppender which initializes the head on the first append call: if I wanted initAppender to implement BatchAppender we would need to add a timestamp to BatchSeriesRefs, which doesn't really belong here. So we are exposing HeadInitialized() and InitializeHead() methods and move the initialization responisibility to the caller, so it can be sure that BatchAppender() method has always an initialized appender. Signed-off-by: Oleg Zaytsev <[email protected]>

This goes one step further, and does batch series creation in Appender.BatchSeriesRef as well as creates a new implementation of getOrCreateBatch(). One of the advantages of the getOrCreateBatch() is that it's now adding new seires to MemPostings in a batch, probably in a more sorted fashion than previous implementation. MemPostings.AddBatch was added but it trivially uses Add() so far. Signed-off-by: Oleg Zaytsev <[email protected]>

This implements AddBatch() in a simple way: take the mutex, create series using the existing mechanisms, and make pauses every 512 series to allow reads to proceed: we don't want them to wait forever. This optimizes the locking/unlocking time: effectively taking it 512x times less. I think we could still do better: if batch is large enough (1000 elements?) we could send it through some workers optimizing the locked time. We can just ensure from the beginning that all labels exist, and then shard the labels across GOMAXPROCS workers, and send each label to it's specific worker, without having to coordinate them and still holding a single mutex. Signed-off-by: Oleg Zaytsev <[email protected]>

This modifies the method to copy the provided labels before creating them in the index and Head, and return the existing stored labels if possible. This is quite specific to our Mimir optimizations, but it's really needed if we want to use this method with all the current optimizations. Signed-off-by: Oleg Zaytsev <[email protected]>

Adding series to a.series will make sure they're tracked in the WAL. There's an increased risk of losing these series if gc() happens in between, that should be fixed once prometheus#16333 is merged. Signed-off-by: Oleg Zaytsev <[email protected]>

Signed-off-by: Oleg Zaytsev <[email protected]>

colega mentioned this pull request Mar 26, 2025

Use BatchSeriesRefs() to create incoming series in one batch instead of one by one grafana/mimir#11019

Closed

colega force-pushed the batch-appender branch 2 times, most recently from e0efb52 to bf5711f Compare March 27, 2025 14:27

colega changed the title ~~storage.BatchAppender with BatchSeriesRef to create all series in a single call~~ storage.BatchAppender with BatchSeriesRefs to create all series in a single call Mar 31, 2025

colega changed the title ~~storage.BatchAppender with BatchSeriesRefs to create all series in a single call~~ storage.BatchSeriesReferencer to create all series in a single call Mar 31, 2025

colega added 7 commits March 31, 2025 17:07

Fix scratch builder usage in the test

346b79e

Signed-off-by: Oleg Zaytsev <[email protected]>

Simplify interface, just fallback to noop

4094fcf

Signed-off-by: Oleg Zaytsev <[email protected]>

colega force-pushed the batch-appender branch from 3f85ca4 to 4094fcf Compare March 31, 2025 15:07

colega added 4 commits March 31, 2025 17:31

Init time in tests

99ac8dd

Signed-off-by: Oleg Zaytsev <[email protected]>

Fix another test

add283e

Signed-off-by: Oleg Zaytsev <[email protected]>

make linter happy

7fd0e25

Signed-off-by: Oleg Zaytsev <[email protected]>

Fix index getting non copied labels

29b5993

Signed-off-by: Oleg Zaytsev <[email protected]>

colega mentioned this pull request Apr 1, 2025

Improvement: improve multiple series creation (especially on ingester startup) grafana/mimir#11078

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`storage.BatchSeriesReferencer` to create all series in a single call #860

`storage.BatchSeriesReferencer` to create all series in a single call #860

Uh oh!

colega commented Mar 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

storage.BatchSeriesReferencer to create all series in a single call #860

Are you sure you want to change the base?

storage.BatchSeriesReferencer to create all series in a single call #860

Uh oh!

Conversation

colega commented Mar 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

`storage.BatchSeriesReferencer` to create all series in a single call #860

`storage.BatchSeriesReferencer` to create all series in a single call #860