
Improve memory estimates in cloud storage #29773

Open

ballard26 wants to merge 3 commits into redpanda-data:dev from ballard26:cs-imprv-mem-est

Conversation

@ballard26
Contributor

@ballard26 ballard26 commented Mar 8, 2026

Previously, the per-segment memory estimate was sizeof(remote_segment) + 4KB, which significantly underestimated actual usage. The biggest missing contributor was the hydration wait lists in both the segment and its chunks: each can cache one free chunk (~20KB) even when empty. Assuming at least one segment wait list and one chunk wait list per segment, this leaves ~40KB of memory usage unaccounted for. In workloads with many small segments, this allowed Redpanda to materialize far more segments than memory could support, resulting in an OOM.

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v25.3.x
  • v25.2.x
  • v25.1.x

Release Notes

  • none

The memory semaphore that limits materialized remote segments was
significantly underestimating per-segment memory usage. The projection
only accounted for sizeof(remote_segment) + 4KB for the index, missing
coroutine frames, heap-allocated subobjects (paths, coarse index,
segment_chunks), and ~40KB per segment from chunked_fifo's free chunk
caching policy in the expiring_fifo wait lists. This caused the system
to materialize far more segments than actual memory could support,
leading to OOM under workloads with many small segments and prefetch
enabled.
Contributor

Copilot AI left a comment


Pull request overview

This PR improves memory estimates for materialized remote segments in cloud storage to prevent OOM in workloads with many small segments. The previous estimate of sizeof(remote_segment) + 4KB significantly underestimated actual usage, primarily missing the ~40KB overhead from expiring_fifo retained free chunks in both segment and chunk hydration wait lists.

Changes:

  • Added estimate_wait_list_overhead() static method and improved estimate_memory_use() to account for heap-allocated paths, coarse index, segment chunks, and waiter FIFO overhead.
  • Updated projected_remote_segment_memory_usage() in materialized_resources.cc to include subobject, coroutine, and waiter overhead (~56KB total projected overhead vs previous ~4KB).
  • Fixed two typos ("extimate" → "estimate") in log messages and a minor code ordering cleanup in remote_partition.cc.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

  • src/v/cloud_storage/remote_segment.h — Added estimate_wait_list_overhead() static method declaration.
  • src/v/cloud_storage/remote_segment.cc — Implemented estimate_wait_list_overhead() and enhanced estimate_memory_use() with heap path, index, chunk, and waiter overhead accounting.
  • src/v/cloud_storage/materialized_resources.cc — Updated projected memory estimate with subobject, coroutine, and waiter overhead; fixed two "extimate" → "estimate" typos.
  • src/v/cloud_storage/remote_partition.cc — Moved _ts_probe.segment_materialized() after the log statement for consistency.

Comment on lines +377 to +378
// retry chain, etc.)
res += sizeof(segment_chunks);

Copilot AI Mar 8, 2026


sizeof(remote_segment) already includes sizeof(std::optional<segment_chunks>), which provides inline storage for the entire segment_chunks struct. Adding sizeof(segment_chunks) again when _chunks_api is engaged double-counts the inline struct size.

If the intent is to account for heap allocations from segment_chunks's internal data structures (e.g., the _chunks btree_map root node, _prefetches chunked_vector buffer, etc.), consider estimating that heap overhead directly instead of using sizeof(segment_chunks).

For a memory-estimation function aimed at preventing OOM, over-counting is safer than under-counting, so this isn't critical, but it does make the estimate less accurate than intended.

Suggested change:

  -    // retry chain, etc.)
  -    res += sizeof(segment_chunks);
  +    // retry chain, etc.). Account approximately for per-chunk overhead.
@vbotbuildovich
Collaborator

CI test results

Test results on build #81488:

  • CloudTopicsL0GCNodeFailureTest.test_node_failure_mid_gc, arguments {"cloud_storage_type": 2}, integration — FLAKY (passed 10/11). Test passes after retries; no significant increase in flaky rate (baseline=0.0269, p0=1.0000, reject_threshold=0.0100; adj_baseline=0.1000, p1=0.3487, trust_threshold=0.5000).
    Job: https://buildkite.com/redpanda/redpanda/builds/81488#019ccf7b-5bbf-4238-a410-f661d05b4acf
    History: https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=CloudTopicsL0GCNodeFailureTest&test_method=test_node_failure_mid_gc
  • src/v/cloud_storage/tests/remote_partition_fuzz_test, unit — FAIL (passed 0/1).
    Job: https://buildkite.com/redpanda/redpanda/builds/81488#019ccf5c-ce7d-480c-b7ff-e8d6f824eeda

@Lazin
Contributor

Lazin commented Mar 19, 2026

"Test timed out at 2026-03-08 21:59:34 UTC"

I don't think it's caused by the change in this PR.

@Lazin
Contributor

Lazin commented Mar 19, 2026

There is a small chance that the new memory accounting logic limits concurrency, making the test naturally slower. But this is an old test, and I don't think it stresses memory hard enough for that.

