[server] throttling deletes to limit disk i/o #2397
base: main
Conversation
huangminchn left a comment:
Thanks for the quick turnaround, Yug!
Can we double check the code path used during backup version deletion? My understanding is that it's going through "RocksDBStoragePartition#drop()".
We can add a test case for backup version deletion if none exists yet, and validate the deletion rate.
new VeniceStoreVersionConfig(testStore, veniceServerProperties, PersistenceType.ROCKS_DB);
StorageEngine storeEngine = factory.getStorageEngine(testStoreConfig);
Assert.assertNotNull(storeEngine, "Storage engine should be created successfully");
factory.removeStorageEngine(storeEngine);
Can we double check that this code path is used during backup version deletion?
Yeah, it is in the code path. The call stack looks like the following:
VersionBackend.delete()
→ IngestionBackend.removeStorageEngine(topicName)
→ StorageService.removeStorageEngine(kafkaTopic)
→ StorageEngine.drop()
→ AbstractStorageEngine.drop() [iterates partitions]
→ RocksDBStoragePartition.drop()
→ RocksDB.destroyDB()
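For the throttle to apply on this path, the Options handed to RocksDB.destroyDB() have to reference the shared SstFileManager that carries the rate limit. A minimal sketch of that wiring with the RocksJava API (illustrative only, not the actual Venice code; the class and method names here are made up):

```java
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;
import org.rocksdb.SstFileManager;

// Sketch only: destroyDB() can be throttled only when the Options it receives
// point at the same shared SstFileManager that holds the delete rate limit.
final class ThrottledDropSketch {
  static void dropPartitionDirectory(String dbPath, SstFileManager sharedSstFileManager)
      throws RocksDBException {
    try (Options options = new Options().setSstFileManager(sharedSstFileManager)) {
      RocksDB.destroyDB(dbPath, options);
    }
  }
}
```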
// SstFileManager deletion rate limiting - disabled by default (0) for backward compatibility
// Recommended: 64 MB/s (67108864) for nodes with 'discard' mount option
this.sstFileManagerDeleteRateBytesPerSecond =
    props.getSizeInBytes(ROCKSDB_SST_FILE_MANAGER_DELETE_RATE_BYTES_PER_SECOND, 0L);
this.sstFileManagerMaxTrashDBRatio = props.getDouble(ROCKSDB_SST_FILE_MANAGER_MAX_TRASH_DB_RATIO, 0.25); // 25% default
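For reference, a minimal sketch of how these two values map onto the RocksJava SstFileManager API (illustrative only, not the actual RocksDBStorageEngineFactory code; treating a rate of 0 as "disabled" mirrors the backward-compatible default above):

```java
import org.rocksdb.Env;
import org.rocksdb.RocksDBException;
import org.rocksdb.SstFileManager;

// Illustrative wiring of the two new configs into a shared SstFileManager.
final class SstFileManagerSetupSketch {
  static SstFileManager create(long deleteRateBytesPerSecond, double maxTrashDbRatio)
      throws RocksDBException {
    SstFileManager sstFileManager = new SstFileManager(Env.getDefault());
    if (deleteRateBytesPerSecond > 0) { // 0 keeps deletes unthrottled (default)
      sstFileManager.setDeleteRateBytesPerSecond(deleteRateBytesPerSecond);
      sstFileManager.setMaxTrashDBRatio(maxTrashDbRatio);
    }
    return sstFileManager;
  }
}
```

Since the factory shares one SstFileManager across the partitions it creates, a single rate limit then covers every partition drop on the node.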
Wouldn't this mean that 75% of the DB will be deleted in a throttled manner, but then the last 25% will be deleted immediately?
We need the throttling to continue to be applied all the way through.
I do not think that is what the config value means. It specifies the upper limit on the trash-to-live-DB size ratio that can be tolerated before files are deleted immediately, overriding the rate limit. So, when the ratio of trash to live DB hits 25%, throttling will be ignored.
I have added a test to validate the deletion rate.
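For context, a rough sketch of what such a deletion-rate check could look like, continuing the test snippet quoted above (this is not the test added in the PR; getOnDiskSizeBytes is a hypothetical helper, and the numbers are only for illustration):

```java
// Rough sketch: drop a store of known on-disk size and assert the drop took at
// least size / rate seconds, i.e. file deletions were actually throttled.
// Note: once trash exceeds maxTrashDBRatio of the live DB size, deletions go
// unthrottled, so a real test has to pick sizes where this lower bound still holds.
long rateBytesPerSecond = 64L * 1024 * 1024; // 64 MB/s
long storeSizeBytes = getOnDiskSizeBytes(testStore); // hypothetical helper
long startMs = System.currentTimeMillis();
factory.removeStorageEngine(storeEngine);
long elapsedMs = System.currentTimeMillis() - startMs;
long minExpectedMs = storeSizeBytes * 1000L / rateBytesPerSecond;
Assert.assertTrue(
    elapsedMs >= minExpectedMs,
    "Deletion finished too fast; the delete rate limit does not appear to be applied");
```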
Problem Statement
Venice storage nodes with the 'discard' mount option enabled are experiencing disk health check failures during large store deletions: synchronous TRIM operations saturate disk I/O, causing the DiskHealthCheckService to time out and mark nodes unhealthy.
Solution
Added configurable deletion rate limiting for RocksDB's SstFileManager to throttle file deletion operations during store cleanup:
- Added two new configuration parameters to RocksDBServerConfig: rocksdb.sst.file.manager.delete.rate.bytes.per.second and rocksdb.sst.file.manager.max.trash.db.ratio
- Updated RocksDBStorageEngineFactory to apply deletion rate limits to the shared SstFileManager instance when configured

Code changes
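As an illustration of how the new knobs would be set in server properties (the 64 MB/s value is the recommendation from the code comment above; exactly how these properties reach the server is deployment-specific):

```properties
# Recommended 64 MB/s for nodes mounted with 'discard'; 0 (the default) disables throttling
rocksdb.sst.file.manager.delete.rate.bytes.per.second=67108864
# Deletes become unthrottled once trash exceeds this fraction of the live DB size (default 0.25)
rocksdb.sst.file.manager.max.trash.db.ratio=0.25
```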
Concurrency-Specific Checks
Both reviewer and PR author to verify
- Synchronization mechanisms (e.g., synchronized, RWLock) are used where needed.
- Thread-safe data structures (e.g., ConcurrentHashMap, CopyOnWriteArrayList) are used where appropriate.

How was this PR tested?
Does this PR introduce any user-facing or breaking changes?