Competition: The 8th CCF Open Source Innovation Competition, Open Source Task Challenge Track.
Task: Mooncake KVCache storage design and performance optimization.
Work Summary
This submission adds a new Mooncake Store allocation strategy named fragmentation_aware. The strategy improves allocation decisions for mixed-size KV cache workloads by considering the largest contiguous free region in each candidate segment, not only the aggregate free-space ratio.
The feature is designed to reduce failed allocation attempts in long-running clusters where memory becomes fragmented. It keeps Mooncake’s existing best-effort replica semantics and uses a bounded sampled candidate set to avoid full-cluster scans.
Key Deliverables
New AllocationStrategyType::FRAGMENTATION_AWARE.
New FragmentationAwareAllocationStrategy.
Master startup support via --allocation_strategy=fragmentation_aware.
Unit test covering a deterministic fragmentation scenario.
Allocation benchmark matrix support for the new strategy.
Updated design, deployment, and quick-start documentation.
TCP-only / old-libibverbs build compatibility fixes needed for local
validation on Ubuntu 20.04.
Mooncake Fragmentation-Aware Allocation Strategy
Competition: The 8th CCF Open Source Innovation Competition, Open Source Task Challenge Track.
Task: Mooncake KVCache storage design and performance optimization.
Work Summary
This submission adds a new Mooncake Store allocation strategy named
fragmentation_aware. The strategy improves allocation decisions for mixed-size KV cache workloads by considering the largest contiguous free region in each candidate segment, not only the aggregate free-space ratio.The feature is designed to reduce failed allocation attempts in long-running clusters where memory becomes fragmented. It keeps Mooncake’s existing best-effort replica semantics and uses a bounded sampled candidate set to avoid full-cluster scans.
Key Deliverables
AllocationStrategyType::FRAGMENTATION_AWARE.FragmentationAwareAllocationStrategy.--allocation_strategy=fragmentation_aware.Repository
Main working repository:
C:\CCFOpenSource\Repos\Mooncake-githubWorking branch:
ccf-fragmentation-aware-allocationUsage
Start Mooncake master with:
For HTTP metadata quick start:
Status
Implementation patch is prepared at:
C:\CCFOpenSource\Submission\mooncake_fragmentation_aware.patchLocal verification results are recorded in
C:\CCFOpenSource\Submission\testing.mdandC:\CCFOpenSource\Submission\logs.Primary local validation:
allocation_strategy_light_test.log: passed.fragmentation_aware_sim.log: passed.