Mon 3 Mar 2025 15:40 - 16:00 at Acacia D - Session 4: Memory (Session Chair: Dong Li)

In-memory key-value (KV) caching bridges the performance gap between high-performance networks and disk devices. Prior in-memory KV caching systems either target large objects or introduce additional memory overhead. In this paper, we conduct a systematic analysis of 56 production traces and make three observations: (i) small objects dominate the traces and data accesses are highly skewed; (ii) object hotness remains stable across days; and (iii) a multi-get operation that retrieves multiple objects from the same node incurs much shorter tail latency than using only single-get operations.

These observations motivate the design of AC-Cache, a correlation-aware distribution scheme of KV objects for in-memory caching systems. AC-Cache comprises three design primitives: (i) we formulate the distribution of KV objects as an integer linear programming problem that seeks to balance both data accesses and memory overhead; (ii) we capture object correlations in a memory-efficient manner and generate fine-grained correlation groups; and (iii) we formulate the distribution of the correlation groups as a maximum flow problem to balance data accesses, and use a heuristic algorithm to dispatch the remaining KV objects to balance memory consumption. Extensive experiments on Alibaba Cloud show that AC-Cache can reduce the tail latency by 5.1–80.2% and increase the access throughput by 42.8–534.8%.
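As a rough illustration of the idea of grouping correlated objects and placing each group on a single node (so that one multi-get can serve them), the following Python sketch may help. It is not the AC-Cache algorithm: the abstract describes an integer linear program and a maximum flow formulation, whereas this sketch substitutes a simple co-access threshold with union-find for grouping and a greedy least-loaded rule for placement. The function names `correlation_groups` and `place_groups`, the `min_count` threshold, and the toy trace are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def correlation_groups(requests, min_count=2):
    """Group keys that co-occur in at least `min_count` requests.

    Hypothetical helper: uses pairwise co-access counts plus union-find,
    not the memory-efficient correlation capture described in the abstract.
    """
    pair_counts = Counter()
    for keys in requests:
        for a, b in combinations(sorted(set(keys)), 2):
            pair_counts[(a, b)] += 1

    parent = {}

    def find(k):
        parent.setdefault(k, k)
        while parent[k] != k:
            parent[k] = parent[parent[k]]  # path halving
            k = parent[k]
        return k

    # Register every key so uncorrelated keys form singleton groups.
    for keys in requests:
        for k in keys:
            find(k)
    # Merge keys whose co-access count reaches the threshold.
    for (a, b), n in pair_counts.items():
        if n >= min_count:
            parent[find(a)] = find(b)

    groups = {}
    for k in list(parent):
        groups.setdefault(find(k), set()).add(k)
    return list(groups.values())


def place_groups(groups, access_counts, num_nodes):
    """Greedy placement: heaviest group first, onto the least-loaded node.

    Stand-in for the maximum flow formulation; it only balances access load.
    """
    def weight(group):
        return sum(access_counts[k] for k in group)

    load = [0] * num_nodes
    placement = {}
    for group in sorted(groups, key=weight, reverse=True):
        node = min(range(num_nodes), key=load.__getitem__)
        load[node] += weight(group)
        for k in group:
            placement[k] = node
    return placement, load


# Toy trace: each inner list is the set of keys fetched by one request.
requests = [["a", "b"], ["a", "b"], ["c", "d"], ["c", "d"], ["a", "b", "e"], ["e"]]
access_counts = Counter(k for keys in requests for k in keys)
groups = correlation_groups(requests, min_count=2)
placement, load = place_groups(groups, access_counts, num_nodes=2)
```

In this toy trace, keys `a` and `b` end up on one node and `c`, `d`, and `e` on the other, so each correlated pair can be fetched with a single multi-get while the two nodes receive equal access load.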

Mon 3 Mar

Displayed time zone: Pacific Time (US & Canada)

15:40 - 16:40
Session 4: Memory (Session Chair: Dong Li), Main Conference at Acacia D
15:40
20m
Talk
AC-Cache: A Memory-Efficient Caching System for Small Objects via Exploiting Access Correlations
Main Conference
Fulin Nan Xiamen University, Zhirong Shen Xiamen University
16:00
20m
Talk
Effectively Virtual Page Prefetching via Spatial-Temporal Patterns for Memory-intensive Cloud Applications
Main Conference
Yun Wang Shanghai Jiao Tong University, Liang Chen, Tianmai Deng Shanghai Jiao Tong University, Ben Luo Alibaba Group, Yibin Shen Alibaba Cloud, Zhixiang Wei Shanghai Jiao Tong University, Yixiao Xu Shanghai Jiao Tong University, Minglang Huang Shanghai Jiao Tong University, Zhengwei Qi Shanghai Jiao Tong University
16:20
20m
Talk
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion
Main Conference
Hulin Wang, Yaqi Xia Wuhan University, Donglin Yang Nvidia Corporation, Xiaobo Zhou University of Macau, Dazhao Cheng Wuhan University