UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Adaptive Online Cache Capacity Optimization via Lightweight Working Set Size Estimation at Scale

Gu, Rong; Li, Simian; Dai, Haipeng; Wang, Hancheng; Luo, Yili; Fan, Bin; Basat, Ran Ben; ... Chen, Guihai; + view all (2023) Adaptive Online Cache Capacity Optimization via Lightweight Working Set Size Estimation at Scale. In: Proceedings of the 2023 USENIX Annual Technical Conference. (pp. pp. 467-484). USENIX: Boston, MA, USA. Green open access

[thumbnail of atc23-gu.pdf]
Preview
Text
atc23-gu.pdf - Published Version

Download (5MB) | Preview

Abstract

Big data applications extensively use cache techniques to accelerate data access. A key challenge for improving cache utilization is provisioning a suitable cache size to fit the dynamic working set size (WSS) and understanding the related item repetition ratio (IRR) of the trace. We propose Cuki, an approximate data structure for efficiently estimating online WSS and IRR for variable-size item access with proven accuracy guarantee. Our solution is cache-friendly, thread-safe, and light-weighted in design. Based on that, we design an adaptive online cache capacity tuning mechanism. Moreover, Cuki can also be adapted to accurately estimate the cache miss ratio curve (MRC) online. We built Cuki as a lightweight plugin of the widely-used distributed file caching system Alluxio. Evaluation results show that Cuki has higher accuracy than four state-of-the-art algorithms by over an order of magnitude and with better stability in performance. The end-to-end data access experiments show that the adaptive cache tuning framework using Cuki reduces the table querying latency by 79% and improves the file reading throughput by 29% on average. Compared with the cutting-edge MRC approach, Cuki uses less memory and improves accuracy by around 73% on average. Cuki is deployed on one of the world’s largest social platforms to run the Presto query workloads.

Type: Proceedings paper
Title: Adaptive Online Cache Capacity Optimization via Lightweight Working Set Size Estimation at Scale
Event: 2023 USENIX Annual Technical Conference (USENIX ATC '23)
ISBN-13: 978-1-939133-35-9
Open access status: An open access version is available from UCL Discovery
Publisher version: https://www.usenix.org/conference/atc23/presentati...
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10183246
Downloads since deposit
324Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item