Optimal Online Algorithms for File-Bundle Caching and Generalization to Distributed Caching

11/06/2020
by   Tiancheng Qin, et al.
0

We consider a generalization of the standard cache problem called file-bundle caching, where different queries (tasks), each containing l≥ 1 files, sequentially arrive. An online algorithm that does not know the sequence of queries ahead of time must adaptively decide on what files to keep in the cache to incur the minimum number of cache misses. Here a cache miss refers to the case where at least one file in a query is missing among the cache files. In the special case where l=1, this problem reduces to the standard cache problem. We first analyze the performance of the classic least recently used (LRU) algorithm in this setting and show that LRU is a near-optimal online deterministic algorithm for file-bundle caching with regard to competitive ratio. We then extend our results to a generalized (h,k)-paging problem in this file-bundle setting, where the performance of the online algorithm with a cache size k is compared to an optimal offline benchmark of a smaller cache size h<k. In this latter case, we provide a randomized O(l lnk/k-h)-competitive algorithm for our generalized (h,k)-paging problem, which can be viewed as an extension of the classic marking algorithm. We complete this result by providing a matching lower bound for the competitive ratio, indicating that the performance of this modified marking algorithm is within a factor of two of any randomized online algorithm. Finally, we look at the distributed version of the file-bundle caching problem where there are m≥ 1 identical caches in the system. In this case we show that for m=l+1 caches, there is a deterministic distributed caching algorithm which is (l^2+l)-competitive and a randomized distributed caching algorithm which is O(lln(2l+1))-competitive when l≥ 2.

READ FULL TEXT
research
01/18/2021

Online Caching with Optimal Switching Regret

We consider the classical uncoded caching problem from an online learnin...
research
05/30/2020

Lower Bounds for Caching with Delayed Hits

Caches are a fundamental component of latency-sensitive computer systems...
research
06/28/2018

Generalization of LRU Cache Replacement Policy with Applications to Video Streaming

Caching plays a crucial role in networking systems to reduce the load on...
research
07/13/2022

Caching with Reserves

Caching is a crucial component of many computer systems, so naturally it...
research
11/03/2020

Beyond Worst-case Analysis of Multicore Caching Strategies

Every processor with multiple cores sharing a cache needs to implement a...
research
11/20/2022

Metadata Caching in Presto: Towards Fast Data Processing

Presto is an open-source distributed SQL query engine for OLAP, aiming f...
research
06/11/2022

Online Paging with Heterogeneous Cache Slots

It is natural to generalize the k-Server problem by allowing each reques...

Please sign up or login with your details

Forgot password? Click here to reset