TY - JOUR

T1 - Approximation Algorithms for Data Management in Networks

AU - Krick, Christof

AU - Räcke, Harald

AU - Westermann, Matthias

N1 - Funding Information:
∗The first two authors were partially supported by DFG-Sonderforschungsbereich 376 and the IST Programme of the EU under Contract Number IST-1999-14186 (ALCOM-FT). The third author was supported by a fellowship within the ICSI-Postdoc-Program of the German Academic Exchange Service (DAAD). This research was conducted while he was staying at the Heinz Nixdorf Institute, with support provided by DFG-Sonderforschungsbereich 376 and the IST Programme of the EU under Contract Number IST-1999-14186 (ALCOM-FT).

PY - 2003/9

Y1 - 2003/9

N2 - This paper deals with static data management in computer systems connected by networks. A basic functionality in these systems is the interactive use of shared data objects that can be accessed from each computer in the system. Examples for these objects are files in distributed file systems, cache lines in virtual shared memory systems, or pages in the WWW. In the static scenario we are given read and write request frequencies for each computer-object pair. The goal is to calculate a placement of the objects to the memory modules, possibly with redundancy, such that a given cost function is minimized. With the widespread use of commercial networks, as, e.g., the Internet, it is more and more important to consider commercial factors within data management strategies. The goal in previous work was to utilize the available resources, especially the bandwidth, as best as possible. We present data management strategies for a model in which commercial cost instead of the communication cost is minimized, i.e., we are given a metric communication cost function and a storage cost function. We introduce new deterministic algorithms for the static data management problem on trees and arbitrary networks. Our algorithms aim to minimize the total cost. Note that this problem is MaxSNP-hard on arbitrary networks. Our main result is a combinatorial algorithm that calculates a constant factor approximation for arbitrary networks in polynomial time. Further, we present a dynamic programming algorithm for trees that calculates an optimal placement of all objects in X on a tree T = (V, E) in time O(|X| · |V| · diam(T) · log(deg(T))).

AB - This paper deals with static data management in computer systems connected by networks. A basic functionality in these systems is the interactive use of shared data objects that can be accessed from each computer in the system. Examples for these objects are files in distributed file systems, cache lines in virtual shared memory systems, or pages in the WWW. In the static scenario we are given read and write request frequencies for each computer-object pair. The goal is to calculate a placement of the objects to the memory modules, possibly with redundancy, such that a given cost function is minimized. With the widespread use of commercial networks, as, e.g., the Internet, it is more and more important to consider commercial factors within data management strategies. The goal in previous work was to utilize the available resources, especially the bandwidth, as best as possible. We present data management strategies for a model in which commercial cost instead of the communication cost is minimized, i.e., we are given a metric communication cost function and a storage cost function. We introduce new deterministic algorithms for the static data management problem on trees and arbitrary networks. Our algorithms aim to minimize the total cost. Note that this problem is MaxSNP-hard on arbitrary networks. Our main result is a combinatorial algorithm that calculates a constant factor approximation for arbitrary networks in polynomial time. Further, we present a dynamic programming algorithm for trees that calculates an optimal placement of all objects in X on a tree T = (V, E) in time O(|X| · |V| · diam(T) · log(deg(T))).

UR - http://www.scopus.com/inward/record.url?scp=0142228764&partnerID=8YFLogxK

U2 - 10.1007/s00224-003-1085-7

DO - 10.1007/s00224-003-1085-7

M3 - Article

AN - SCOPUS:0142228764

SN - 1432-4350

VL - 36

SP - 497

EP - 519

JO - Theory of Computing Systems

JF - Theory of Computing Systems

IS - 5

ER -