TY - JOUR
T1 - Data management in networks
T2 - Experimental evaluation of a provably good strategy
AU - Krick, C.
AU - Auf der Heide, F. Meyer
AU - Räcke, H.
AU - Vöcking, B.
AU - Westermann, M.
N1 - Funding Information:
⁄The first three authors were partially supported by the DFG-Sonderforschungsbereich 376 and the IST Programme of the EU under Contract Number IST-1999-14186 (ALCOM-FT). The fourth author was partially supported by the IST Programme of the EU under Contract Number IST-1999-14186 (ALCOM-FT), and this research was conducted while he was staying at the Heinz Nixdorf Institute, with support provided by the DFG-Sonderforschungsbereich 376. The fifth author was supported by a fellowship within the ICSI-Postdoc-Programme of the German Academic Exchange Service (DAAD), and this research was conducted while he was staying at the Heinz Nixdorf Institute, with support provided by the DFG-Sonderforschungsbereich 376 and the IST Programme of the EU under Contract Number IST-1999-14186 (ALCOM-FT).
PY - 2002
Y1 - 2002
N2 - This paper deals with data management for parallel and distributed systems. We present the DIVA (Distributed Variables) library that provides direct access to shared data objects from each node in a network. The current implementations are based on mesh-connected massively parallel computers. Our algorithms dynamically create and discard copies of the data objects in order to reduce the communication overhead. We use a non-standard approach based on a randomized but locality preserving embedding of "access trees" into the network. The access tree strategy was previously analyzed only in a theoretical model. A competitive analysis proved that the strategy minimizes the network congestion up to small factors. In this paper the access tree strategy is evaluated experimentally. We test several variations of this strategy on three different applications of parallel computing, namely matrix multiplication, bitonic sorting, and Barnes-Hut N-body simulation. We compare the access tree strategy with a standard caching strategy using a fixed home for each data object. Our experiments show that the access tree strategy outperforms the fixed home strategy clearly as it avoids network congestion. Furthermore, we do comparisons with hand-optimized message passing strategies. In fact, we observe that the access tree strategy comes reasonably close to the performance of the hand-optimized message passing strategies while the fixed home strategy performs poorly.
AB - This paper deals with data management for parallel and distributed systems. We present the DIVA (Distributed Variables) library that provides direct access to shared data objects from each node in a network. The current implementations are based on mesh-connected massively parallel computers. Our algorithms dynamically create and discard copies of the data objects in order to reduce the communication overhead. We use a non-standard approach based on a randomized but locality preserving embedding of "access trees" into the network. The access tree strategy was previously analyzed only in a theoretical model. A competitive analysis proved that the strategy minimizes the network congestion up to small factors. In this paper the access tree strategy is evaluated experimentally. We test several variations of this strategy on three different applications of parallel computing, namely matrix multiplication, bitonic sorting, and Barnes-Hut N-body simulation. We compare the access tree strategy with a standard caching strategy using a fixed home for each data object. Our experiments show that the access tree strategy outperforms the fixed home strategy clearly as it avoids network congestion. Furthermore, we do comparisons with hand-optimized message passing strategies. In fact, we observe that the access tree strategy comes reasonably close to the performance of the hand-optimized message passing strategies while the fixed home strategy performs poorly.
UR - http://www.scopus.com/inward/record.url?scp=0036489339&partnerID=8YFLogxK
U2 - 10.1007/s00224-001-1045-z
DO - 10.1007/s00224-001-1045-z
M3 - Article
AN - SCOPUS:0036489339
SN - 1432-4350
VL - 35
SP - 217
EP - 245
JO - Theory of Computing Systems
JF - Theory of Computing Systems
IS - 2
ER -