An approximate algorithm for choosing the optimal subset of nodes in the Angara interconnect with failures

Authors

  • A.V. Mukosey Research Center for Electronic Computing
  • A.S. Semenov Research Center for Electronic Computing

DOI:

https://doi.org/10.26089/NumMet.v18r105

Keywords:

fault tolerance, interconnect, multidimensional torus, connectivity, deterministic routing, direction-order routing

Abstract

The Angara high-speed interconnect, which has a multidimensional torus topology, is under development at the Scientific Research Center for Electronic Computer Technology. When Angara is used in a cluster system, some nodes are busy and some have failed. This raises the problem of finding an optimal subset of cluster nodes such that network traffic between the selected nodes stays within the subset and the subset contains at least a given number of nodes. The paper presents an approximate algorithm for solving this problem.
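
The requirement that traffic stays within the chosen subset can be illustrated with a minimal sketch. It is not the paper's algorithm, only a check of the constraint under simplifying assumptions: dimensions are corrected in index order (one form of direction-order routing), each ring is traversed in the shorter direction, and a failed or busy node is modeled simply by leaving it out of the subset. The function names `dor_path` and `traffic_stays_inside` are hypothetical.

```python
from itertools import product


def dor_path(src, dst, dims):
    """Nodes visited by a direction-order route from src to dst in a torus.

    Assumption: dimensions are corrected in index order, and each ring is
    traversed in the shorter direction (ties go in the positive direction).
    """
    cur = list(src)
    path = [tuple(cur)]
    for axis, size in enumerate(dims):
        forward = (dst[axis] - cur[axis]) % size  # hops in the + direction
        step = 1 if forward <= size - forward else -1
        hops = forward if step == 1 else size - forward
        for _ in range(hops):
            cur[axis] = (cur[axis] + step) % size
            path.append(tuple(cur))
    return path


def traffic_stays_inside(subset, dims):
    """True if every route between two subset nodes visits only subset nodes."""
    allowed = set(subset)
    return all(
        set(dor_path(a, b, dims)) <= allowed
        for a, b in product(allowed, repeat=2)
    )
```

For example, in a 4x4 torus the 2x2 block `{(0, 0), (0, 1), (1, 0), (1, 1)}` passes this check, while `{(0, 0), (2, 0)}` does not, because the route between its two nodes crosses `(1, 0)`, which is outside the subset.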

Published

19-02-2017

How to Cite

Mukosey A., Semenov A. An Approximate Algorithm for Choosing the Optimal Subset of Nodes in the Angara Interconnect With Failures // Numerical Methods and Programming (Vychislitel’nye Metody i Programmirovanie). 2017. 18. 53-64. doi 10.26089/NumMet.v18r105

Section

Section 1. Numerical methods and applications