DOI: https://doi.org/10.26089/NumMet.v18r105

An approximate algorithm for choosing the optimal subset of nodes in the Angara interconnect with failures

Authors

  • A.V. Mukosey
  • A.S. Semenov

Keywords:

fault tolerance
interconnect
multidimensional torus
connectivity
deterministic routing
direction-order routing

Abstract

The Angara high-speed interconnect with multidimensional torus topology is under development in Scientific Research Center for Electronic Computer Technology. During the utilization of the Angara interconnect in cluster systems, there exist busy and failed nodes. Thus, there is a problem of finding an optimal cluster node subset such that the network traffic belongs to this node subset and the node subset size is not less than a given size. The paper presents an approximate algorithm for solving this problem.


Published

2017-02-19

Issue

Section

Section 1. Numerical methods and applications

Author Biographies

A.V. Mukosey

A.S. Semenov


References

  1. I. A. Zhabin, D. V. Makagon, D. A. Polyakov, et al., “First Generation of Angara High-Speed Interconnection Network,” Naukoemkie Tekhnol., No. 1, 21-27 (2014).
  2. A. A. Agarkov, T. F. Ismagilov, D. V. Makagon, et al., “Performance Evaluation of the Angara Interconnect,” in Proc. Int. Conf. on Russian Supercomputing Days, Moscow, Russia, September 26-27, 2016 (Mosk. Gos. Univ., Moscow, 2016), pp. 626-639.
  3. I. A. Pozhilov, A. S. Semenov, and D. V. Makagon, “Connectivity Problem Solution for Direction Ordered Deterministic Routing in nD Torus,” Programm. Inzhener., No. 3, 13-19 (2015).
  4. V. Puente, R. Beivide, J. A. Gregorio, et al., “Adaptive Bubble Router: A Design to Improve Performance in Torus Networks,” in Proc. Int. Conf. on Parallel Processing, Aizu-Wakamatsu, Japan, September 21-24, 1999 (IEEE Press, Washington, DC, 1999), pp. 58-67.
  5. N. R. Adiga, M. A. Blumrich, D. Chen, et al., “Blue Gene/L Torus Interconnection Network,” IBM J. Res. Develop. 49 (2/3), 265-276 (2005).
  6. S. L. Scott and G. M. Thorson, “The Cray T3E Network: Adaptive Routing in a High Performance 3D Torus,” in Proc. IV Symp. on Hot Interconnects, Palo Alto, USA August 15-17, 1996 (IEEE Press, Washington, DC, 1996), pp. 147-156.