Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (12): 2099-2112.
• High Performance Computing • Previous Articles Next Articles
SHI De-jun1,LI Hong-liang2,HU Shu-kai2
Received:
2022-12-13
Revised:
2023-04-03
Accepted:
2023-12-25
Online:
2023-12-25
Published:
2023-12-14
SHI De-jun, LI Hong-liang, HU Shu-kai . A Clos network based high-radix router structure[J]. Computer Engineering & Science, 2023, 45(12): 2099-2112.
[1] | TOP 500 the list[EB/OL].[2022-11-28].https://www.top500.org. |
[2] | Broadcom[EB/OL].[2022-11-28].https://investors.broadcom.com. |
[3] | Xsight[EB/OL].[2022-11-28].https://xsightlabs.com. |
[4] | Innovium TERALYNX 8 25.6 Tbps switch chip launched [EB/OL].[2022-11-28].https://www.servethehome.com/innovium-teralynx-8-25-6tbps-switch-chip-launched. |
[5] | Cisco silicon one[EB/OL].[2022-11-28].https://www.cisco.com/c/en/ us/solutions/silicon-one.html. |
[6] | Gao Jian-gang,Lu Hong-sheng,He Wang-quan,et al.The interconnection network and message machanism of Sunway exascale prototype system[J].Chinese Journal of Computers,2021,44(1):222-234.(in Chinese) |
[7] | Dally W J,Towles B.Principles and practices of interconnection networks[M].New York:Morgan Kaufmann Publishers,2004. |
[8] | Kim J,Dally W J,Towles B,et al.Microarchitecture of a high radix router[J]. ACM SIGARCH Computer Architecture News,2005,33(2):420-431. |
[9] | McKeown N. The iSLIP scheduling algorithm for input-queued switches[J].IEEE/ACM Trans- actions on Network- ing,1999,7(2):188-201. |
[10] | Karol M J,Hluchyj M G,Morgan S P.Input versus output queueing on a space-division packet switch[J].IEEE Trans- actions on Communications,1987,35(12):1347-1356. |
[11] | Chen M, Georganas N D,Yang O W W.A fast algorithm for multi-channel/port traffic assignment[C]∥Proc of 1994 International Conference on Communications,1994:96-100. |
[12] | Hluchyj M G,Karol M J.Queueing in high-performance packet switching[J].IEEE Journal on Selected Areas in Communications,1988,6(9):1587-1597. |
[13] | Tamir Y, Frazier G L.High-performance multi-queue buf- fers for VLSI communications switches[J]. ACM SIGARCH Computer Architecture,1988,16(2):343-354. |
[14] | Tamir Y, Frazier G L.Dynamically-allocated multi-queue buffers for VLSI communication switches[J].IEEE Trans- actions on Computers,1992,41(6):725-737. |
[15] | McKeown N W, Anantharam V,Walrand J C.Achieving 100% throughput in an input-queued switch[J].IEEE Transactions on Communications,1999,47(8):296-302. |
[16] | Ahmadi H, Denzel W E, Murphy C A, et al. A high- performance switch fabric for integrated circuit and packet switching[C]∥Proc of the 7th Annual Joint Conference of the IEEE Computer and Communcations Societies,1988:9-18. |
[17] | Suzuki H, Nagano H,Suzuki T Y,et al.Output-buffer switch architecture for asynchronous transfer mode[C]∥Proc of IEEE International Conference on Communications, World Prosperity Through Communications, 1989: 99-103. |
[18] | Prabhakar B, McKeown N. On the speedup required for combined input and output queued switching[J].Automat- ica,1999,35(12):1909-1920. |
[19] | Chuang S-T, Goel A,McKeown N,et al.Matching output queueing with a combined input/output-queued switch[J].IEEE Journal on Selected Areas in Communications,1999,17(6):1030-1039. |
[20] | Jun J A,Byun S H,Ahn B J,et al.Two-dimensional crossbar matrix switch architecture[C]∥Proc of Asia Pacific Conference on Communications,2002:411-415. |
[21] | Dai Y, Wang K F,Qu G, et al.A scalable and resilient microarchitecture based on multiport binding for high-radix router design[C]∥Proc of 2017 IEEE International Parallel and Distributed Processing Symposium,2017:429-438. |
[22] | Mora G, Flich J, Duato J,et al.Towards an efficient switch architecture for high-radix switches[C]∥Proc of the 2006 ACM/IEEE Symposium on Architecture for Networking and Communications Systems,2006:11-20. |
[23] | Wang K F, Fang M,Chen S Q.Design of a tile-based high-radix switch with high throughput[C]∥Proc of 2011 2nd International Conference on Networking and Information Technology,2011:277-285. |
[24] | Ahn J H,Choo S,Kim J.Network within a network approach to create a scalable high-radix router microarchitecture[C]∥Proc of IEEE International Symposium on High-Performance Computer Architecture,2012:1-12. |
[25] | Ahn J H,Binkert N,Davis A,et al.HyperX:Topology,routing,and packaging of efficient large-scale networks[C]∥Proc of the Conference on High Performance Computing Networking,Storage and Analysis,2009:1-11. |
[26] | Chiussi F M,Kneuer J G,Kumar V P.Low-cost scalable switching solutions for broadband networking:The ATLANTA architecture and chipset[J].IEEE Communications Magazine,1997,35(12):44-53. |
[27] | Chao H J,Jing Z,Liew S Y. Matching algorithms for three-stage bufferless Clos network switches[J].IEEE Communications Magazine,2003,41(10):46-54. |
[28] | Chrysos N,Minkenberg C,Rudquist M,et al.SCOC:High-radix switches made of bufferless Clos networks[C]∥Proc of 2015 IEEE 21st International Symposium on High Performance Computer Architecture,2015:402-414. |
[29] | Scott S,Abts D,Kim J,et al.The BlackWidow high-radix Clos network[C]∥Proc of the 33rd International Symposium on Computer Architecture,2006:16-28. |
[30] | McKeown N W. Scheduling algorithms for input-queued switches[D].California:UC Berkeley,1995. |
[31] | Chao H J,Lam C H,Guo X.A fast arbitration scheme for terabit packet switches[C]∥Proc of Global Telecommunications Conference,1999:1236-1243. |
[32] | Oki E,Jing Z,Rojas-Cessa R,et al.Concurrent round-robin-based dispatching schemes for Clos-network switches[J].IEEE/ACM Transactions on Networking,2002,10(6):830-844. |
[33] | Li Y,Panwar S S,Chao H J.On the performance of a dual round-robin switch[C]∥Proc of IEEE Conference on Computer Communications,2001:1688-1697. |
[34] | Rojas-Cessa R,Oki E,Chao H J.On the combined input-crosspoint buffered switch with round-robin arbitration[J].IEEE Transactions on Communications,2005,53(11):1945-1951. |
附中文参考文献: | |
[6] | 高剑刚,卢宏生,何王全,等.神威 E 级原型机互连网络和消息机制[J].计算机学报,2021,44(1):222-234. |
[1] | ZHANG Tian-yang, CHI Cheng-yue, GUO Wu, GAO Yi-qin, WEN Min-hua, WEI Jian-wen . Key techniques and practice on managing multi-site HPC clusters for university campus [J]. Computer Engineering & Science, 2023, 45(12): 2135-2145. |
[2] | XIAO Tiao-jie, ZHOU Feng, ZHENG Xuan-yu, LIU Jian, CHEN Lin, LIU Jie, YI Ming-kuan, CHEN Xu-guang, GONG Chun-ye, YANG Bo, GAN Xin-biao, LI Sheng-guo, ZUO Ke, . Large-scale 3D electromagnetic modeling in frequency domain using integration equation method [J]. Computer Engineering & Science, 2023, 45(11): 1901-1910. |
[3] | ZHU Wen-long, JIANG Jia-zhi, HUANG Dan, XIAO Nong. ParM: A heterogeneous programming model for domestic processors [J]. Computer Engineering & Science, 2023, 45(09): 1521-1531. |
[4] | WU Tie-bin, GUO Feng, WANG Di. A survey of core computing architecture of high performance processors for exascale computing [J]. Computer Engineering & Science, 2023, 45(05): 761-771. |
[5] | CHEN Feng-xian. Cluster job runtime prediction based on NR-Transformer [J]. Computer Engineering & Science, 2022, 44(07): 1181-1190. |
[6] | CAO Ji-jun. Review of reconfigurable optical interconnection network architecture for HPC and DC [J]. Computer Engineering & Science, 2022, 44(06): 951-963. |
[7] | LIANG Chong-shan, DAI Yi, XU Wei-xia. A super high-radix router based on Chiplet integration technology [J]. Computer Engineering & Science, 2022, 44(02): 207-213. |
[8] | WANG Xin, LIN Fang, LIU Yi, QIAN De-pei. A large-scale Infiniband interconnection network simulation system based on OMNet++ [J]. Computer Engineering & Science, 2021, 43(05): 792-798. |
[9] |
WU Jun-nan, OU Yang, LI Yan.
Design and implementation of a high performance computing user organization management system based on LAMP#br#
#br#
[J]. Computer Engineering & Science, 2021, 43(02): 235-241.
|
[10] | LIU Jie, GONG Chun-ye, YANG Bo, GUO Xiao-wei, GAN Xin-biao, LI Sheng-guo, LI Chao, CHEN Xu-guang, XIAO Tiao-jie, MU Li-an, SONG Min, ZHAO Dong-yong, JU Yu-zhong. YH-ACT:Parallel analysis code of thermohydraulics [J]. Computer Engineering & Science, 2021, 43(01): 58-69. |
[11] | WANG Chao, CAO Jijun, LUO Zhang, LAI Mingche, XU Weixia. Research and implementation of lowlatency forward error correction coding for HPC interconnection network [J]. Computer Engineering & Science, 2020, 42(11): 1965-1972. |
[12] | LI Zhe, TAN Yusong, LI Bao, YU Jie. Cold start optimization on function computing for high performance computing [J]. Computer Engineering & Science, 2020, 42(11): 1973-1980. |
[13] | LI Qiong, SONG Zhen-long, YUAN Yuan, XIE Xu-chao. A regional shared and high concurrent storage architecture based on NVMeoF storage pool [J]. Computer Engineering & Science, 2020, 42(10高性能专刊): 1711-1719. |
[14] | XIE Min, ZHANG Wei, ZHOU En-qiang, DONG Yong. Implementation of scalable communication framework on TH-express interconnection [J]. Computer Engineering & Science, 2020, 42(10高性能专刊): 1720-1729. |
[15] | JIANG Ju-ping, DONG De-zun, TANG Hong, QI Xing-yun, CHANG Jun-sheng, PANG Zheng-bin. Performance evaluation of large-scale HPC interconnection network topologies [J]. Computer Engineering & Science, 2020, 42(10高性能专刊): 1730-1736. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
湘公网安备 43010502000083号
湘ICP备10006030号
Copyright © Computer Engineering & Science, All Rights Reserved.
Address:109 Deya Rd,Changsha,hunan(410073) Tel: 0731-87002567 Email: jsjgcykx@vip.163.com
Powered by Beijing Magtech Co., Ltd.