• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (4): 580-589.

• High Performance Computing • Previous Articles     Next Articles

HSI:A high-bandwidth and low-latency protocol conversion mechanism for multiple chiplets

WANG Yong,YANG Qianming,FU Wenwen,WANG Yongwen   

  1. (1.College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;
    2.Key Laboratory of Advanced Microprocessor Chips and Systems,Changsha 410073,China)
  • Received:2024-10-18 Revised:2025-01-18 Online:2026-04-25 Published:2026-04-29

Abstract: Chiplet technology has emerged as a promising approach to extend Moore’s Law and enhance chip performance due to its low cost, high yield, and high integration density. Currently, research on inter-chiplet transmission primarily focuses on high-speed interconnect interfaces, while the study of protocol conversion technology from the network-on-chip (NoC) to chiplet interfaces remains underexplored, posing a bottleneck for transmission latency and bandwidth between chiplets. This paper proposes a high-bandwidth and low-latency protocol conversion mechanism, named HSI. HSI employs a combined polling scheduling strategy to read multiple types of Flits from the NoC, thereby reducing transmission latency and enhancing bandwidth. It utilizes a multi-slice packet format to encapsulate Flits, improving effective bandwidth utilization, and adopts a multi-write single-read queue structure to support parallel memory access for multiple Flits, reducing parsing latency. To validate the superiority of HSI, this paper implements and verifies the HSI mechanism with respect to the mainstream CHI network protocol and UCIe chiplet interface protocol. The results demonstrate that HSI achieves a transmission bandwidth of up to 512  Gbit/s, which is compatible with the transmission bandwidth of 32-lane UCIe and the memory access bandwidth of DDR5.0. Moreover, the transmission latency for a single Flit is merely 6.05 ns, while the average transmission latency for burst Flit streams ranges from 1.2~1.7 ns.

Key words: multiple chiplets, network-on-chip(NoC), protocol transition mechanism, coherent hub interface(CHI) protocol, universal chiplet interconnect express(UCIe) protocol