• Official journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science, 2026, Vol. 48, Issue 2: 209-215.

• High Performance Computing •

Congestion control in data center networks based on sending rate gradient

JIANG Yi, WU Xiangjun, ZHANG Jingwei

  1. (Network and Security Center, Wuhan Digital Engineering Institute, Wuhan 430205, China)
  • Received: 2024-05-16 Revised: 2024-07-23 Online: 2026-02-25 Published: 2026-03-10

Abstract: Existing RoCE (RDMA over Converged Ethernet) network protocols suffer from insufficient bandwidth utilization, slow convergence, and inadequate fairness when multiple flows share a link. To address these issues, a dynamic rate adaptive increment algorithm based on the sending rate gradient (DRAI) is proposed as an improvement on the HPCC (high precision congestion control) protocol. First, the switch adds in-band network telemetry (INT) information, including fields such as link capacity and the maximum number of concurrent flows, to data packets. Next, the receiver returns ACK packets carrying the same INT information. Finally, the sender uses the INT information to calculate the rate gradient at the congestion point, uses this gradient as a signal to set a dynamic additive-increase factor, and controls the sending rate with a multiplicative-increase multiplicative-decrease (MIMD) rate adjustment strategy. Experimental results show that, compared with HPCC, the proposed algorithm converges faster and achieves better fairness among flows sharing a link. Its short-flow flow completion time (FCT) is roughly on par with that of HPCC, and it reduces the 99th-percentile FCT of long flows under high load.

Key words: data center network, remote direct memory access, congestion control, in-band network telemetry, RDMA over Converged Ethernet (RoCE) network protocol
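
The sender-side step described in the abstract can be illustrated with a rough sketch. The abstract does not give DRAI's formulas, so everything below is an assumption for illustration only: the `IntSample` fields, the `rate_gradient` definition, and the rule that scales the additive-increase term are hypothetical, and only the multiplicative term follows the general shape of an HPCC-style update (current rate divided by normalized utilization over a target `eta`). It should not be read as the authors' algorithm.

```python
from dataclasses import dataclass


@dataclass
class IntSample:
    """Hypothetical per-hop telemetry echoed back to the sender in an ACK."""
    ts: float        # switch timestamp, seconds
    tx_bytes: int    # cumulative bytes sent on the egress port
    qlen: int        # egress queue length, bytes
    capacity: float  # link capacity, bytes per second
    max_flows: int   # maximum number of concurrent flows on the link


def tx_rate(prev: IntSample, cur: IntSample) -> float:
    """Average egress rate between two samples (assumes cur.ts > prev.ts)."""
    return (cur.tx_bytes - prev.tx_bytes) / (cur.ts - prev.ts)


def rate_gradient(older: IntSample, prev: IntSample, cur: IntSample) -> float:
    """Assumed definition: change in egress rate, normalized by link capacity."""
    return (tx_rate(prev, cur) - tx_rate(older, prev)) / cur.capacity


def utilization(prev: IntSample, cur: IntSample, base_rtt: float) -> float:
    """HPCC-style normalized utilization: queue share plus egress rate share."""
    return cur.qlen / (cur.capacity * base_rtt) + tx_rate(prev, cur) / cur.capacity


def next_rate(rate: float, older: IntSample, prev: IntSample, cur: IntSample,
              base_rtt: float, eta: float = 0.95) -> float:
    """Return the next sending rate in bytes per second (illustrative rule)."""
    u = max(utilization(prev, cur, base_rtt), 1e-9)  # avoid division by zero
    g = rate_gradient(older, prev, cur)

    # Hypothetical dynamic additive increase: split the unused headroom
    # evenly across flows, and damp it while the link rate is still rising.
    headroom = max(0.0, eta - u) * cur.capacity
    damping = 1.0 - min(max(g, 0.0), 1.0)
    additive = headroom / max(1, cur.max_flows) * damping

    # Multiplicative adjustment toward the target utilization eta,
    # plus the dynamic additive term, capped at link capacity.
    return min(rate * eta / u + additive, cur.capacity)
```

With an underused link the sketch raises the rate quickly because both terms push upward; once a queue builds and utilization exceeds `eta`, the additive term falls to zero and the multiplicative term lowers the rate.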