• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (05): 779-787.

• High Performance Computing •

A Flink load balancing strategy for cloud environment

XU Hao-tong(1,2,3), HUANG Shan(1,2,3), SUN Guo-zhang(1,2,3), HE Fei-li(1,2,3), DUAN Xiao-dong(1,2,3)

  1. College of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, China
  2. State Ethnic Affairs Commission Key Laboratory of Big Data Applied Technology (Dalian Minzu University), Dalian 116600, China
  3. Dalian Key Laboratory of Digital Technology for National Culture (Dalian Minzu University), Dalian 116600, China
  • Received: 2021-11-09  Revised: 2022-01-13  Accepted: 2022-05-25  Online: 2022-05-25  Published: 2022-05-24

Abstract: Flink, a new-generation big data computing engine, is widely used. When Flink is deployed in containers in a cloud environment, its default task scheduling algorithm cannot perceive node resource information or adjust the load in time, so its capacity for autonomous load balancing is poor. Although mainstream container orchestration tools make container management possible, they do not take Flink's characteristics into account, and so fail to balance resource utilization while reducing communication overhead within the container group. To address this problem, this paper proposes a Flink load balancing strategy for cloud environments that jointly considers the distribution of operators in the Flink cluster and the communication mechanism between containers, and uses inter-node communication cost and load balance as its evaluation criteria. Experimental results show that, compared with Flink's default scheduling algorithm, the proposed algorithm effectively improves computing efficiency and system performance.

Key words: Flink, container, communication overhead, load balancing, migration
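
The abstract names two evaluation criteria, communication cost between nodes and load balance, without giving their formulas. The following is only an illustrative sketch of how such criteria could be combined to compare container placements; it is not the paper's algorithm. The function names, the use of standard deviation as the imbalance measure, the weight `alpha`, and the assumption that loads and traffic volumes are already normalized to comparable scales are all hypothetical.

```python
from statistics import pstdev


def load_imbalance(placement, container_load, nodes):
    """Population standard deviation of per-node load (0 means perfectly balanced).

    Assumption: imbalance is measured as a standard deviation; the paper may
    use a different measure.
    """
    node_loads = {node: 0.0 for node in nodes}  # include idle nodes
    for container, node in placement.items():
        node_loads[node] += container_load[container]
    return pstdev(node_loads.values())


def communication_cost(placement, traffic):
    """Total traffic between container pairs that sit on different nodes."""
    return sum(
        volume
        for (a, b), volume in traffic.items()
        if placement[a] != placement[b]
    )


def placement_score(placement, container_load, traffic, nodes, alpha=0.5):
    """Weighted sum of both criteria; lower is better.

    `alpha` is a hypothetical trade-off weight, not a value from the paper.
    """
    return (alpha * load_imbalance(placement, container_load, nodes)
            + (1 - alpha) * communication_cost(placement, traffic))


# Two containers that exchange data, two candidate placements on two nodes.
nodes = ["n1", "n2"]
container_load = {"tm1": 0.4, "tm2": 0.4}
traffic = {("tm1", "tm2"): 1.0}
colocated = {"tm1": "n1", "tm2": "n1"}  # no cross-node traffic, uneven load
spread = {"tm1": "n1", "tm2": "n2"}     # even load, cross-node traffic

score_colocated = placement_score(colocated, container_load, traffic, nodes)
score_spread = placement_score(spread, container_load, traffic, nodes)
```

In this toy case the two criteria pull in opposite directions: co-locating the containers removes cross-node traffic but unbalances the nodes, while spreading them does the reverse. Which placement scores lower depends entirely on the assumed weight and normalization.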