• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (07): 1173-1184.

Previous Articles     Next Articles

Task scheduling optimization of Flink in container environment

HUANG Shan1,2,3 ,FANG Liu-yi1,2,3 ,XU Hao-tong1,2,3,DUAN Xiao-dong1,2,3#br#

#br#
  

  1. (1.College of Computer Science and Technology,Dalian Minzu University,Dalian 116600;

    2.State Ethnic Affairs Commission Key Laboratory of Big Data Applied Technology,Dalian 116600;

    3.Dalian Key Laboratory of Digital Technology for National Culture,Dalian 116600,China)

  • Received:2021-02-04 Revised:2021-04-12 Accepted:2021-07-25 Online:2021-07-25 Published:2021-08-16

Abstract: With the rapid development of Internet technology, human beings are moving towards the era of big data and cloud computing. As the latest generation of big data computing engine, Flink is favored by academia and industry for its advantages such as low latency and high throughput. When Flink is deployed in the cloud environment, its default task scheduling will lead to uneven load distribution due to the inability to obtain container deployment distribution information. To solve this problem, this paper proposes a Flink task scheduling load balancing algorithm in container environment to obtain the performance information of each node and the distribution information of the container on the node, give priority to the container of nodes with more free resources, and avoid the uneven load caused by the frequent selection of containers. The evaluation results show that the proposed algorithm can more evenly allocate tasks and improve resource utilization and computing speed when deployed in container environment


Key words: container, big data, Flink, task scheduling, load balancing, container environment