• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (07): 1160-1167.

Previous Articles     Next Articles

Scheduling of heterogeneous tasks for distributed training

YANG Jian-wei,MENG Min,HUANG Jia-le,WU Ji-gang   

  1. (School of Computer Science and Technology,Guangdong University of Technology,Guangzhou 510006,China)
  • Received:2020-09-10 Revised:2020-11-13 Accepted:2021-07-25 Online:2021-07-25 Published:2021-08-16

Abstract: Workers in distributed machine learning often need to deal with heterogeneous tasks during the training process. However, the task publisher may not be able to determine which workers in the cluster of edge server (ES) are currently in training based on effective prior knowledge. To tackle the problem that the ES cluster cannot fulfill the maximization of the training performance and the quality of service at the same time, a scheduling algorithm of heterogeneous tasks is proposed. Firstly, the factors influencing the convergence performance of distributed training are analyzed under the constraints about cluster’s resources. Secondly, the optimization objective for maximizing training performance is established. Finally, the optimization problem is transformed into a multidimensional multiple-choice knapsack problem. The simulation results show that the proposed scheduling algorithm of heterogeneous tasks can maximize the performance of distributed training and simultaneously ensure the quality of ser- vice. 


Key words: distributed training, training performance, scheduling of heterogeneous tasks, multi- dimensional multiple-choice knapsack problem, convergence analysis