• Publication of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science


Deep learning parallel optimization mechanism
based on dynamic distribution of training data

YAN Zijie,CHEN Mengqiang,WU Weigang   

  1. (School of Data and Computer Science, Sun Yat-sen University, Guangzhou 510006, China)
  • Received:2018-07-13 Revised:2018-09-20 Online:2018-11-26 Published:2018-11-25

Abstract:

To solve the timeconsuming problem of collecting gradient updates under synchronous parallel training, we present a dynamic training data distribution algorithm under parallel synchronization of multiple machines. By calculating the computational efficiency of nodes, the amount of sample data that needs to be processed by nodes is dynamically assigned after each round of iteration. Such a mechanism allows the model to parallelize synchronously and reduce the waiting time it takes for gradient update. Finally, the mechanism is implemented via MXNet and evaluated at Tianhe2 supercomputers. Experimental results show that the proposed optimization mechanism achieves expected results.
 

Key words: deep learning, data assignment, synchronous parallel, parallel training, supercomputing