[1]Lecun Y,Bengio Y,Hinton G.Deep learning[J].Nature,2015,521(7553):436444.
[2]Li X,Zhang G,Huang H H,et al.Performance analysis of GPUbased convolutional neural networks[C]∥Proc of IEEE International Conference on Parallel Processing,2016:6776.
[3]Li M.Scaling distributed machine learning with the parameter server[C]∥Proc of ACM International Conference on Big Data Science and Computing,2014:3.
[4]Smola A,Narayanamurthy S.An architecture for parallel topic models[J].VLDB Endowment,2010,3(12):703710.
[5]Ahmed A,Aly M,Gonzalez J,et al.Scalable inference in latent variable models[C]∥Proc of ACM International Conference on Web Search and Data Mining,2012:123132.
[6]Li M,Zhou L,Yang Z,et al.Parameter server for distributed machine learning[C]∥
Proc of ACM NIPS Workshop on Big Learning,2013:110.
[7]Dean J,Corrado G S,Monga R,et al.Large scale distributed deep networks[C]∥Proc of International Conference on Neural Information Processing Systems,
2012:12231231.
[8]Ho Q,Cipar J,Cui H,et al.More effective distributed ML via a stale synchronous parallel parameter server[C]∥Proc of
International Conference on Neural Information Processing Systems, 2013:12231231.
[9]Zhang H,Hu Z,Wei J,et al.Poseidon:A system architecture for efficient GPUbased deep learning on multiple machines[J].Computer Science,2015,arXiv:1512.06216.
[10]Abadi M,Barham P,Chen J,et al.TensorFlow:A system for largescale machine learning[C]∥Proc of the 12th ACM USENIX Conference on Operating Systems Design and Implementation,2016:265283.
[11]Wang M,Xiao T,Li J,et al.Minerva:A scalable and highly efficient training platform for deep learning[C]∥Proc of NIPS Workshop on Distributed Machine Learning & Matrix Computations,2014:19.
[12]Valiant L G.A bridging model for parallel computation[J].COMMUNICATIONS of the ACM,1990,33(8):103111.
[13]Mccoll W F.Bulk synchronous parallel computing[M].Oxford:Oxford University Press,1995.
[14]Cui H,Cipar J,Ho Q,et al.Exploiting bounded staleness to speed up big data analytics[C]∥Proc of Usenix Technical Conference,2014:3748.
[15]Zhang W,Gupta S,Lian X,et al.Stalenessaware asyncSGD for distributed deep learning[C]∥
Proc of International Joint Conference on Artificial Intelligence,2016:23502356.
[16]Lecun Y,Bottou L,Bengio Y,et al.Gradientbased learning applied to document recognition[J].Proceedings of the IEEE,
1998,86(11),22782324.
[17]He K,Zhang X,Ren S,et al.Deep residual learning for image recognition[C]∥Proc of IEEE Conference on Computer Vision and Pattern Recognition
,2016:770778. |