[1] Laguna I,Marshall R,Mohror K,et al.A large-scale study of MPI usage in open-source HPC applications[C]∥Proc of the International Conference for High Performance Computing,Networking,Storage and Analysis,2019:1-14.
[2] Gokalgandhi B,Seskar I.Distributed processing for encoding and decoding of binary LDPC codes using MPI[C]∥Proc of 2019 IEEE Conference on Computer Communications Workshops,2019:596-601.
[3] Sefidgar S M H,Firoozjaee A R,Dehestani M.Parallelization of torsion finite element code using compressed stiffness matrix algorithm[J].Engineering with Computers,2021,37:2439-2455.
[4] Tsuji Y,Osawa K,Ueno Y,et al.Performance optimizations and analysis of distributed deep learning with approximated second-order optimization method[C]∥Proc of the 48th International Conference on Parallel Processing,2019:1-8.
[5] Williams-Young D B,Yang C.Parallel shift-invert spectrum slicing on distributed architectures with GPU accelerators[C]∥Proc of the 49th International Conference on Parallel Processing,2020:1-11.
[6] MPICH home page[EB/OL].[2021-10-17]. http://www.mcs.anl.gov/mpi/mpich.
[7] Open MPI:Open source high performance computing[EB/OL].[2021-10-17]. http://www.open-mpi.org.
[8] Kang Q,Traff J L,Al-Bahrani R,et al.Scalable algorithms for MPI intergroup allgather and allgatherv[J].Parallel Computing,2019,85:220-230.
[9] Mallón D A,Taboada G L,Koesterke L.MPI and UPC broadcast,scatter and gather algorithms in Xeon Phi[J].Concurrency and Computation:Practice and Experience,2016,28(8):2322-2340.
[10] Ascension A M,Araúzo-Bravo M J.BigMPI4py:Python module for parallelization of big data objects discloses germ layer specific DNA demethylation motifs[J].IEEE/ACM Transactions on Computational Biology and Bioinformatics,2022,19(3):1507-1522.
[11] Jocksch A,Ohana N,Lanti E,et al.Optimised allgatherv,reduce_scatter and allreduce communication in message-passing systems[J].arXiv:2006.13112,2020.
[12] Pješivac-Grbović J.Towards automatic and adaptive optimizations of MPI collective operations[D].Knoxville:University of Tennessee,2007.
[13] Nuriyev E,Lastovetsky A.Accurate runtime selection of optimal MPI collective algorithms using analytical performance modelling[J].arXiv:2004.11062,2020.
[14] Traff J L.Hierarchical gather/scatter algorithms with graceful degradation[C]∥Proc of the 18th International Parallel and Distributed Processing Symposium,2004:1135-1144.
[15] Traff J L.On optimal trees for irregular gather and scatter collectives[J].IEEE Transactions on Parallel and Distributed Systems,2019,30(9):2060-2074.
[16] MPI:A message-passing interface standard[EB/OL].[2021-10-17]. https://www.mpi-forum.org/docs/mpi-3.1/mpi31-report.pdf.