An efficient large language model inference method for bandwidth-constrained digital signal processors
CHEN Yang,YANG Xi,SU Huayou,CHEN Kangkang
(1.College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;
2.National Key Laboratory of Parallel and Distributed Computing,National University of Defense Technology,Changsha 410073,China)
CHEN Yang, YANG Xi, SU Huayou, CHEN Kangkang. An efficient large language model inference method for bandwidth-constrained digital signal processors[J]. Computer Engineering & Science, 2026, 48(4): 599-607.