• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (08): 1331-1339.

• High Performance Computing • Previous Articles     Next Articles

A low-power keyword spotting system with SRAM buffer and computing-in-memory

HUANG Zhi-rui1,2,JIA Xin-ru1,2 ,ZHU Hao-zhe1,2 ,CHEN Chi-xiao1,2   

  1. (1.State Key Laboratory of Integrated Chips and Systems,Fudan University,Shanghai 200433;
    2.Frontier Institute of Chip and System,Fudan University,Shanghai 200438,China)
  • Received:2023-11-10 Revised:2023-12-27 Accepted:2024-08-25 Online:2024-08-25 Published:2024-09-02

Abstract: This paper proposes a low-power keyword spotting (KWS) system to overcome the problem of high-power consumption caused by deploying KWS algorithms on edge computing hardware, which can significantly impact the endurance of mobile devices. The proposed KWS system is based on computing-in-memory (CIM) technology and software-hardware co-design. In terms of algorithm, a ternary quantized MFCC-CNN joint algorithm based on the standard MFCC algorithm topology is proposed. All the general matrix multiplication (GEMM) in MFCC is mapped to the neural network accelerator. At the circuit level, the proposed system uses a computing-in-memory (CIM) core based on SRAM to overcome the power and memory walls in traditional von Neumann architecture accelerators. Additionally, a SRAM buffer circuit based on a look-up table is proposed to replace the register delay chain, which multiplexes the memory array in the CIM core. Both the SRAM-based CIM core and buffer are implemented using custom circuit units. At the system level, a low-power KWS system is proposed utilizing the two customized circuits discussed above. The system is implemented using ASIC and customized circuit design methods and synthesized using a 28 nm process library. The proposed system achieves a processing delay of 64 ms on 10 classification tasks, with a total power consumption of 645.28 μW. The dynamic power consumption of the MFCC pipeline accounts for 5.9% of the total dynamic power consumption, and the total power consumption accounts for only 1.3% of the system's power consumption.

Key words: keyword spotting, ternary quantized neural network, computing-in-memory, serial fast Fourier transform (FFT), software-hardware co-design