← 返回 JSSC 论文列表JSSC 2024第10期Digital CircuitsNeural Network Accelerator
A 0.67-to-5.4 TSOPs/W Spiking Neural Network Accelerator With 128/256 Reconfigurable Neurons and Asynchronous Fully Connected Synapses Xiang’ao Qi
一款支持128/256可重构神经元的SNN加速器,实现LIF和IZ神经元模型切换,峰值能效达5.37 TSOPs/W。
峰值能效5.37 TSOPs/W,吞吐量25.6 MSOPs/s,图像特征提取能耗9.27 pJ/pixel
脉冲神经网络神经形态计算可重构架构异步突触能效优化
▸创新点1:可重构神经元电路(支持LIF和IZ模型)——电路创新,通过动态切换LIF(低复杂度)和IZ(高精度)神经元模型,实现脑仿真与机器学习任务的双模式支持,提升硬件适用性。
▸创新点2:异步全连接突触结构——系统创新,采用异步通信机制消除时钟同步开销,结合全连接拓扑动态分配带宽,缓解网络拥塞,实现25.6 MSOPs/s的高吞吐量。
▸创新点3:近阈值操作降低能耗——电路创新,神经元工作在近阈值电压区域,结合异步突触设计,使图像特征提取能耗降至9.27 pJ/pixel,能效比达5.37 TSOPs/W。
▸创新点4:混合精度NoC优化——系统创新,针对稀疏脉冲分布设计动态路由算法,减少数据包冲突,提升突触并行处理效率(128/256神经元可配置)。
Abstract
Spiking neural networks (SNNs) are garnering increasing attention due to their potential to explore the com- plexities of the human brain and utilize its capabilities. The broad spectrum of applications presents challenges in designing SNN-based neuromorphic systems First, the SNN uses complex models [e.g., Izhikevich (IZ)] for brain simulations and simpler models [e.g., Leaky Integrate and Fire (LIF)] for efficient machine learning, presenting a challenge in realizing neuron circuits supporting diverse applications. Second, densely connected net- works with uneven spike distributions lead to Network-on-Chip (NoC) congestion and delays, complicating the optimization of throughput/area. An SNN accelerator, featuring 128/256 recon- figurable neurons and asynchronous fully connected synapses, has been developed to address these challenges. The reconfigurable neuron circuit is capable of switching between the LIF neuron model and the IZ neuron model. The proposed chip achieves a peak power efficiency of 5.37 TSOPs/W and throughput of 25.6 MSOPs/s. The near-threshold operation of neurons, in con- junction with asynchronous fully connected synapse, reduces energy by 9.42× to a 9.27 pJ/pixel in image feature extraction.