← 返回 JSSC 论文列表
📄 下载 JSSC 原文 PDF
JSSC 2023第10期Digital CircuitsIntel 22FFLNeural Network Accelerator

An Energy-Efficient Bayesian Neural Network Accelerator With CiM and a Time-Interleaved Hadamard Digital GRNG Using 22-nm FinFET

提出一种基于存内计算和高斯随机数生成器的能效贝叶斯神经网络加速器。
32.2 TOP/sW峰值能效, 7.31 GSamp/s随机数生成吞吐量, 1170 GOP/s/W系统能效
贝叶斯神经网络存内计算高斯随机数生成器能效优化不确定性建模
创新点1:多比特模拟存内计算SRAM宏(电路创新) - 采用Intel 22FFL工艺实现8位精度的多比特模拟CiM SRAM宏,峰值能效达32.2 TOP/s/W,计算误差低于0.5%,显著提升矩阵向量乘法的能效和精度。
创新点2:高效高斯随机数生成器(电路创新) - 设计可变精度GRNG模块,峰值吞吐量达7.31 GSamp/s,能效约1 TSamp/J,为BNN不确定性估计提供高质量统计特性支持。
创新点3:存内计算与随机数生成的紧耦合架构(系统创新) - 通过CiM宏与GRNG的协同优化,整体系统能效达1170 GOP/s/W,较现有BNN加速器提升35-133倍,MNIST准确率达98.14%。
创新点4:时间交织随机数生成技术(方法创新) - 采用时间交织调度策略优化GRNG资源利用率,在保证统计质量的同时实现高吞吐低功耗的随机数生成。
Abstract
Bayesian neural networks (BNNs) have been proposed to address the problems of overfitting and overconfident decision making, common in conventional neural networks (NNs), due to their ability to model and express uncertainty in their pre- dictions. However, BNNs require multiple inference passes to pro- duce the necessary posterior distributions used to generate these highly desirable uncertainty estimates. As such, BNNs require not only an efficient, high-performance multiply-accumulation (MAC) operation but also an efficient Gaussian random number generator (GRNG) with high-quality statistics. In this article, an NN accelerator chip, leveraging a multi-bit analog compute- in-memory (CiM) static random-access memory (SRAM) macro, with a tightly coupled and highly efficient GRNG scheme, is pre- sented in the Intel 22FFL process. The CiM macro achieves a peak energy efficiency of 32.2 TOP/sW, with 8-bit precision, while ensuring accurate on-chip matrix-vector multiplications (MVMs) with a computation error less than 0.5%. The variable precision GRNG achieves a peak throughput of 7.31 GSamp/s for an energy efficiency of ∼1 TSamp/J. Overall, our proposed system achieves a peak energy efficiency of 1170 GOP/s/W, a 35–133× improvement over the state-of-the-art BNN accelerators, with 98.14% accuracy for the MNIST dataset.