JSSC 2024第8期Memory28nmCIM

A 28-nm Floating-Point Computing-in-Memory Processor Using Intensive-CIM Sparse-Digital Architecture Shengzhe Yan

提出一种28nm浮点存内计算处理器，采用密集-稀疏架构提升能效。

28nm CMOS

存内计算浮点运算神经网络能效稀疏计算

▸创新点1：浮点到整数存内计算流程设计（方法创新） - 通过分析浮点数据指数集中特性，提出将浮点运算分解为密集和稀疏部分，设计FP-to-INT转换流程，显著减少存内计算周期数，提升能效比。

▸创新点2：灵活稀疏数字核心（电路创新） - 采用可配置数字处理单元处理稀疏浮点运算，支持动态精度调整，在保证计算精度的同时实现硬件资源的高效利用。

▸创新点3：低比特浮点训练方法（算法创新） - 针对存内计算特性开发专用训练算法，将浮点位宽压缩至低比特（如8位）同时保持模型精度，使能效提升2.3倍（论文实测数据）。

▸创新点4：密集-稀疏混合架构（系统创新） - 首创密集计算采用存内处理、稀疏计算采用数字核的异构架构，通过任务调度器实现两种计算模式的无缝切换，整体能效达16.3TOPS/W。

Abstract

Computing-in-memory (CIM) chips have demon- strated promising high energy efficiency on multiply–accumulate (MAC) operations for artificial intelligence (AI) applications. Though integral (INT) CIM chips are emerging, the floating-point (FP) CIM chip has not been well explored. The high-accuracy demand of larger models and complex tasks requires FP compu- tation. Besides, most of the neural network (NN) training tasks still rely on FP computation. This work presents an energy- efficient FP CIM processor. It is observed that most of the exponent values of FP data are concentrated in a small region. Therefore, the FP computations are divided into intensive and sparse parts and then executed on an intensive-CIM sparse- digital architecture. First, an FP-to-INT CIM workflow for the intensive FP operations is designed to reduce the CIM execution cycles. Second, a flexible sparse-digital core is proposed for the remaining sparse FP operations. Utilizing both the intensive- CIM and sparse-digital cores, this work can achieve both high energy efficiency and identical accuracy to the FP algorithm baseline. Considering the FP CIM execution flow, a CIM-friendly low-bit FP training method is proposed to further reduce the Manuscript received 30 October 2023; revised 27 December 2023; accepted 31 January 2024. Date of publication 16 February 2024; date of current version 25 July 2024. This article was approved by Associate Editor Meng-Fan Chang. This work was supported in part by the Nati