← 返回 JSSC 论文列表JSSC 2024第8期Memory28nmCIM
A 28-nm Floating-Point Computing-in-Memory Processor Using Intensive-CIM Sparse-
提出一种28nm浮点存内计算处理器,采用密集-稀疏架构提升能效。
28nm CMOS
存内计算浮点运算神经网络能效稀疏计算
▸创新点1:浮点到整数存内计算流程设计(方法创新) - 通过分析浮点数据指数集中特性,提出将浮点运算分解为密集和稀疏部分,设计FP-to-INT转换流程,显著减少存内计算周期数,提升能效比。
▸创新点2:灵活稀疏数字核心(电路创新) - 采用可配置数字处理单元处理稀疏浮点运算,支持动态精度调整,在保证计算精度的同时实现硬件资源的高效利用。
▸创新点3:低比特浮点训练方法(算法创新) - 针对存内计算特性开发专用训练算法,将浮点位宽压缩至低比特(如8位)同时保持模型精度,使能效提升2.3倍(论文实测数据)。
▸创新点4:密集-稀疏混合架构(系统创新) - 首创密集计算采用存内处理、稀疏计算采用数字核的异构架构,通过任务调度器实现两种计算模式的无缝切换,整体能效达16.3TOPS/W。
Abstract
Computing-in-memory (CIM) chips have demon-
strated promising high energy efficiency on multiply–accumulate
(MAC) operations for artificial intelligence (AI) applications.
Though integral (INT) CIM chips are emerging, the floating-point
(FP) CIM chip has not been well explored. The high-accuracy
demand of larger models and complex tasks requires FP compu-
tation. Besides, most of the neural network (NN) training tasks
still rely on FP computation. This work presents an energy-
efficient FP CIM p