← 返回 JSSC 论文列表
📄 下载 JSSC 原文 PDF
JSSC 2019第1期Digital Circuits65nmNeural Network Accelerator

UNPU- An Energy-Efficient Deep Neural Network Accelerator With Fully Variable We

提出一种能效优化的深度神经网络加速器UNPU,支持可变权重精度和多种网络层类型。
65nm CMOS, 0.63-1.1V, 200MHz, 峰值性能345.6GOPS(16bit)-7372GOPS(1bit)
神经网络加速器能效优化可变精度移动深度学习硬件架构
支持1至16位可变权重精度
基于查找表的位串行处理单元降低能耗
统一架构提升卷积层峰值性能1.15倍
Abstract
An energy-efficient deep neural network (DNN) accelerator, unified neural processing unit (UNPU), is proposed for mobile deep learning applications. The UNPU can support both convolutional layers (CLs) and recurrent or fully connected layers (FCLs) to support versatile workload combinations to accelerate various mobile deep learning applications. In addition, the UNPU is the first DNN accelerator ASIC that can support fully variable weight bit precision from 1 to 16 bit. It enables the UNPU to oper