← 返回 JSSC 论文列表
📄 下载 JSSC 原文 PDF
JSSC 2022第1期Other7nm

A 7-nm Four-Core Mixed-Precision AI Chip With 262-TFLOPS Hybrid-FP8 Training 104

一款7纳米四核混合精度AI芯片,支持FP16、HFP8、INT4和INT2计算精度,用于高效深度学习训练和推理。
262-TFLOPS Hybrid-FP8
混合精度AI芯片深度学习FP8训练INT4推理
支持四种计算精度(FP16、HFP8、INT4、INT2)
采用7纳米工艺实现高能效
支持8位浮点(FP8)训练和INT4推理
Abstract
Reduced precision computation is a key enabling factor for energy-efficient acceleration of deep learning (DL) applications. This article presents a 7-nm four-core mixed- precision artificial intelligence (AI) chip that supports four compute precisions—FP16, Hybrid-FP8 (HFP8), INT4, and INT2—to support diverse application demands for training and inference. The chip leverages cutting-edge algorithmic advances to demonstrate leading-edge power efficiency for 8-bit floating-point (FP8) training and IN