← 返回 JSSC 论文列表JSSC 2014第3期Memory65nmDRAM
Low-Power High-Throughput LDPC Decoder Using Non-Refresh Embedded DRAM
采用非刷新嵌入式DRAM的低功耗高吞吐量LDPC解码器设计
9 Gb/s峰值吞吐量,89.5 pJ/b能效,37.7 mW功耗
LDPC解码器非刷新eDRAM低功耗设计高吞吐量IEEE 802.11ad
▸创新点1:利用非刷新eDRAM减少功耗(电路创新)。通过设计3T eDRAM单元,利用LDPC解码器内存访问的确定性特性,消除内存刷新操作,显著降低功耗。实验表明,eDRAM功耗仅占总功耗的21%。
▸创新点2:通过时序优化提升访问速度(电路创新)。通过牺牲多余的保留时间,优化eDRAM的访问时间,平衡字线耦合,确保数据在快速访问期间的可靠保留,从而提高整体解码速度。
▸创新点3:采用行合并和双帧处理技术(系统创新)。通过行合并和双帧处理技术,进一步优化解码器的吞吐量,使其在65 nm工艺下实现9 Gb/s的峰值吞吐量,功耗低至89.5 pJ/b。
▸创新点4:电压和频率缩放技术(系统创新)。通过动态调整电压和频率,在1.5 Gb/s的吞吐量下将功耗降至37.7 mW,能效提升至35.6 pJ/b,显著优化了低功耗应用场景下的性能。
Abstract
The majority of the power consumption of a high- throughput LDPC decoder is spent on memory. Unlike in a gen- eral-purpose processor, the memory access in an LDPC decoder is deterministic and the access win dow is short. We take advantage of the unique memory access characteristic to design a non-refresh eDRAM that holds data for the necessary access window, and fur- ther improve its access time by trading off the excess retention time. The resulting 3T eDRAM cell is designed to balance wordline cou- pling to reliably retain data for a fast access. We integrate 32 5x210 non-refresh eDRAM arrays in a row-parallel LDPC decoder suit- able for the IEEE 802.11ad standard. Memory refresh is eliminated and random access is replaced with a simple sequential addressing. With row merging and dual-frame processing, the 1.6 mm 2 65 nm LDPC decoder chip achieves a peak throughput of 9 Gb/s at 89.5 pJ/b, of which only 21% is spent on eDRAMs. With voltage and frequency scaling, the power consumption of the LDPC decoder is reduced to 37.7 mW for a 1.5 Gb/s throughput at 35.6 pJ/b.