← 返回 JSSC 论文列表JSSC 2007第1期Digital Circuits90nm
A Power-Efficient High-Throughput 32-Thread SPARC Processor
一款高效能多线程SPARC处理器,采用多核多线程架构优化商业负载吞吐量。
1.2 GHz, 1.2 V, 63 W, 279 million transistors
多线程SPARC高带宽低功耗CMT架构
▸创新点1:采用多核多线程(CMT)架构,通过八个四线程64位核心实现高吞吐量,显著提升商业负载处理能力,同时降低硬件复杂度和功耗。
▸创新点2:高带宽互联交叉开关设计,优化数据传输效率,支持多核间高速通信,显著提升整体系统性能。
▸创新点3:共享L2缓存设计,通过3MB共享缓存减少数据访问延迟,提高多核协同工作效率,同时优化硅面积利用率。
▸创新点4:集成电源和热监控技术,动态调整功耗和散热策略,增强芯片可靠性和稳定性,适用于高负载场景。
Abstract
This first generation of “Niagara” SPARC proces- sors implements a power-efficient Chip Multi-Threading (CMT) architecture which maximizes overall throughput performance for commercial workloads. The target performance is achieved by exploiting high bandwidth rather than high frequency, thereby reducing hardware complexity and power. The UltraSPARC T1 processor combines eight four-threaded 64-b cores, a floating-point unit, a high-bandwidth interconnect crossbar, a shared 3-MB L2 Cache, four DDR2 DRAM interfaces, and a system interface unit. Power and thermal monitoring techniques further enhance CMT performance benefits, increasing overall chip reliability. The 378-mm /50 die is fabricated in Texas Instrument’s 90-nm CMOS technology with nine layers of copper interconnect. The chip contains 279 million transistors and consumes a maximum of 63 W at 1.2 GHz and 1.2 V. Key functional units employ special circuit techniques to provide the high bandwidth required by a CMT architecture while optimizing power and silicon area. These include a highly integrated integer register file, a high-bandwidth interconnect crossbar, the shared L2 cache, and the IO subsystem. Key aspects of the physical design methodology are also discussed.