ISSCC 2013
Session 24
Digital Circuits
Reliable and Energy-Efficient 1MHz 0.4V Dynamically Reconfigurable SoC for ExG Applications in 40nm LP CMOS
Tobias Gemmeke1, Changmoo Kim2, Jos Hulzink1, Jan Stuyt1, Mookyung Jung2, Jos Huisken1, Soojung Ryu2, Jungwook Kim2, Harmke de Groot1 imec - Holst Centre, Eindhoven, The Netherlands, Samsung Advanced Institute of Technol
ISSCC 2013
Session 24
Digital Circuits
A Low-Power 1GHz Razor FIR Accelerator with Time-Borrow Tracking Pipeline and Approximate Error Correction in 65nm CMOS
ARM, Cambridge, United Kingdom The unrelenting demands of wireless/multimedia DSP workloads necessitate specialized hardware to achieve higher performance and power efficiency. Razor systems offer even greater power effi
ISSCC 2013
Session 24
Digital Circuits
Self-Super-Cutoff Power Gating with State Retention on a 0.3V 0.29fJ/Cycle/Gate 32b RISC Core in 0.13μm CMOS
Using ultra low-voltage (ULV) is a viable approach towards lowering power consumption. However, due to the narrowing gap between the supply voltage and the threshold voltage, ULV designs inevitably suffer from either low
ISSCC 2013
Session 24
Digital Circuits
Ultra-Wide Body-Bias Range LDPC Decoder in 28nm UTBB FDSOI Technology
Bertrand Pelloux-Prayer1, Fabien Giner1, Deepak-Kumar Arora3, Franck Arnaud1, Nicolas Planes1, Julien Le Coz1, Olivier Thomas2, Sylvain Engels1, Giorgio Cesana1, Robin Wilson1, Pascal Urard1 STMicroelectronics, Crolles,
ISSCC 2013
Session 24
Digital Circuits
A 1.15Gb/s Fully Parallel Nonbinary LDPC Decoder with Fine-Grained Dynamic Clock Gating
The primary design goal of a communication or storage system is to allow the most reliable transmission or storage of more information at the lowest signalto-noise ratio (SNR). State-of-the-art channel codes including tu
ISSCC 2013
Session 14
Digital Circuits
Razor-Lite: A Side-Channel Error-Detection Register for Timing-Margin Recovery in 45nm SOI CMOS
University of Michigan, Ann Arbor, MI Advanced CMOS technologies are highly susceptible to process, voltage, and temperature (PVT) variations due to sub-wavelength lithography and other manufacturing challenges. These va
ISSCC 2013
Session 14
Digital Circuits
A 95fJ/b Current-Mode Transceiver for 10mm On-Chip Interconnect
sense-amplifier load in the receiver. In this work, IDRV and IPE were set to 95μA and 45μA, respectively. For the receiver equalization, the PMOS diode in the sense-amplifier load is modified to form an active inductor c
ISSCC 2013
Session 14
Digital Circuits
All-Digital Hybrid Temperature Sensor Network for Dense Thermal Monitoring
spatial temperature distribution, which is essential for dynamic thermal management [1,2]. The number of on-chip temperature sensors in highperformance processors is increasing, with state-of-the-art commercial processor
ISSCC 2013
Session 14
Digital Circuits
3D Clock Distribution Using Vertically/HorizontallyCoupled Resonators
Clock distribution with low skew, low jitter, and low power for high-performance microprocessors is a significant design challenge. Although traditional H-tree clock distribution circuits are widely used, the clock skew
ISSCC 2013
Session 14
Digital Circuits
A 2.5GHz 2.2mW/25μW On/Off-State Power 2psrmsLong-Term-Jitter Digital Clock Multiplier with 3-Reference-Cycles Power-On Time
dissipation and increase battery life. By turning off the circuits that are not in use, power cycling provides a viable means to make power dissipation proportional to workload, hence achieving energy proportional operat
ISSCC 2013
Session 14
Digital Circuits
A 0.026mm2 5.3mW 32-to-2000MHz Digital Fractional-N Phase Locked-Loop Using a Phase-Interpolating Phase-to-Digital Converter
from analog circuits to their digital counterparts, with digital PLLs (DPLLs) being an example of this trend [1]. All-digital or fully synthesizable approaches, which exploit the merits of advanced processes, suffer from
ISSCC 2013
Session 14
Digital Circuits
An All-Digital PLL Using Random Modulation for SSC Generation in 65nm CMOS
This paper introduces a digital PLL which uses high-frequency random modulation (RM), as opposed to low-frequency periodic modulation, to generate a spread spectrum clock (SSC). The implementation is straightforward and
ISSCC 2013
Session 14
Digital Circuits
A 0.032mm2 3.1mW Synthesized Pixel Clock Generator with 30psrms Integrated Jitter and 10-to-630MHz DCO Tuning Range
digital TVs and also in other video applications. A low integrated jitter is required for good display quality. However, an extremely low input frequency coming from the horizontal synchronization signal (HSYNC) makes it
ISSCC 2013
Session 14
Digital Circuits
A 0.022mm2 970μW Dual-Loop Injection-Locked PLL with -243dB FOM Using Synthesizable All-Digital PVT Calibration Circuits
include low area, low power consumption, environmental insensitivity, and the lowest possible jitter performance. Multiplying Delay-Locked Loop (MDLL) [12], subharmonically injection-locked techniques [3], and sub-sampli
ISSCC 2012
Session 14
Digital Circuits
A Digitally Stabilized Type-III PLL Using Ring VCO with 1.01psrms Integrated Jitter in 65nm CMOS
control gain, KVCO, increases the phase noise contribution arising from the charge pump and loop filter. To resolve this problem, dual-tuning PLLs (DT-PLLs) have been studied [1-4]. The DT-PLL structure adds a narrow-ban
ISSCC 2012
Session 14
Digital Circuits
A TDC-Less ADPLL with 200-to-3200MHz Range and 3mW Power Dissipation for Mobile SoC Clocking in 22nm CMOS
Mobile SoC designs demand a low-power clocking system to maximize battery life. The host PLL is critical since it must remain enabled to support always-on, always-connected operation. In addition, the host PLL should off
ISSCC 2012
Session 14
Digital Circuits
A 6.7MHz-to-1.24GHz 0.0318mm2 Fast-Locking All-Digital DLL in 90nm CMOS
synchronization in high performance digital systems. The design of analog DLLs has become a challenge due to the trends associated with CMOS scaling, namely, high leakage current, low supply voltage, etc. Consequently, m
ISSCC 2012
Session 14
Digital Circuits
A 1.5GHz 890µW Digital MDLL with 400fsrms
implemented using digital phase-locked loops (DPLLs), are evolving as the preferred means for synthesizing on-chip clocks. Their main benefits include small area, reduced sensitivity to analog circuit imperfections, and
ISSCC 2012
Session 14
Digital Circuits
A 0.004mm2 250µW ΔΣ TDC with Time-Difference Accumulator and a 0.012mm2 2.5mW Bang-Bang Digital PLL Using PRNG for Low-Power SoC Applications
clock and timing generators are in high demand to avoid complex analog operations and to meet stringent phase noise requirements. There have been several approaches to convert analog systems to their digital counterparts
ISSCC 2012
Session 10
Digital Circuits
K Computer: 8.162 PetaFLOPS Massively Parallel Scalar Supercomputer Built with Over 548k Cores
Tatsumi Nakada1, Ken Seki1, Toshiyuki Shimizu1, Naoki Shinjo1, Fumiyoshi Shoji2, Atsuya Uno2, Motoyoshi Kurokawa2 Fujitsu, Kanagawa, Japan RIKEN, Hyogo, Japan 1 2 Many high-performance CPUs employ a multicore architectur
ISSCC 2012
Session 10
Digital Circuits
Centip3De: A 3930DMIPS/W Configurable Near-Threshold 3D Stacked System with 64 ARM Cortex-M3 Cores
Sangwon Seo, Matthew Fojtik, Sudhir Satpathy, Yoonmyung Lee, Daeyeon Kim, Nurrachman Liu, Michael Wieckowski, Gregory Chen, Trevor Mudge, Dennis Sylvester, David Blaauw University of Michigan, Ann Arbor, MI Recent high p
ISSCC 2012
Session 10
Digital Circuits
3D-MAPS: 3D Massively Parallel Processor with Stacked Memory
Mohammad Hossain1, Moongon Jung1, Ilya Khorosh1, Gokul Kumar1, Young-Joon Lee1, Dean Lewis1, Tzu-Wei Lin1, Chang Liu1, Shreepad Panth1, Mohit Pathak1, Minzhen Ren1, Guanhao Shen1, Taigon Song1, Dong Hyuk Woo1, Xin Zhao1,
ISSCC 2012
Session 10
Digital Circuits
A 3D System Prototype of an eDRAM Cache Stacked Over Processor-Like Logic Using Through-Silicon Vias
IBM Systems and Technology Group, Fishkill, NY 1 2 3D integration (3DI) holds promise for improved performance of integrated systems by increasing interconnect bandwidth [1]. A processor stacked with cache memory is one
ISSCC 2012
Session 10
Digital Circuits
A 2.05GVertices/s 151mW Lighting Accelerator for 3D Graphics Vertex and Pixel Shading in 32nm CMOS
rendering realistic images in high-throughput 3D graphics pipelines. It is the most performance and power-critical operation in programmable vertex and pixel shaders due to the large number of complex floating-point (FP)
ISSCC 2012
Session 10
Digital Circuits
A 1.45GHz 52-to-162GFLOPS/W Variable-Precision Floating-Point Fused Multiply-Add Unit with Certainty Tracking in 32nm CMOS
blocks of 3D graphics, signal processing and high-performance computing workloads [1,2]. Higher floating-point precisions offer improved accuracy at the expense of performance and energy efficiency, with variable-precisi
ISSCC 2012
Session 10
Digital Circuits
A Source-Synchronous 90Gb/s Capacitively Driven Serial On-Chip Link Over 6mm in 65nm CMOS
an ever increasing number of cores in modern MPSoCs, power reduction and meeting on-chip bandwidth requirements are pressing concerns. Energy efficiency can be increased by percore dynamic voltage and frequency scaling (
ISSCC 2012
Session 10
Digital Circuits
A 280mV-to-1.1V 256b Reconfigurable SIMD Vector Permutation Engine with 2-Dimensional Shuffle in 22nm CMOS
maximizing high-performance microprocessor vector datapath utilization in multimedia, graphics, and signal processing workloads [1-3]. A wide SIMD vector permutation engine is required to achieve high-throughput data rea
ISSCC 2011
Session 19
Digital Circuits
A 0.27V 30MHz 17.7nJ/transform 1024-pt Complex FFT Core with Super-Pipelining
University of Michigan, Ann Arbor, MI, 2Arizona State University, Tempe, AZ Recently, aggressive voltage scaling was shown as an important technique in achieving highly energy-efficient circuits. Specifically, scaling Vd
ISSCC 2011
Session 19
Digital Circuits
A 62mV 0.13µm CMOS Standard-Cell-Based Design Technique Using Schmitt-Trigger Logic
University of Freiburg - IMTEK, Freiburg, Germany, HSG-IMIT, Villingen-Schwenningen, Germany Sub-threshold circuits have recently gained attention mainly due to the possibility of operating at the minimum energy per oper
ISSCC 2011
Session 19
Digital Circuits
A 77% Energy-Saving 22-Transistor Single-PhaseClocking D-Flip-Flop with Adaptive-Coupling Configuration in 40nm CMOS
Flip-flops (FF) typically consume more than 50% of random-logic power in an SoC chip, due to their redundant transition of internal nodes, when the input and the output are in the same state. Several low-power techniques
ISSCC 2011
Session 19
Digital Circuits
Comparison of 65nm LP Bulk and LP PD-SOI with Adaptive Power Gate Body Bias for an LDPC Codec
Alexandre Valentian2, Marc Belleville2, Christine Raynaud1, Damien Croain1, Pascal Urard1 1 STMicroelectronics, Crolles, France, CEA-LETI-MINATEC, Grenoble, France 2 A Low-Density Parity-Check (LDPC) codec circuit is imp
ISSCC 2011
Session 19
Digital Circuits
An 82µA/MHz Microcontroller with Embedded FeRAM for Energy-Harvesting Applications
Ronald Nerlich1, Marcus Herzog1, Ralph Ledwa1, Christian Sichert1, Volker Rzehak1, Priya Thanigai2, Bjoern Oliver Eversmann1 1 Texas Instruments, Freising, Germany, Texas Instruments, Dallas, TX 2 In recent years energy-
ISSCC 2011
Session 19
Digital Circuits
A Voltage-Scalable Biomedical Signal Processor Running ECG Using 13pJ/cycle at 1MHz and 0.4V
Filipa Duarte1, Arjan Breeschoten1, Jos Huisken1, Jan Stuyt1, Harmke de Groot1, Francisco Barat2, Johan David2, Johan Van Ginderdeuren2 1 imec - Holst Centre, Eindhoven, The Netherlands, NXP Semiconductors, Leuven, Belgi
ISSCC 2010
Session 9
Digital Circuits
Early Detection of Oxide Breakdown Through In Situ Degradation Sensing
high-performance design as it determines the maximum supply voltage, degrading maximum performance and SRAM stability [1]. OBD is an inherently statistical process where some devices fail long before others. Hence, a pri
ISSCC 2010
Session 9
Digital Circuits
In Situ Delay-Slack Monitor for High-Performance Processors Using An All-Digital Self-Calibrating 5ps Resolution Time-to-Digital Converter
susceptible to process, voltage, and temperature (PVT) variation. The standard approach for addressing this issue is to increase timing margin at the expense of power and performance. One approach to reclaim these losses
ISSCC 2010
Session 9
Digital Circuits
Accurate Characterization of Random Process Variations Using a Robust Low-Voltage HighSensitivity Sensor Featuring Replica-Bias Circuit
random threshold voltage (Vth) fluctuations is crucial in process optimization and yield learning, particularly for matching critical transistors such as SRAMs, sense amplifiers, differential amplifiers, etc. Traditional
ISSCC 2010
Session 9
Digital Circuits
A Microcontroller-Based PVT Control System For A 65nm 72Mb Synchronous SRAM
Cypress Semiconductor, San Jose, CA As scaling of silicon CMOS technology continues more challenges emerge in dealing with process variations. In SRAMs process variations directly affect operating margin, subthreshold le
ISSCC 2010
Session 9
Digital Circuits
High-Bandwidth and Low-Energy On-Chip Signaling with Adaptive Pre-Emphasis in 90nm CMOS
wires pose well-known latency, bandwidth, and energy challenges to the designers of high-performance VLSI systems. Repeaters effectively mitigate wire RC effects but do little to improve their energy costs. Moreover, pro
ISSCC 2010
Session 9
Digital Circuits
A 1.2 TB/s On-Chip Ring Interconnect for 45nm 8-Core Enterprise Xeon® Processor
Xeon microprocessors targeting high performance, low-power products [1, 2]. Several key design requirements for this product include high bandwidth, low-latency shared L3 cache access, design modularity to support effici
ISSCC 2010
Session 9
Digital Circuits
POWER7TM Local Clocking and Clocked Storage Elements
Joshua Friedrich5, Victor Zyuban2, Ethan Cannon6, A.J. KleinOsowski6 1 IBM Systems and Technology Group, Yorktown Heights, NY IBM Research, Yorktown Heights, NY 3 IBM Systems and Technology Group, Boeblingen, Germany 4 I
ISSCC 2010
Session 9
Digital Circuits
Low-Skew Clock Distribution Using Zero-PhaseClock-Buffer DLLs
Carl Werner, Ken Chang Rambus, Los Altos, CA Clock distribution continues to be a challenging task in digital clocked systems. In a typical clocking architecture (Fig. 9.2.1, top), a phase-locked loop (PLL) produces the
ISSCC 2010
Session 9
Digital Circuits
A Precise-Tracking NBTI-Degradation Monitor Independent of NBTI Recovery Effect
Scaling has accelerated transistor degradation with respect to aging, especially for Negative Bias Temperature Instability (NBTI), which can cause more than a 10% degradation in delay [1]. It is known that in NBTI condit
ISSCC 2010
Session 9
Digital Circuits
Within-Die Variation-Aware Dynamic-VoltageFrequency Scaling Core Mapping and Thread Hopping for an 80-Core Processor
Tiju Jacob2, Keith Bowman1, Jason Howard1, James Tschanz1, Vasantha Erraguntla2, Nitin Borkar1, Vivek De1, Shekhar Borkar1 1 Intel, Hillsboro, OR Intel, Bangalore, India 2 Many-core processors with on-die network-on-chip
ISSCC 2010
Session 26
Digital Circuits
A 1GHz ADPLL with a 1.25ps Minimum-Resolution Sub-Exponent TDC in 0.18µm CMOS
since the minimum resolvable time quantity is proportional to one-inverter delay [1]. For fine time resolution, vernier delay chains are frequently used [2,3]. Since the time resolution is determined by the difference be
ISSCC 2010
Session 26
Digital Circuits
A 86MHz-to-12GHz Digital-Intensive PhaseModulated Fractional-N PLL Using a 15pJ/Shot 5ps TDC in 40nm digital CMOS
IMEC, Leuven, Belgium K.U. Leuven, Leuven, Belgium 2 Digital-intensive PLL architectures emerge [1]-[4], exploiting the benefits of CMOS scaling. This work presents a digital-intensive, reconfigurable 86MHz12GHz synthesi
ISSCC 2010
Session 26
Digital Circuits
A 1.4psrms-Period-Jitter TDC-Less Fractional-N Digital PLL with Digitally Controlled Ring Oscillator in 65nm CMOS
State of the art digital PLLs can be divided in two categories, depending on the implementation of the digital phase detector. Digital clocking and wireline applications mostly use a Bang-Bang detector (BBPLLs) [1-3], of
ISSCC 2010
Session 26
Digital Circuits
A 3MHz-BW 3.6GHz Digital Fractional-N PLL with
spurs [1], [2]. However, the actual results depend dramatically on the linearity of the time-to-digital converter (TDC). This paper presents a 3MHz bandwidth fractional-N synthesizer, which combines a 4ps TDC with digita
ISSCC 2010
Session 26
Digital Circuits
Spur-Reduction Techniques for PLLs Using SubSampling Phase Detection
University of Twente, Enschede, Netherlands National Semiconductor, Santa Clara, CA 2 In PLL designs, a wide loop bandwidth is often desired as it offers fast settling time, reduces on-chip loop filter area and sensitivi
ISSCC 2010
Session 26
Digital Circuits
A Calibration-Free 800MHz Fractional-N Digital PLL with Embedded TDC
Atheros Communications, Santa Clara, CA Digital Phase-Locked Loops (DPLLs), which are amenable to CMOS process scaling, have recently been demonstrated for both wireless and wireline applications as alternatives to conve
ISSCC 2010
Session 26
Digital Circuits
A 2.1-to-2.8GHz All-Digital Frequency Synthesizer with a Time-Windowed TDC
NEC, Kawasaki, Japan NEC Electronics, Kawasaki, Japan 2 All-digital phase-locked loops (ADPLLs) offer the advantages of eliminating the large on-chip passive filter and not suffering from poor low-supply-voltage operatio