Page 1 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
High-Speed Electronics
Mentor User Conference 2005 - München
Dr. Alex Huber, [email protected]
Zentrum für Mikroelektronik Aargau, 5210 Windisch, Switzerland
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Outline 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features 4. Current Mode Logic Gates
5. Example Application: Clock-Data Recovery 6. Conclusion
Page 3 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features 4. Current Mode Logic Gates
5. Example Application: Clock-Data Recovery 6. Conclusion
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Applications
• Serial Data Communication (PCI-Express, Serial-ATA, etc.) • Fiber-Optical Communication (e.g. 10G Ethernet)
• Flash-ADC, high-speed Σ∆-ADC
• Ultra-Wideband (UWB) = “wireless digital interconnect” • Fractional-N PLL (Prescaler)
Page 5 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Parallel I/O Bottleneck
•CPU •Graphics-P. •Display •Harddisk •Memory 64bit, 3GHz D0 D1 Dn CLK D0 D1 Dn CLK fCLK Ccoup⇒ X-Talk • Clock/Data Skew • Clock Jitter • Data Jitter •CPU •Graphics-P. •Display •Harddisk •Memory fCLK < 166…500 MHz @ l = 1 m…10 cm, 16…64bit 64bit, 3GHz
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Serial Data Communication
D0 D1 Dn CLK fCLK fser MUX DEMUX D0 D1 Dn CLK fCLK CR DEC fser 2/n D0 fser = n/2 · fCLK D1D2 D3 Dn fCLK
High-Speed Digital Electronics required! CLK g
enera
ted at Rx
fCLK = 2 GHz, n = 16 bit fSER = 8 · 2 GHz = 16 GHz (32 Gb/s)
Clock not transmitted: Unlimited number of parallel links
Page 7 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
I/O Technology Trend
Optical PCI Express Serial-ATA IEEE 1394b PCI Bus ISA Bus
f
SER = signaling clock rateYear 66MHz 1GHz 5GHz 10GHz 20GHz 1980 1990 2000
12GHz Copper Signaling Limit (attenuation)
1GHz Parallel Bus Limit (cross-talk)
MCAEISA VESA PCI-X AGP 2x Parallel Bus Architectures
Serial Bus Architectures
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Technologies
• Earlier: Dominated by bipolar processes (GaAs, Si, SiGe, InP)
• Nowadays: CMOS at 130nm, 90nm, (65nm)
Page 9 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features 4. Current Mode Logic Gates
5. Example Application: Clock-Data Recovery 6. Conclusion
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Repetion: Some Basics of MOSFET
Transconductance
(
)
0 2 2 > − ≥ − = GS T DS GS T n,p ox D V U V V U L W C I µ for vGS gmvGS iD G S S D CGS CDSDC Drain Current Threshold Voltage
Design Variables Wn L ID VGS VDS G D S VDS Wp L ID VGS S G D 2 2 GS T n,p ox GS D m U V L W C V I g = ⋅ − ∂ ∂ = µ
Page 11 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Speed Limitation of CMOS Gates
VinCgs,n Cgs,p Vgs,n Vgs,p id,p=gm,p*Vgs,p id,n=gm,n*Vgs,n Cds,p Cds,n Vout Vin V out Wp L Wn L VDD
• With ideal voltage drive: td = ∆Vout·Cds,p/id,n
• For Vp=VDD/2: Wp ≈ 2…3 Wn (due to µn ≈ 2...3µp) • To reduce delay td: Cds,p L ; Wp Wn id,n L ; gm,n Wn Given technology: No improvement possible! Technology scaling: L=1µm (1990) L=90nm (2004)
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Reduction of Delay Time
Basic idea: Decouple charging current and switch transistor
Vin Vout Load Wn L VDD icharge
Common Mode Voltage Vcm?
Load: PMOS, resistor, inductor, …?
• Icharge ; Wn td
• Limits: Wn large enough for a) V-Gain = gm,n·RLoad > 1
Page 13 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Current Mode Logic (CML)
Advantages for high-speed operation:
1. Independent choice of Wn and i0
2. Differential mode: higher SNR 3. Free choice of load:
PMOS, Resistor, Inductor
4. With Inductor: BWind ≈ 2·BWres
⇒ fCLK,CML ≈ (2..3)·fCLK,CMOS @ same L
Disadvantages:
1. Static DC power consumption VDD·i0
2. Area consumption:
2 {PMOS | Res | Ind}, 3 NMOS
Vin Vout i0 Load Wn L VDD Load Wn L VDD Vcm Differential topology: independent of Vcm
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features
4. Current Mode Logic Gates
5. Example Application: Clock-Data Recovery 6. Conclusion
Page 15 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
CML vs. CMOS: Figures of Merit
Delay td CMOS CML I C V N RC N ⋅ = ∆ ⋅ α µ( DD t) ox DD U V C V C N − ⋅ ⋅
Logic swing Load Capacitance
~∆V Power P N ⋅I ⋅VDD ( 1 ) , 2 d CLK CMOS d DD t f t C V N = ⋅ ⋅ Power-Delay Product
[
]
C V V N I C V N V I N t P DD DD d ⋅ ∆ = ⎥⎦ ⎤ ⎢⎣ ⎡ ∆ ⋅ ⋅ ⋅ ⋅ = ⋅ 2 C V N t t C V N DD CMOS d CMOS d DD ⋅ ⋅ = ⋅ ⋅ ⋅ 2 , , 2 Energy-Delay Product[
]
I C V V N I C V N C V V N t P DD DD d 2 2 3 2 2 ) (∆ = ⎥⎦ ⎤ ⎢⎣ ⎡ ∆ ⋅ ⋅ ⋅ ∆ = ⋅ α µ( ) 2 3 2 , 2 t DD ox DD CMOS d DD U V C C V N t C V N − = ⋅ ⋅ ⋅ Logic depth ~IC opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . CML vs. CMOS: Trade-Off Energy-Delay-Product EDP: α µ( ) 2 3 2 t DD ox DD CMOS U V C C V N EDP − = I C V V N EDPCML DD 2 2 3 (∆ ) = VDD [V] 0.5 1 1.5 2 100 200 300 400 Delay CMOS td [ps] I[µA] 1 10 100 100 200 300 400 Delay CML td [ps] 1000 1 1.5 2 2.5 EDP [pJ·ps] Energy-Delay (N=4) Advantage for CML: low logic depth, very high speed
Example: • 0.25µm technology • VDD=2.5V • PMOS load No low bound on CML delay with current!
Page 17 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features
4. Current Mode Logic Gates
5. Example Application: Clock-Data Recovery 6. Conclusion
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Inverter Logic Level: Vh = I0·RC Gate Delay td VDD
Page 19 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Multiplexer VDD VDD
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . EXOR VA=1, VB=1: VQ=-1~0 VA=-1, VB=1: VQ=1 VA=1, VB=-1: VQ=1 VA=-1, VB=-1: VQ=-1~0 VDD VDD VDD VDD
Page 21 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . D-Latch VDD VDD
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Master-Slave D-FlipFlop Negative edge triggered FF with respect to CLK
Page 23 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Design Techniques for CML
• CMOS design kits typically don‘t provide CML standard cells
(some Bipolar/BiCMOS kits might, but typically only in very mature = “old” = “slow” technologies and only symbol/layout)
• System design can be done on cell-level
• Cell design is done with analog (= transistor-level) techniques
• Simulation must reflect the analog nature of CML gates (= dependence of delay/transition-time on input/output impedance)
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . 1. Motivation
2. Speed Limitation of CMOS logic
3. Current Mode Logic: Advantages and Features 4. Current Mode Logic Gates
5. Example: Clock-Data Recovery for Serial Data Communication
Page 25 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Clock-Data Recovery (I)
DEMUX D0 D1 Dn CR DEC fser 2/n CLK Din PFD LF VCO DFF Din DEMUX Full-Rate
Architecture external?Passive,
PLL Analog
DFF (CML) and VCO operate at fser
(e.g. 40GHz at 40Gb/s)
Γlow” complexity
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Clock-Data Recovery (II)
DEMUX D0 D1 Dn CR DEC fser 2/n CLK Din VCO Din DFF DFF DFF Edge Detector Loop Filter D0 D1 Dn 2n DFF 2n VCO Phases Reference Frequency n-th Rate Architecture Digital (CML) Analog (?) • Clock frequency at DFF: fCLK, full-rate/n • 2n phases required (∆t = 1/(2fCLK, full-rate))
Page 27 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Edge Detection Din DFF DFF DFF Edge Detector 2n DFF
2n Phase Clk to Loop Filter
d0 e0 en CLK locked: CLK in-phase with DATA D0 D1 D2 D3 d0 d1 e0 e1 dn≠ dn+1 <en> = 0.5 DATA CLK CLK D0 D1 D2 D3 CLK is early w/ respect to DATA d0 d1 e0 e1 dn = en DATA CLK
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Generation of Variable Clock Phase
V
ctrlVCO: cos([ω0+Kvco·Vctrl]·t) = cos(ω0t + ∆φ)
∆φ = Kvco·Vctrl·t = ∫Kvco·Vctrldt
Phase Rotator:
n-phase VCO φ-rot cos(ω0t) φ1
···
∆ϕ = δϕ·n·T no Integration! cos(ω0t +∆φ) δϕ UP DN φnI
I
Q
Q
= ϕ0 = ϕ1 = ϕ2 = ϕ3Page 29 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
Clock-Data Recovery (III): HIGHSCORE 16 16 Edge- Detec-tor D E Early Late 16 16 8 parallel Sampler Rate Reduction Early Late 1 1 D D D D E E E E 4 0°, 45°, ... , 315° 8-phase DLL up
dn up/downcounter Digital LoopFilter
0°, 45°, ... , 315° Ref-VCO (8 phase) Phase-Rotator 40 Gb/s optical input 4x 10Gb/s electrical output 10GHz 2.5GHz 1.25GHz 1.25GHz 1x/Link 1x for 8 Links
CML (“analog”) CMOS (“digital”)
8:32 DEMUX
“High-Speed Communication Receiver for 40 Gb/s in CMOS”: CTI Project of zma, BFH Burgdorf, ETHZ, IBM ZRL
C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch .
HIGHSCORE Key Features
• Goal: Serial Communication Receiver (=CDR) at 40 Gb/s • 90 nm CMOS state-of-the-art (IBM)
• FO4-Delay of CMOS: 20 ps
• Typical clock frequencies of CMOS logic: 1.25 GHz
• CML operates at fCLK = 10 GHz, partially inductor-peaked • Quarter-Rate Architecture
• No external passive components (except 1 XTAL) • Fully digital loop-filter (complex functions possible!)
• Simulation-friendly: VHDL/Verilog representation for system characterization possible
Page 31 C opy right by Zent ru m fü r M ikr oe lekt ron ik Aa rgau , C H -5 210 Win di sch . Conclusion
• CMOS logic maximum speed can only be scaled by technology improvements (reduce Gate-Length)
• CML logic offers 2-3 times higher clock frequencies with
improved Energy-Delay product compared to CMOS at same frequencies and same Gate-Length
• CML gates are designed and simulated with analog
techniques while the system function is digital by nature • A main application of very high speed digital electronics is
Serial Data Communication for highest I/O Bandwidth to overcome the parallel I/O Bottleneck
• Highly parallel architectures allow fully digital designs with complex functions in CMOS replacing bulky passive