The Inverter

CMOS inverter modeling.

Observations:
Fully restored output levels (V_DD and GND) result in high noise margins.
Ratioless: logic levels do not depend on the relative device sizes.
Low output impedance in steady state, which increases robustness to noise.
High input impedance: steady-state fan-out is theoretically unlimited.

The Inverter DC current characteristics

Inverter Models

The transient response can be approximated with an RC model.

This model assumes the transistors switch instantaneously.

The load capacitance, C_L, is due to diffusion, routing and downstream gates.

This "ideal" model predicts a transient response with time constant R_on C_L.

This indicates that a fast gate is built by keeping R_on, C_L, or both small.

Unfortunately, R_on in the actual device is a nonlinear function of the voltage across it.
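
As a rough illustration of the ideal RC model, the sketch below computes the time constant and the time a first-order RC network needs to reach 50% of its swing (ln 2 times the time constant). The resistance and capacitance values are hypothetical and not taken from the lecture.

```python
import math


def rc_time_constant(r_on, c_l):
    """Time constant (seconds) of the ideal switch-plus-RC model."""
    return r_on * c_l


def rc_delay_50(r_on, c_l):
    """Time for a first-order RC output to cover 50% of its swing."""
    return math.log(2) * r_on * c_l


# Hypothetical values, chosen only for illustration.
R_ON = 10e3    # ohms
C_L = 30e-15   # farads

tau = rc_time_constant(R_ON, C_L)
t50 = rc_delay_50(R_ON, C_L)
print(f"tau = {tau * 1e12:.1f} ps, 50% delay = {t50 * 1e12:.1f} ps")
```

Halving either R_on or C_L halves both numbers, which is the point of the "keep R_on and C_L small" rule.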

Inverter Threshold

Previously, we defined V_M as the inverter threshold voltage but did not derive an analytical expression for it.

V_M is defined as the point where V_in = V_out in the VTC of the inverter.

In this region, both the NMOS and PMOS transistors are in saturation.

Therefore, the value of V_M can be obtained by equating the NMOS and PMOS saturation currents.

V_M is situated in the middle of the available voltage swing (V_DD/2) when k_p = k_n (assuming the threshold voltages are similar).

Since k is proportional to the mobility times W/L, this requires sizing given by (W/L)_p / (W/L)_n = μ_n / μ_p.

From the previous analysis, this means making the PMOS three times wider than the NMOS.

Having V_M at V_DD/2 results in comparable low and high noise margins, which is desirable.

Observations from plot:
V_M is relatively insensitive to variations of k_p/k_n around the center point.

Small variations in the ratio (0.7 -> 1.5) do not disturb the VTC much.
Industry sets the ratio of PMOS width to NMOS width to 2 instead of 3.
Secondary effects, e.g. velocity saturation, also help with this.
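
The threshold obtained by equating the two saturation currents, and its weak dependence on k_p/k_n, can be sketched numerically. This is a minimal sketch assuming the long-channel square-law model, k_n/2 (V_M - V_Tn)^2 = k_p/2 (V_DD - V_M - |V_Tp|)^2. The threshold voltages below are hypothetical, and the function name is introduced only for this illustration.

```python
import math


def switching_threshold(v_dd, v_tn, v_tp_abs, k_n, k_p):
    """Solve k_n/2 (V_M - V_Tn)^2 = k_p/2 (V_DD - V_M - |V_Tp|)^2 for V_M."""
    r = math.sqrt(k_p / k_n)
    return (v_tn + r * (v_dd - v_tp_abs)) / (1.0 + r)


# Hypothetical device parameters (not from the lecture).
V_DD, V_TN, V_TP_ABS = 5.0, 0.75, 0.75

for ratio in (0.7, 1.0, 1.5):
    vm = switching_threshold(V_DD, V_TN, V_TP_ABS, k_n=1.0, k_p=ratio)
    print(f"k_p/k_n = {ratio:.1f} -> V_M = {vm:.2f} V")
```

With equal thresholds and k_p = k_n the function returns V_DD/2, and sweeping the ratio from 0.7 to 1.5 moves V_M by only a couple of hundred millivolts under these assumptions.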

Changing the ratio shifts the transition region:

Constructing an inverter with an asymmetrical VTC (V_M shifted away from V_DD/2) is useful for filtering noise on a low or a high input signal.

Dynamic Behavior

Propagation delay is determined by the time it takes to charge/discharge the load cap, C_L.

Simple propagation delay models lump all capacitances into C_L.

In this analysis, assume V_in is driven by an ideal voltage source with fixed rise/fall times.

Observations:
C_gd12: Capacitance between the gate and drain of the first inverter.

M1 and M2 are either in cut-off or in saturation in steady-state.

It is reasonable to assume that only overlap capacitances contribute.

(Remember, the gate cap is either completely between gate/bulk (cut-off) or gate/src (sat).)

Here, in the lumped model, we will replace C_gd12 with a capacitor to GND.

The value of this capacitor is given as C_gd = 2*C_GD0*W, where C_GD0 is the overlap capacitance per unit width.

Note it is doubled due to the Miller effect: the gate and drain move in opposite directions, so the voltage across the capacitor changes by twice the output swing.
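
The lumped gate-drain capacitor can be computed directly from C_gd = 2*C_GD0*W. The overlap capacitance and width below are hypothetical placeholders, not values from the lecture or the text.

```python
def miller_gate_drain_cap(c_gd0, width):
    """Gate-drain overlap cap lumped to GND, counted twice for the Miller effect.

    c_gd0: overlap capacitance per unit width (F/m)
    width: transistor width (m)
    """
    return 2.0 * c_gd0 * width


# Hypothetical values, for illustration only.
C_GD0 = 0.3e-15 / 1e-6   # 0.3 fF per micron of width, expressed in F/m
W_N = 1.8e-6             # 1.8 micron wide device

c_gd = miller_gate_drain_cap(C_GD0, W_N)
print(f"C_gd = {c_gd * 1e15:.2f} fF")
```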

Observations (cont):
C_db1 and C_db2: Capacitances due to the reverse-biased pn-junctions.

These caps are quite nonlinear (voltage dependent).

We linearized these caps over the voltage range of interest, C_eq = K_eq C_j0,

with C_j0 the junction cap per unit area under zero bias conditions.

The bottom plate and sidewall zero bias values can be obtained from the SPICE model CJ and CJSW parameters.

K_eq was derived in an earlier lecture.

Example

Consider a 1.2 μm, 5 V technology and the previous inverter chain.

Assume φ₀ is 0.6 V for both NMOS and PMOS and m = 0.5.

Let's compute C_db1 for the NMOS transistor.

Propagation delay is computed between the 50% points.

This is the time instant when V_out reaches 2.5 V.
For the high-to-low transition, we linearize over {5 V, 2.5 V} and for the low-to-high transition over {0, 2.5 V}.

High-to-low: V_out is initially 5 V: V_high = -5 V. At 50%, V_low = -2.5 V. K_eq = 0.375.
Low-to-high: V_out is initially 0 V: V_low = 0. At 50%, V_high = -2.5 V. K_eq = 0.611.

The same values, but reversed, are obtained for the PMOS device.

Therefore, the junction capacitance can be replaced by a linear component with only minor effects on logic delays.
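
The two K_eq values in the example can be reproduced with a short calculation. The sketch assumes the usual linearization formula for a graded junction, K_eq = -φ₀^m / ((V_high - V_low)(1 - m)) * [(φ₀ - V_high)^(1-m) - (φ₀ - V_low)^(1-m)], which is taken here as an assumption since the derivation belongs to the earlier lecture.

```python
def k_eq(v_high, v_low, phi0=0.6, m=0.5):
    """Linearization factor for a junction cap between two bias voltages.

    v_high, v_low: junction voltages (negative or zero for reverse bias)
    phi0: built-in potential in volts
    m: junction grading coefficient
    """
    scale = -(phi0 ** m) / ((v_high - v_low) * (1.0 - m))
    return scale * ((phi0 - v_high) ** (1.0 - m) - (phi0 - v_low) ** (1.0 - m))


# Bias points as labelled in the example above.
k_hl = k_eq(v_high=-5.0, v_low=-2.5)   # high-to-low transition
k_lh = k_eq(v_high=-2.5, v_low=0.0)    # low-to-high transition
print(round(k_hl, 3), round(k_lh, 3))  # prints 0.375 0.611
```

The expression is symmetric in its two voltage arguments, so which end point is called "high" does not affect the result.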

Observations (cont):
C_w: The wiring capacitance is negligible (<1 fF) and can be ignored in this case.

C_g3 and C_g4: We simplify by assuming all gate cap due to M3 and M4 is connected between V_out and GND (or V_DD).

Overlap and gate capacitances are lumped into C_g = C_ox WL.

But what about the Miller effect ?

We can safely ignore it here by assuming the driven gate's output does not change until after the 50% point of the input is reached.

We also assume, with minor errors, that the channel cap of the driven gate remains constant over this interval.

Text gives a good example of the capacitance calculated from the layout of a two-inverter chain, as shown above.

Loads given as 32.75fF for high-to-low and 32.6fF for low-to-high.

Propagation Delay: First-Order Analysis

Computed by integrating capacitor (dis)charge current:

But i(v) is a nonlinear function of v (the voltage across the cap).

An approximation can be obtained by replacing the time-varying charging current by a fixed current I_av.

This is the average of the currents at the end points of the voltage transition.

Calculation between the 50% points yields:

Low-to-high: v₁ = V_OL and v₂ = (V_OH + V_OL)/2.
High-to-low: v₁ = V_OH and v₂ = (V_OH + V_OL)/2.
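
The first-order analysis described above can be written out as follows. This is a reconstruction from the surrounding definitions (integration of the capacitor current, then the end-point average I_av), so the exact form used in the original slides is an assumption.

```latex
t_p = \int_{v_1}^{v_2} \frac{C_L(v)}{i(v)}\,dv
\;\approx\; \frac{C_L\,(v_2 - v_1)}{I_{av}},
\qquad
I_{av} = \frac{i(v_1) + i(v_2)}{2}
```

With the 50% limits listed above, |v₂ - v₁| = (V_OH - V_OL)/2 in both directions, so each delay becomes C_L (V_OH - V_OL) / (2 |I_av|) with the I_av of the corresponding transition.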

t_pLH and t_pHL are given by:

Consider t_pLH for an inverter.

Assume V_in changes abruptly from V_DD to 0, leaving the NMOS off and the PMOS in saturation while V_out < |V_Tp|, after which it is in linear mode.

Here, V_OH - V_OL = V_DD and, for I_av, we have the boundary cases:

Solving for I_av gives:

Assuming the PMOS stays in saturation simplifies things considerably and adds only a small error (5% to 8% for V_DD between 3 V and 5 V).

Here, I_av is just the saturation expression.
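
The size of the error introduced by the saturation-only shortcut can be checked with a small calculation. The sketch assumes the long-channel square-law model for the PMOS with V_in at 0 V: saturation at V_out = 0 and linear operation at V_out = V_DD/2. The threshold value |V_Tp| = 0.75 V is hypothetical, and the function names exist only for this illustration.

```python
def pmos_end_point_currents(v_dd, v_tp_abs, k_p):
    """Return PMOS charging currents at V_out = 0 and at V_out = V_DD / 2."""
    v_ov = v_dd - v_tp_abs                  # gate overdrive, V_SG - |V_Tp|
    i_start = 0.5 * k_p * v_ov ** 2         # V_out = 0: saturation
    v_sd = v_dd / 2.0                       # V_out = V_DD/2: linear region
    i_mid = k_p * (v_ov * v_sd - 0.5 * v_sd ** 2)
    return i_start, i_mid


def saturation_only_error(v_dd, v_tp_abs, k_p=1.0):
    """Relative delay error when the saturation current is used as I_av."""
    i_start, i_mid = pmos_end_point_currents(v_dd, v_tp_abs, k_p)
    i_av = 0.5 * (i_start + i_mid)
    # Delay is proportional to 1 / I, so t_sat / t_av = i_av / i_start.
    return 1.0 - i_av / i_start


# |V_Tp| = 0.75 V is an assumed value, not taken from the lecture.
for v_dd in (3.0, 5.0):
    err = saturation_only_error(v_dd, 0.75)
    print(f"V_DD = {v_dd:.0f} V: delay underestimated by {100 * err:.1f}%")
```

The error does not depend on k_p, since both end-point currents scale with it.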

With V_DD assumed >> |V_Tp|:
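
One way to write out this step, assuming the square-law saturation current is used for I_av and the output has to travel V_DD/2 to reach the 50% point:

```latex
I_{av} \approx \frac{k_p}{2}\,\bigl(V_{DD} - |V_{Tp}|\bigr)^2
\quad\Longrightarrow\quad
t_{pLH} = \frac{C_L\,V_{DD}/2}{I_{av}}
        = \frac{C_L\,V_{DD}}{k_p\,\bigl(V_{DD} - |V_{Tp}|\bigr)^2}
        \approx \frac{C_L}{k_p\,V_{DD}}
```

The same steps with the NMOS discharging the load give the corresponding expression for t_pHL with k_n in place of k_p.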

Therefore, for k_n ≠ k_p, the average propagation delay is:

Minimizing propagation delay amounts to:
Reducing C_L.

C_L is composed of the self-loading (diffusion) cap, the routing cap and the fan-out cap.

Increasing k_p and k_n.

e.g. increase the W/L ratio of the transistors.
Warning: doing so increases the self-loading and fan-out factor, and therefore C_L!

Increasing V_DD.

V_DD is usually not a free design parameter. Also, the trend is to reduce it to deal with electric field strength and power consumption issues.
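
The three levers listed above can be explored with a small helper. It assumes the simplified first-order result that follows from the saturation-only approximation with V_DD >> |V_T|, namely t_p = (t_pHL + t_pLH)/2 with t_pHL ≈ C_L/(k_n V_DD) and t_pLH ≈ C_L/(k_p V_DD). All numeric values are hypothetical.

```python
def average_delay(c_l, v_dd, k_n, k_p):
    """First-order average propagation delay, t_p = C_L/(2 V_DD) (1/k_n + 1/k_p)."""
    return c_l / (2.0 * v_dd) * (1.0 / k_n + 1.0 / k_p)


# Hypothetical operating point, for illustration only.
C_L = 30e-15    # farads
V_DD = 5.0      # volts
K_N = 100e-6    # A/V^2
K_P = 100e-6    # A/V^2

base = average_delay(C_L, V_DD, K_N, K_P)
print(f"baseline       : {base * 1e12:.1f} ps")
print(f"half the load  : {average_delay(C_L / 2, V_DD, K_N, K_P) * 1e12:.1f} ps")
print(f"double k_n, k_p: {average_delay(C_L, V_DD, 2 * K_N, 2 * K_P) * 1e12:.1f} ps")
```

The model treats C_L as independent of k_n and k_p, so it does not capture the warning above that wider devices also add self-loading.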

Propagation Delay

Text gives an analysis using a two-inverter sequence.

Several observations can be made from the analysis:
The p-transistor was made 3 times larger than the n-transistor to obtain symmetrical high-to-low and low-to-high propagation delays.

This also triples the p-transistor gate and diffusion capacitances.

It is possible to speed up the inverter by reducing the width of the PMOS device!

This increases t_pLH but reduces t_pHL.

Optimal width ratio of PMOS to NMOS can be shown to be:

This reduces NM_L a little, but it is usually acceptable.
NOTE: This holds true only if C_w is negligible!
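
The trade-off behind the optimal width ratio can be illustrated with a toy model. It assumes that all load capacitance scales with the device widths (C_w negligible, as the note requires), that the NMOS resistance is fixed, and that a PMOS of equal width is r times more resistive, with r = 3 taken from the mobility ratio used earlier. Under these assumptions the average delay is proportional to (1 + β)(1 + r/β) with β = W_p/W_n, which is minimized at β = sqrt(r). The model and names are illustrative, not the derivation from the text.

```python
import math


def relative_delay(beta, r):
    """Average delay in arbitrary units versus beta = W_p / W_n.

    Load cap scales as (1 + beta); pull-down resistance is 1,
    pull-up resistance is r / beta.
    """
    return 0.5 * (1.0 + beta) * (1.0 + r / beta)


def best_beta(r, lo=0.5, hi=5.0, steps=4501):
    """Brute-force search for the beta that minimizes relative_delay."""
    betas = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return min(betas, key=lambda b: relative_delay(b, r))


R = 3.0  # assumed PMOS/NMOS resistance ratio at equal width
beta_opt = best_beta(R)
print(f"numerical optimum: beta = {beta_opt:.3f}, sqrt(r) = {math.sqrt(R):.3f}")
print(f"delay at beta = 3 relative to optimum: "
      f"{relative_delay(3.0, R) / relative_delay(beta_opt, R):.3f}")
```

In this model the symmetric design (β = 3) is a few percent slower than the optimum, at the cost of unequal rise and fall delays.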

Half of the load capacitance is due to the inverter itself ( intrinsic or self-loading ) and half is due to the fan-out ( extrinsic ) gate capacitance.

The extrinsic capacitance dominates the propagation delay for large fan-outs.
Propagation delay increases linearly with the fan-out N:

with t_p(0) and t_p(1) the propagation delay under a fan-out of 0 and a fan-out of 1, respectively.
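
A linear dependence that passes through t_p(0) at N = 0 and t_p(1) at N = 1 is fully determined by those two points, which gives t_p(N) = t_p(0) + N (t_p(1) - t_p(0)). The sketch below evaluates that line; the two delay values are hypothetical.

```python
def delay_with_fanout(n, tp0, tp1):
    """Straight line through (0, tp0) and (1, tp1), evaluated at fan-out n."""
    return tp0 + n * (tp1 - tp0)


# Hypothetical delays: intrinsic and extrinsic load equal at a fan-out of 1.
TP0 = 0.1e-9   # seconds, no fan-out
TP1 = 0.2e-9   # seconds, fan-out of 1

for n in (0, 1, 2, 4, 8):
    print(f"N = {n}: t_p = {delay_with_fanout(n, TP0, TP1) * 1e9:.1f} ns")
```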

Second-Order Performance Issues

Finite rise/fall times of the input signal cause both devices to be on simultaneously during part of the transition:

Here, t_pHL increases approximately linearly with increasing rise time once t_r > t_pHL.

A high-performance design challenge is to keep the signal rise times <= the gate propagation delay, for both speed and power consumption.

Velocity Saturation

The saturation (dis)charge current was previously assumed to be proportional to V_DD^2.
Velocity saturation makes I_av proportional to V_DD instead.

The inside curve illustrates the lack of a first-order dependence of t_p on V_DD.

For larger values of V_DD, e.g., V_DD > 4 V_T, t_p is relatively constant.

For smaller values, e.g., V_DD < 2 V_T, a sharp increase in t_p is observable.

In this case, the simplifications used to derive the first-order approximation are no longer valid.

Source Resistance

We indicated previously that including R_S and R_D gives a more accurate model.

Two effects of R_S and R_D:
The V_GS of the discharge transistor is reduced (since V_S is > 0), hence lowering the current.
The threshold of the transistor is increased, since the source is no longer grounded.

The value of R_S ranges from tens of ohms to several kΩ, depending on the manufacturing process and the device width.

Assuming a minimal-size device in a 1.2 μm process, R_S and R_D equal 70 Ω.

Here, saturation current is reduced by only 1.3% for V_DD = 5 V.

For a process that uses a lightly doped drain (LDD) approach, R_S and R_D are about 1 to 1.5 kΩ.

Here, saturation current is reduced by 20% for V_DD = 5 V!

Power Consumption

The almost ideal VTC of the CMOS inverter is not the main reason that high-complexity designs are implemented in static CMOS.

Rather, it is the almost zero power consumption in steady-state mode.

The reverse-bias diode current is, in general, very small.

Typical values are 0.1 to 0.5 nA at room temperature.
For a chip at 5 V with 1 million devices, power consumption is 0.5 mW (at 0.1 nA per device).

A more serious source is the subthreshold current.

The closer V_T is to zero, the larger the leakage with V_GS = 0 V.
This establishes a firm lower bound on V_T, which is > 0.5 V today.

For both sources of leakage, the resulting static power dissipation is given by:

The junction leakage currents are caused by thermally generated carriers .

Their value increases exponentially with increasing junction temperature.

For example, 85 °C (a common junction temperature) results in an increase by a factor of 60 over room temperature.
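
The static power figures quoted in this section follow from multiplying the leakage current by the supply voltage. The sketch below assumes P_stat = I_leak * V_DD summed over all devices and applies the factor of 60 uniformly to the junction leakage. The function name is introduced only for this illustration.

```python
def static_power(i_leak_per_device, n_devices, v_dd):
    """Static power in watts for n_devices each leaking i_leak_per_device amps."""
    return i_leak_per_device * n_devices * v_dd


N_DEVICES = 1_000_000
V_DD = 5.0

p_low = static_power(0.1e-9, N_DEVICES, V_DD)    # 0.1 nA per device
p_high = static_power(0.5e-9, N_DEVICES, V_DD)   # 0.5 nA per device
print(f"room temperature: {p_low * 1e3:.1f} mW to {p_high * 1e3:.1f} mW")

# Junction leakage scaled by the factor of 60 quoted for 85 degrees C.
print(f"85 degrees C    : {60 * p_low * 1e3:.0f} mW to {60 * p_high * 1e3:.0f} mW")
```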

Dynamic power is much larger than static power and can be broken into 2 parts:
Power spent charging and discharging the load capacitance, C_L.
Power consumed via direct-path currents (crow-bar currents).

C_L power (we derived this previously):

Charging C_L to V_DD draws C_L * V_DD^2 energy from the power supply.
Half of this energy is stored on the cap (C_L * V_DD^2 / 2) and later dissipated through the NMOS device.

So, an energy of C_L * V_DD^2 is consumed for every pair of L->H and H->L transitions.

Therefore, for a clock frequency of f, the dynamic power is P = C_L * V_DD^2 * f.

Technology advances decrease t_p and increase f and C_L (higher integration).

For example, at 30 fF/gate, 100 MHz and V_DD = 5 V, 75 μW is dissipated per gate. With 200K gates and an activity factor of 20%, 3 W are dissipated.
1 W is consumed with 100 output pins at 20 pF/pin and f = 20 MHz.

This is one of the driving forces for lower supply voltages (quadratic effect).

For example, 5 V -> 3 V drops 4 W to 1.44 W (assuming the same f).
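
The numbers in this example can be reproduced with P = a * C_L * V_DD^2 * f, where a is the fraction of gates that switch each cycle. The helper below is a sketch for checking the arithmetic; its name and the activity parameter are introduced only for this illustration.

```python
def dynamic_power(c_l, v_dd, f, activity=1.0):
    """Switching power in watts: activity * C_L * V_DD^2 * f."""
    return activity * c_l * v_dd ** 2 * f


per_gate = dynamic_power(30e-15, 5.0, 100e6)                       # one gate
core = 200_000 * dynamic_power(30e-15, 5.0, 100e6, activity=0.2)   # 200K gates
pins = 100 * dynamic_power(20e-12, 5.0, 20e6)                      # output pins
total_5v = core + pins
total_3v = total_5v * (3.0 / 5.0) ** 2   # same f, quadratic in V_DD

print(f"per gate: {per_gate * 1e6:.0f} uW")
print(f"core: {core:.2f} W, pins: {pins:.2f} W, total at 5 V: {total_5v:.2f} W")
print(f"total at 3 V: {total_3v:.2f} W")
```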

Direct-path currents.

Zero rise/fall times are not a realistic assumption.

Approximating the current spikes as triangles and assuming V_DD >> |V_T|, the power consumed is

Avoid large values for t_f and t_r to minimize it.

Direct-path power is typically only about 20% of the dynamic power.
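
Under the triangle approximation, each input transition produces a current spike with peak I_peak and a base roughly equal to the input rise or fall time, so the energy per cycle is V_DD * I_peak * (t_r + t_f) / 2. The sketch below assumes that form; I_peak and all numeric values are hypothetical.

```python
def direct_path_power(v_dd, i_peak, t_r, t_f, f):
    """Direct-path (crow-bar) power using triangular current spikes."""
    energy_per_cycle = v_dd * i_peak * (t_r + t_f) / 2.0
    return energy_per_cycle * f


# Hypothetical values, for illustration only.
V_DD = 5.0        # volts
I_PEAK = 0.1e-3   # amps
T_R = 0.5e-9      # seconds
T_F = 0.5e-9      # seconds
F = 100e6         # hertz

p_dp = direct_path_power(V_DD, I_PEAK, T_R, T_F, F)
print(f"P_dp = {p_dp * 1e6:.1f} uW")
print(f"with doubled edge times: "
      f"{direct_path_power(V_DD, I_PEAK, 2 * T_R, 2 * T_F, F) * 1e6:.1f} uW")
```

The result grows linearly with the edge times, which is why slow input edges should be avoided.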

Total power is then:

The Power-Delay product was also defined previously.

It is the energy consumed by the gate per switching event.

We've defined a switching event to consist of a 0 -> 1 and a 1 -> 0 event.

This results in a PDP of

Under the condition that the static and direct-path currents are ignored.