# Fast Statistical Circuit Analysis with Finite-Point Based Transistor Model 

Min Chen Wei Zhao Frank Liu* Yu Cao<br>Arizona State University, Phoenix, USA, \{min.chen, wei.zhao, ycao\}@asu.edu<br>*IBM Research Laboratory, Austin, USA, frankliu@us.ibm.com


#### Abstract

A new approach of transistor modeling is developed for fast statistical circuit simulation in the presence of variations. For both I-V and C-V characteristics of a transistor, finite data points are identified by their physical meaning; the impact of process and design variations is embedded into these points as closed-form expressions. Then, the entire I-V and $\mathrm{C}-\mathrm{V}$ are extrapolated using polynomial formulas. This novel approach significantly enhances the simulation speed with sufficient accuracy. The model is implemented in Verilog-A at 65 nm node. Compared to simulations with the BSIM model, the computation time can be reduced by 7 x in transient analysis and 9 x in Monte-Carlo simulations.


## 1. Introduction

CMOS technology will arguably be the technology of choice into sub-45nm regime. Such extremely small devices inevitably suffer from severe variations due to the fabrication process and circuit operations [1-3]. Due to the statistical nature of these variations, current deterministic design flow needs to be shifted toward statistical circuit analysis and optimization. This avoids overly pessimistic design margins [4-6].

Conducting statistical circuit analysis demands highly efficient simulations. The problem of computation cost is further exacerbated as the total number of transistors in a chip rapidly increases into the gigascale regime. For instance, it usually takes more than weeks for a contemporary design to run full-chip SPICE simulations; the cost to run Monte-Carlo simulations will be inhibitive in this situation. Therefore, to enable a smooth transition to statistical design flow, it is critical to develop simulation techniques that are highly efficient in computation and keep the fidelity to parameter variations.

The speed of circuit-level simulations depends on both the complexity of the transistor model (e.g., BSIM), as well as the efficiency of circuit simulation tools (e.g., SPICE). In this work, our focus is to significantly simplify the transistor model, without sacrificing its accuracy and the sensitivity to variations. Traditional compact transistor models, such as BSIM or PSP, consist of a large number of parameters and complicated equations in order to capture many physical mechanisms for a short-channel


Figure 1. Application of new model in the design flow
device [7-8]. Such a high-level of complexity provides sufficient physicality of models, but dramatically slows down the simulation speed. At the other extreme end, lookup tables for I-V and C-V are purely based on the empirical behavior of a transistor [9]. It is much more efficient than the compact models, if the size of the table is small. However, the empirical nature of lookup tables limits their scalability to process variations. As the number of process corners increases, the size of lookup tables rapidly rises and thus, leads to higher simulation cost. In this context, a desirable approach should be to have the efficiency comparable to lookup tables, but keep the scalability of compact models.

To achieve this objective, a finite-point based transistor model is proposed in this paper. The key idea is to extract only a finite number of data points from I-V, based on their physical meanings and the importance in circuit operation. The dependence on process and design variations is embedded in these points through physical models or lookup tables. The entire behavior of I-V and CV is then extrapolated from these points with simple polynomial equations (Section 2). Such an approach combines the simplicity of lookup tables and physically maintains the dependence on variations. Fig. 1 illustrates the application of this model in statistical design flow to replace traditional compact models. Its accuracy is calibrated by compact models, while this approach is general for all types of compact models and circuit simulations. Section 3 describes the implementation of the model in Verilog-A and evaluates the convergence, accuracy, and speed as compared to BSIM4 at 65 nm node. Finally, the work is concluded in Section 4.

## 2. Finite-Point Based Transistor Model

The I-V and the C-V characteristics of a transistor determine its behavior in any circuit operation. They are modeled in Section 2.1 and 2.2, respectively. Their variability due to variations in the threshold voltage ( $V_{t h}$ ) and the effective channel length $\left(L_{e f f}\right)$ are calibrated by the variational model discussed in Section 2.3.

### 2.1 I-V modeling

The I-V characteristic of a transistor can be classified into three regions: saturation, linear, and subthreshold. Thus, there are three points that are the most important to discriminate these regions. We use $\mathrm{X}\left(V_{x, g}, V_{x, d,} I_{x}\right)$ to represent a data point $\mathrm{X} . V_{x, g}, V_{x, d}$ and $I_{x}$ are gate voltage, drain voltage and the drain source current of $X$, respectively. As shown in Fig. 2, these three points are:
$\mathrm{A}\left(V_{D D}, V_{D D}, I_{o n}\right)$ : the maximum current $I_{o n}$.
$\mathrm{B}\left(V_{D D}, \quad V_{\text {lin }}, \quad I_{\text {lin }}\right)$ : the separation between the saturation and linear region.
$\mathrm{C}\left(V_{\text {start }}, V_{D D}, I_{\text {start }}\right)$ : the boundary of the saturation and subthreshold region.

Given a specific technology, these points can be quantified either from compact transistor models or directly from the measurement. At 65 nm node, $V_{\text {lin }}$ of B is approximately half $V_{d d} ; V_{\text {start }}$ of C is around $V_{t h}+0.1$ to ensure the linearity in the saturation region.

These three points are essential to determine the entire I-V and separate different regions. From these three points, we can rely on the I-V characteristic of each region to extrapolate other points. Such extrapolations can be either physical or empirical, as long as they are sufficiently accurate.

In the saturation region, we can use a linear curve to model the dependence of $I_{d s}$ on gate voltage $V_{g s}$., while $I_{d s}$ is insensitive to $V_{d s}$. The parameter $\alpha$ in the alpha-power law is almost equal to one in the nanoscale regime [10].:

$$
\begin{equation*}
I_{d s} \propto \cdot W \cdot\left(V_{g s}-V_{t h}\right)^{\alpha}=W \cdot\left(V_{g s}-V_{t h}\right) \tag{1}
\end{equation*}
$$

This linear dependence is due to the strong velocity saturation of a short channel device, and is more


Figure 2. Critical points and model definition
pronounced in 90 nm and 65 nm nodes. In the linear region, the drive current has a quadratic dependence on $V_{d s}$. The choice of a quadratic function in the linear region further guarantees the first order continuity of the current from the linear to saturation region.

Besides these three primary points, two more points, $\mathrm{D}\left(V_{\text {start }}, V_{s s a}, I_{s s a}\right)$ and $\mathrm{E}\left(V_{\text {sub }}, V_{D D}, I_{\text {sub }}\right)$, are added to improve the model accuracy. As shown in Fig. 2, D improves the accuracy in the linear region, especially when $V_{g s}$ is low. E is in the subthreshold region and defines the leakage when $V_{g s}<V_{s u b}$. The leakage is further matched by fitting the subthreshold swing.

From the perspective of circuit analysis, the importance of these sampling points can be understood by studying the transistor switching trajectory [10], where the transistor is modeled as a current source driving a load capacitor. These points could be treated as input metrics in gate delay analysis instead of $I_{v d s a t}$ along to achieve better accuracy [11]. Fig. 3 shows the typical NMOS trajectories of an inverter and an NAND2 gate. In a fast input switching, the trajectory will start from point C , travel towards A and then fall to zero through B. In a slow input case, it will start from C, travel towards B then fall to zero. Therefore, by carefully matching of those critical points, we can preserve the accuracy of circuit analysis under various operation conditions. The worst case of this new model happens when there is extremely large fan-out and slow input. Fortunately, this situation is quite rare in an appropriately optimized digital design.

The closed-form piece-wise I-V model is developed based on the above definitions. The model format is summarized as equations (2)-(5):
Deep-subthreshold region:

$$
\begin{equation*}
I_{d s} \propto \exp \left(\frac{V_{g s}-V_{s u b}}{n v_{T}}\right) \cdot V_{d s} \tag{2}
\end{equation*}
$$

## Subthreshold region:

$$
\begin{equation*}
I_{d s} \propto \exp \left(\frac{\ln \left(I_{\text {start }}\right)-\ln \left(I_{\text {sub }}\right)}{V_{\text {start }}-V_{\text {sub }}}\left(V_{g s}-V_{\text {sub }}\right)\right) \cdot V_{d s} \tag{3}
\end{equation*}
$$

Linear region:
$I_{d s}=J \cdot \frac{I_{o n}-I_{\text {lin }}}{V_{D D}-V_{\text {lin }}} \cdot V_{d s}+K \cdot\left(V_{g s}-V_{\text {start }}\right) \cdot\left(\left(M \cdot\left(V_{\text {lin }}-V_{d s}\right)^{2}+I_{\text {lin }}\right)\right.$


Figure 3. The switching trajectory of Inverter and NAND2


Figure 4. I-V model matching evaluation for 65 nm node

Saturation region:

$$
\begin{equation*}
I_{d s}=J \cdot \frac{I_{o n}-I_{\text {lin }}}{V_{D D}-V_{\text {lin }}} \cdot V_{d s}+K \cdot\left(V_{g s}-V_{\text {start }}\right) \cdot I_{\text {lin }} \tag{5}
\end{equation*}
$$

where $n$ is subthreshold swing and $v_{t}$ is the thermal voltage. $\mathrm{J}, \mathrm{K}, \mathrm{M}$ are coefficients decided by the linear dependence of equation (1). The equations (4) and (5) have the same first term, which keeps the equation first order derivative continuous at the boundary. In turn, this guarantees the simulation convergence.

Fig. 4 shows the I-V matching of the model at the 65 nm node with zero body bias and 1V supply voltage, for both PMOS and NMOS devices. When there is non-zero body bias, the model captures its impact through the parameter of $V_{t h}$ at the critical points. More details will be discussed in section 3.3 to model this effect.

### 2.2 C-V modeling

The transistor capacitance model can be divided into two groups, the intrinsic and the extrinsic. The intrinsic capacitances are decided by channel charges and operating regions, which are functions of both geometry and voltages. The extrinsic capacitances are decided by the dimensional parameters of the transistor (i.e., area and periphery of source and drain). In this work, the capacitance model is developed based on the derivations of BSIM4 capmod=1 [12]. The key idea is to find the charge of each node and make sure of their first-order continuity at the boundaries with minimal computation overhead.

A considerable portion of $\mathrm{C}-\mathrm{V}$ equations is smoothing functions, used to enhance convergence at the boundaries of different regions. To lower the computational cost for the C-V model, we can simplify those smoothing functions to reduce computational costly terms such as square, exponential and logarithm. For example, one smoothing function used in BSIM is:
$z=x-0.5\left\{(x-y-\delta)+\sqrt{(x-y-\delta)^{2}+4 \delta x}\right\}$
which is used to connect
$z=\left\{\begin{array}{ll}x & x<y \\ y & x \geq y\end{array}\right\}$
It can be simplified as a piece-wise function:

$$
z=\left\{\begin{array}{cl}
x & x \leq y-\sigma  \tag{8}\\
y-(x-y-\sigma)^{2} /(4 \sigma) & y-\sigma<x<y+\sigma \\
y & x \geq y+\sigma
\end{array}\right\}
$$

where $\sigma$ is a fitting parameter similar to $\delta$.
Experiments show that around $10 \%$ speed improvement can be achieved for $\mathrm{C}-\mathrm{V}$ model once this second order polynomial smooth function is applied. We apply similar techniques all over the capacitance model to enhance the simulation efficiency.

### 2.3 Process variations

Process variations from $V_{t h}$ and $L_{\text {eff }}$ can dramatically change the drive current of a transistor. This effect severely impacts circuit performance such as delay and leakage. Their impact is incorporated at these critical points through analytical equations. Once the variability of these critical points is decided, the whole I-V can be extrapolated using the same method in section 2.1.

In saturation and linear regions, the whole I-V is scalable with the current at A and B. Therefore, $I_{o n}$ and $I_{l i n}$ variability models are important to determine other current points. In a nanoscale transistor, short channel effects, such as drain induced barrier lowering (DIBL), are critical. In particular, $V_{t h}$ exhibits a strong dependence on $L_{e f f}$ and $V_{d s}$ due to DIBL. Based on the physical derivation in [12], we simplify its model for DIBL as:
$V_{t h}\left(L_{e f f}\right)=V_{t h 0}-\alpha \cdot V_{d s} \cdot \frac{0.5}{\cosh \left(\beta \cdot L_{e f f}\right)-1}$
where $V_{t h 0}$ is the long channel $V_{t h}$ and $\alpha, \beta$ is the DIBL coefficient. Combined with the variation of $V_{\text {tho }}$ due to random doping, we have the close form equation:
$V_{t h}=V_{t h, n o \min a l}+\Delta V_{t h}\left(L_{e f f}\right)+\Delta V_{t h}\left(V_{t h 0}\right)$
in which $V_{t h, \text { nominal }}$ is the threshold voltage in nominal case. Furthermore, considering the velocity saturation ( $I_{o n}$ and $I_{l i n}$ both satisfies this condition), we express $I_{d s}$ as:
$I_{d s} \propto v_{s a t} \cdot W \cdot\left(V_{d s}-V_{t h}-V_{d s a t} / 2\right)$
where $v_{s a t}$ and $V_{d s a t}$ are the velocity and voltage at saturation, respectively. Equations (9)-(11) accurately predict the current variability and apply to both $I_{o n}$ and $I_{l i n}$.

Fig. 5 demonstrates the accuracy of this model using $I_{o n}$ of NMOS as an example in 65 nm node. In the presence of both $V_{t h}$ and $L_{\text {eff }}$ variations around $\pm 25 \%$, error is less than $2.5 \%$ for each of the tested corners.

The situations at point C and E are different. Taking C as an example, if the voltage $V_{\text {start }}$ is fixed and the current $I_{\text {start }}$ is changed, C may fall into the subthreshold region due to the reduction of current, which is not allowed. Therefore, it is more practical to fix the current


Figure $5 . I_{n n}$ variability of process variations
and make voltages scalable to the variation for these two points. Using equation (9) and (10), we derive $V_{\text {start }}$ and $V_{\text {sub }}$ as:
$V_{\text {start }}=V_{\text {satr, }, \text { onimalal }}+\Delta V_{\text {th }}$
$V_{\text {sthb }}=\frac{n_{\text {nominal }}}{n} \cdot\left(V_{\text {sul, ,nominal }}+\Delta V_{t h}\right)$
where subthreshold swing $n$ could be fixed to the nominal value for the simplicity reason.

For point D , we observe the dependence of $V_{s s a}$ on $L_{\text {eff }}$, where it can be modeled as a piece-wise linear function:
$V_{\text {sas }}=\left\{\begin{array}{ll}\delta 1 \cdot \Delta L_{e f f}+V_{\text {sasarominal }} & \Delta L_{e f f}>0 \\ \delta 2 \cdot \Delta L_{e f f}+V_{\text {ssar, onminal }} & \Delta L_{e f f} \leq 0\end{array}\right\}$
in which $\delta 1$ and $\delta 2$ are fitting parameters decided in the calibration.

Equations (11)-(14) propose a complete set of analytical variation models for the finite critical points. For the C-V model, because the node charges are function of $V_{t h}$ and $L_{e f f}$, the variability due to them has already been included. With the increasing number of process corner, the finite-point model requires much smaller sizes compared to that of the lookup tables. It also benefits from excellent model scalability.

## 3. Model Evaluation

The proposed finite-point based model is simple in mathematics, requires a much fewer number of parameters, and ease the extraction and simulation. The evaluation of its convergence, accuracy, sensitivity and speed is presented in this section, as compared to BSIM.

### 3.1 Experimental setup

The model is implemented and tested under various conditions by varying input slew $T_{r,}$, load capacitor $C_{\text {load }}$, fan-out $F O, V_{t h 0}$ and $L_{e f f}$ : Low to high switching delay $t_{L H}$, high to low switching delay $t_{H L}, 10 \%-90 \%$ output rising transition time $t_{\text {rise }}, 90 \%-10 \%$ output falling transition time $t_{\text {fall }}$, and oscillating frequency $f$ are set as evaluating


Figure 6. Waveform evaluation using 2-stage NAND2
objects. As shown in Fig. 1, the evaluation procedure is listed as following:

1) Implement the model using Verilog-A.
2) Extract five critical points with HSPICE using BSIM4 65nm PTM model card [13].
3) Calibrate the variation model for these points.
4) Simulate benchmark circuits such as inverter, NAND2, AOI and NAND chain oscillator in nominal and variational cases in SPICE.
5) Implement BSIM4 model using Verilog-A. Compare both accuracy and efficiency.

### 3.2 Nominal circuit analysis

The convergence and accuracy of our model can be evaluated through the switching waveform, delay, and output slew of benchmark circuits.

The switching waveforms are used to evaluate the overall quality of the model. Non-smooth waveform or even possible failure of the simulation may happen when there is any convergence problem. The new model exhibits good convergence during simulation due to the continuity of derivatives for both the I-V and C-V. As an example, the waveforms produced by a 2 -stage NAND2 are evaluated in Fig. 6, in which the model convergence can be evaluated with stack gates and gate capacitance. The switching waveforms coordinate in all the switching moments and capture the overshoot/undershoot smoothly without showing any convergence difficulty. An excellent waveform matching is achieved.

The input slew and fan-out play important roles during timing analysis. To ensure the generality needed in real design, the model needs to be accurate enough in a reasonable range for both the variables. Delay and output slews of a NAND2 gate are verified in Fig. 7 and Fig. 8, in which we tune the range of input slew and fan-out respectively. Fig. 7 shows good matching when input slew varies from 20 ps to $250 \mathrm{ps}(\mathrm{FO}=4)$. The error is less than $6 \%$. Excellent matching is achieved in Fig. 8 when FO varies from 1 to 9 and the input slew is 60 ps. The error is less than $5 \%$. These results confirm the model accuracy and generality.


Figure 7. Output slew and delay of NAND2 with various $T_{r}$.


Figure 8. Output slew and delay of NAND2 with various FO.

### 3.3 Variational circuit simulation

The model is developed with the ability to capture delay variability under process variations. In Fig. 9, sensitivity tests are performed in NAND2 with $\mathrm{FO}=3$. Only NMOS variations are included for the simulation simplicity. As the drive current through NMOS is significantly impacted, $t_{H L}$ suffers from a large amount of variability. Excellent matching is achieved compared with the BSIM simulation results, with error less than $6 \%$.

To further illustrate the sensitivity of the model, the frequency of a 9-stage $\mathrm{FO}=3$ NAND2 chain oscillator is tested under a various corners. We assume that all the NMOS are $100 \%$ correlated; all the PMOS transistors are strongly correlated, but totally independent on NMOS variations. The fast (F) corner is when both $V_{t h}$ and $L_{e f f}$ have $-30 \%$ variations; the slow corner (S) has $30 \%$ variations. Results are listed in Table 1. The overall error is less than $5 \%$ in this study. It demonstrates the model's capability to match all the extreme cases under process variations.

In statistical circuit simulation, the model is required to predict not only the nominal, but more importantly, the distribution of gate delay under statistical variations. We implement the model into Monte-Carlo simulation to get the probability density function (PDF) of delay. Based on


Figure 9. $T_{\text {HL }}$ of NAND2 under NMOS variations

Table 1. Comparison of frequency $f$ of 9 -stage NAND2 ring oscillator ( $\mathrm{FO}=3$ ).

| Variation Corners |  | Frequency (Ghz) |  |  |
| :---: | :---: | :---: | :---: | :--- |
| NMOS | PMOS | Model | HSPICE | Error |
| F | F | 5.38 | 5.52 | $2.5 \%$ |
| F | N | 4.12 | 4.31 | $4.4 \%$ |
| F | S | 3.15 | 3.15 | $0 \%$ |
| N | F | 3.85 | 3.98 | $0.7 \%$ |
| N | N | 2.89 | 2.98 | $2.7 \%$ |
| N | S | 2.27 | 2.21 | $2.7 \%$ |
| S | F | 2.82 | 2.84 | $0.7 \%$ |
| S | N | 2.11 | 2.08 | $1.4 \%$ |
| S | S | 1.66 | 1.59 | $4.4 \%$ |

the assumption that both NMOS and PMOS have independent $L_{\text {eff }}$ and $V_{t h}$ variations that follow the Gaussian distribution with $3 \sigma$ at $30 \%$ of mean and 30 mV respectively, 1000 times of SPICE simulations are performed for statistical analysis. Fig. 10 shows an example of the NAND2 gate delay distribution under various input slews. With larger input slew, the distribution spreads to a wider range. The model successfully captures this trend. A more complicated AOI gate is further implemented to check the delay distribution under two different switching conditions, i.e., the worst case and the best case. The worst case is when NAND2 switches with the lower gate and NOR2 switches with the upper gate; vise versa is the best case. As shown in Fig. 11, the worst case switching leads to a wider delay distribution, since the variations of both stack transistors affect the delay. Our model accurately captures this phenomenon with error of less than $6 \%$.

### 3.4 Simulation speed

In addition to sufficient accuracy, the newly developed model achieves significant speed-up from the model simplicity. Fewer parameters and lower order polynomial equations greatly improve the simulation efficiency. Speed tests are conducted in both nominal transient simulations and Monte-Carlo simulations using various benchmark circuits in SPICE. The CPU running


Figure 10. The distribution of delay variability of NAND2


Figure 11. The distribution of delay variability of AOI
time are compared between our model and the BSIM4 model. To make it generic, two step size modes, fixed step size and dynamic step size, are used in the tests. The result from the first mode clearly indicates the computation speed of the model, while the result from the second mode reflects the iteration condition of the simulator. Test results are listed in Tab. 2, which shows that our model is at least six times faster than BSIM4 in fixed step size mode (maximal step size $<1 \mathrm{ps}$ ) and even faster in dynamic st ep size mode due to less iterations. Monte-Carlo simulation results in Table 2 show a further speed-up. The speed of statistical simulations is nine times faster due to significant reduction in setup time, which is the result of

Table 2. Model simulation speed compared with BSIM4.

| Transient Simulation |  |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | Transient time (10ns) | BSIM4 |  | Finite Point |  | Speed |
|  |  | step | time(s) | step | time(s) |  |
| $\begin{array}{\|l\|} \hline 9 \text {-stage FO3 } \\ \text { NAND2 } \\ \text { oscillator } \\ \hline \end{array}$ | Max step=1ps | 10008 | 314 | 10073 | 48 | 1/6.5x |
|  | Dynamic step | 4310 | 156 | 2993 | 19 | 1/8.2x |
| 4-bit static adder | Max step=1ps | 10289 | 587 | 10190 | 95 | 1/6.2x |
|  | Dynamic step | 4232 | 293 | 3583 | 43 | 1/6.8x |
| Monte-Carlo Simulation (1000 times) |  |  |  |  |  |  |
|  | $\begin{aligned} & \text { Transient time } \\ & (1000 * 10 \mathrm{~ns}) \end{aligned}$ |  | $\begin{aligned} & \text { IM4 } \\ & 1 \text { time } \end{aligned}$ |  | Point lime | Speed |
| AOI | Dynamic step | 556 | 6 (s) |  | (s) | 1/9.6x |
| NAND2 oscillator | Dynamic step | 181 | 0 (m) |  | (m) | 1/9.3x |

much fewer parameters. This advantage of shorter setup time is especially important when the total number of transistors keeps increasing in a VLSI design. The speed test results along with other evaluation results indicate that our model improves model efficiency significantly without any tradeoff of accuracy.

## 4. Conclusion

The innovation of this work is to develop a transistor model based on five critical data points. The impact of process variations is further embedded through these points. Using simple equations to extrapolate I-V and C-V, the model achieves high speed with excellent accuracy in statistical circuit analysis. Experimental results prove the simulation efficiency over conventional compact transistor models. This model is compatible with SPICE simulators. It can be easily calibrated by various compact models, such as BSIM or PSP, or silicon measurement.

## Acknowledgment

The authors would like to thank N. Hakim at Intel, K. Singhal and M. Shahram at Synopsys for valuable discussions to this project. This work is partially sponsored by MARCO MSD and C2S2, and Intel.

## References

[1] P. A. Stolk, F. P. Widdershoven, and D. B. M. Klaassen, "Modeling statistical dopant fluctuations in MOS transistors," IEEE TED, vol. 45, no. 9, pp. 1960-1971, 1998.
[2] D. Boning and S. Nassif, "Models of process variations in device and interconnect," Design of High-Performance Microprocessor Circuit, Chapter 6, pp. 98-115, IEEE Press, 2000.
[3] M. Orshansky, K. Milor, P. Chen, K. Keutzer, and C. Hu, "Impact of spatial intrachip gate length variability on the performance of high-speed digital circuits," $T C A D$, vol. 21, pp. 544-553, 2002.
[4] M. Orshansky and K. Keutzer, "A general probabilistic framework for worst case timing analysis," DAC, pp. 556-561, Jun. 2002
[5] C. Viswesvariah, "Statistical timing of digital integrated circuits," ISSCC, 2004.
[6] Y. Cao and L. Clark, "Mapping statistical process variations toward circuit performance variability: An Analytical Modeling Approach," $D A C$, pp. 658-663, 2005.
[7] Y. Cheng, et al., "A physical and Scalable $I-V$ Model in BSIM3v3 for Analog /Digital Circuit Simulation," IEEE Trans. Electron Devices, vol. 44, no. 2, pp. 277-287, Feb. 1997.
[8] G. Gildenblat, et al., "PSP: an advanced surface-potential-based MOSFET model for circuit simulation," IEEE Trans. Electron Devices, vol. 53, no. 9, pp. 1979-1993, Sep. 2006,
[9] D. Nadezhin, et al., "SOI transistor model for fast transient simulation," ICCAD, pp 120-127, Nov. 2003.
[10] T. Sakurai and A. R. Newton, "Alpha-power law MOSFET model and its application to CMOS inverter delay and other formulas," JSSC, vol. 25, no. 2, pp. 584-594, Apr. 1990.
[11] M. H. Na, E.J. Nowak, W.Haensch, and J. Cai, "The effective drive current in CMOS Inverters," IEDM, pp. 121-124, 2004
[12] BSIM4 MOSFET Model - User's Manual, 2006.
[13] W. Zhao and Y. Cao, "New generation of predictive technology model for sub-45nm early design exploration," IEEE Trans. Electron Devices, vol. 53, no. 11, pp. 2816-2823, Nov. 2006.

