# **High-Performance Heat Sinking for VLSI**

D. B. TUCKERMAN AND R. F. W. PEASE

Abstract-The problem of achieving compact, high-performance forced liquid cooling of planar integrated circuits has been investigated. The convective heat-transfer coefficient h between the substrate and the coolant was found to be the primary impediment to achieving low thermal resistance. For laminar flow in confined channels, h scales inversely with channel width, making microscopic channels desirable. The coolant viscosity determines the minimum practical channel width. The use of high-aspect ratio channels to increase surface area will, to an extent, further reduce thermal resistance. Based on these considerations, a new, very compact, water-cooled integral heat sink for silicon integrated circuits has been designed and tested. At a power density of 790 W/cm<sup>2</sup>, a maximum substrate temperature rise of 71°C above the input water temperature was measured, in good agreement with theory. By allowing such high power densities, the heat sink may greatly enhance the feasibility of ultrahigh-speed VLSI circuits.

## INTRODUCTION

The advent of systems employing high-speed, high-density, very-large-scale integrated (VLSI) circuits implies the requirement for effective and compact heat removal. For example, high-speed digital circuits employing submicron channel lengths yet dissipating 1 mW per gate have been reported recently [1]. A VLSI circuit containing $10^5$ such gates would thus dissipate 100 W. Conventional IC packages typically have thermal resistances of $50^{\circ}$C/W and hence would be totally unsuitable for such circuits.

Although an isolated chip dissipating 100 W could be cooled by forced-air convection, an array of such chips (closely spaced to minimize propagation delays) presents a far more difficult cooling problem because of the large size ( $\geq 10$  cm on a side) of high-performance forced-air heat exchangers. Liquid cooling promises to be a more compact arrangement and its use has been reported recently for cooling the central processing unit of a large computing system [2]. In that system, heat is conducted through aluminum pistons spring-loaded onto the back of each chip, through cylinder walls surrounding the pistons, and into a heat exchanger. The thermal resistance of such a package allows a dissipation of about 3 or 4 W per chip. Even more compact configurations have been proposed by integrating the heat exchanger with the silicon chip [3].

There have been suggestions that physical limits of heat-transfer technology will limit the power density of arrays of planar circuits to 20 W/cm<sup>2</sup> or so [4]. In this letter we show that by scaling liquid-cooled heat-exchanger technology to microscopic dimensions, circuit power densities of more than 1000 W/cm<sup>2</sup> should be feasible. To demonstrate these principles, we have constructed a very compact water-cooled heat sink which is an integral part of the silicon substrate, which exhibits a maximum thermal resistance of $0.09^{\circ}$C/W over 1 cm<sup>2</sup> area, and which has been tested up to 790 W/cm<sup>2</sup>.

Manuscript received March 10, 1981; revised March 31, 1981.

The authors are with Stanford Electronics Laboratories, Stanford, CA 94305.

## THEORY

The performance of a heat sink is measured by its thermal resistance $\theta = \Delta T/\dot{Q}$, where $\Delta T$ is the temperature rise of the circuit above the input coolant temperature (often room temperature) and $\dot{Q}$ is the dissipated power. In forced-convection cooling, $\theta$ is nearly independent of power level. Because semiconductor ICs typically have maximum operating temperatures of $\Delta T_{max} = 50^{\circ}$C to $100^{\circ}$C above room temperature, thermal resistance determines the maximum power at which an IC can operate. In general $\theta$ is the sum of three components: $\theta_{cond}$, due to conduction from the circuits through the substrate, package, and heat-sink interface; $\theta_{conv}$, due to convection from the heat sink to the coolant fluid; and $\theta_{heat}$, due to heating of the fluid as it absorbs energy passing through the heat exchanger.
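As a quick numerical illustration of this definition, the sketch below evaluates $\theta$ from the figures quoted in the abstract (a 71°C rise at 790 W over roughly 1 cm<sup>2</sup>) and the power that a given $\theta$ permits for a chosen $\Delta T_{max}$. The function names are illustrative and do not come from the paper.

```python
def thermal_resistance(delta_t_c, power_w):
    """theta = delta_T / Q, in degC/W."""
    return delta_t_c / power_w


def max_power(delta_t_max_c, theta_c_per_w):
    """Largest dissipated power (W) that keeps the rise within delta_t_max."""
    return delta_t_max_c / theta_c_per_w


# Figures from the abstract: 71 degC rise at 790 W over about 1 cm^2.
theta_measured = thermal_resistance(71.0, 790.0)

# A conventional 50 degC/W package (Introduction) with a 50 degC allowance.
p_conventional = max_power(50.0, 50.0)

print(f"theta_measured = {theta_measured:.3f} degC/W")
print(f"conventional package limit = {p_conventional:.1f} W")
```

The first result is close to the 0.09°C/W quoted for the tested device, while the second shows why a 50°C/W package cannot serve a 100 W chip.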

We can make  $\theta_{cond}$  very small by locating the heat exchanger (containing the flowing coolant) very near to the heat source. Fortunately silicon, the substrate used for most planar ICs, has a high thermal conductivity ( $k_{si} = 1.48 \text{ W/°C-cm}$  at 27°C for lightly-doped Si; about 1/3 of copper's thermal conductivity) [5]. If an IC substrate is thinned to 100  $\mu$ m and its back side is in intimate thermal contact with the heat exchanger, then  $\theta_{cond}$  is only 0.007°C/W for a 1-cm<sup>2</sup> circuit.
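The 0.007°C/W figure can be reproduced with the one-dimensional slab formula $\theta_{cond} = t/(k_{si} A)$. That formula is an assumption here, since the text gives only the result; the function name is likewise illustrative.

```python
def conduction_resistance(thickness_cm, k_w_per_cm_c, area_cm2):
    """One-dimensional slab conduction: theta = t / (k * A), in degC/W."""
    return thickness_cm / (k_w_per_cm_c * area_cm2)


K_SI = 1.48            # W/(degC*cm), lightly doped Si at 27 degC (from the text)
THICKNESS_CM = 100e-4  # 100 um substrate
AREA_CM2 = 1.0         # 1 cm^2 circuit

theta_cond = conduction_resistance(THICKNESS_CM, K_SI, AREA_CM2)
print(f"theta_cond = {theta_cond:.4f} degC/W")
```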

We can reduce $\theta_{\text{heat}}$ by using a coolant of high volumetric heat capacity $\rho C_p$ at a sufficiently high flow rate $f$ ($\theta_{\text{heat}} = 1/\rho C_p f$). Water is a particularly good choice ($\rho C_p = 4.18\ \text{J/°C-cm}^3$), with a modest flow rate of 10 cm<sup>3</sup>/s contributing only $0.024^{\circ}\text{C/W}$ to the thermal resistance.
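The fluid-heating term is simple enough to check directly. The following minimal sketch evaluates $\theta_{heat} = 1/\rho C_p f$ for water at the flow rate mentioned above; the function name is an illustrative choice.

```python
RHO_CP_WATER = 4.18  # J/(degC*cm^3), from the text


def theta_heat(flow_cm3_per_s, rho_cp=RHO_CP_WATER):
    """Thermal resistance due to coolant heating, in degC/W."""
    return 1.0 / (rho_cp * flow_cm3_per_s)


theta_heat_10 = theta_heat(10.0)
print(f"theta_heat at 10 cm^3/s = {theta_heat_10:.3f} degC/W")
```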

Because $\theta_{cond}$ and $\theta_{heat}$ can be made very small by rather obvious means, we expect that the convective thermal resistance, $\theta_{conv}$, will be the dominant consideration in high-performance heat sink design. In fact a naive approach to liquid cooling in which water simply flows over the back of a circuit substrate can result in $\theta_{conv}$ being orders of magnitude above the other thermal resistances. It is therefore necessary to examine some aspects of convective heat-transfer theory [6].

0193-8576/81/0500-0126\$00.75 © 1981 IEEE

Table 1. Experimental values of maximum thermal resistance $\Theta_{\max}$ for three integral water-cooled silicon heat sinks of specified channel width $w_c$ and depth $z$, wall thickness $w_w$, water pressure $P$, and flow rate $f$. The heated area was approximately (1 cm) × (1 cm), and the heat sinks were tested up to a specified maximum power density $\dot{Q}$.

| Expt | $w_c$ (µm) | $w_w$ (µm) | $z$ (µm) | $P$ (psi) | $f$ (cm<sup>3</sup>/s) | $\Theta_{\max}$ (°C/W) | $\dot{Q}$ (W/cm<sup>2</sup>) |
|------|------------|------------|----------|-----------|------------------------|------------------------|------------------------------|
| 1    | 56         | 44         | 320      | 15        | 4.7                    | 0.110                  | 181                          |
| 2    | 55         | 45         | 287      | 17        | 6.5                    | 0.113                  | 277                          |
| 3    | 50         | 50         | 302      | 31        | 8.6                    | 0.090                  | 790                          |

Consider a collection of *n* parallel channels each of length *L*, embedded in a substrate of the same length *L* and width *W*. A coolant flows in each channel, absorbing a constant heat flow per unit length $\dot{Q}/nL$ from its walls (the substrate). For example, these channels might be etched directly in the back of a silicon IC chip. The use of many separate channels, rather than a single coolant flow over the entire back substrate surface, allows us to multiply the substrate surface area by a factor $\alpha$. Specifically, we define $\alpha =$ (total surface area of channel walls in contact with fluid) $\div$ (area of circuit). At each cross-section along the length of the channel, we initially assume that the walls are infinitely thermally conductive so that the temperature is uniform around the perimeter. The convective heat-transfer coefficient *h* is then defined as $h = \dot{Q} / nLp(T_w - T_f)$, where $T_w$ is the wall temperature, $T_f$ is the mean fluid temperature, and *p* is the cross-sectional perimeter. Then $\theta_{conv} = 1/hnLp = 1/h\alpha LW$, so that for a given circuit area *LW*, we clearly want to make both *h* and $\alpha$ large. Whereas it is well known that the use of extended-surface (large-$\alpha$) structures such as fins will enhance heat transfer, the importance of making *h* large has received less attention.

It is customary to calculate h using dimensionless groups:

- $\mathrm{Nu} = hD/k_f$, the Nusselt number, a dimensionless heat-transfer coefficient;
- $\mathrm{Pr} = \mu C_p/k_f$, the Prandtl number, a property of the fluid (Pr = 6.4 for water at 23°C);
- $\mathrm{Re} = vD\rho/\mu$, the Reynolds number.

Here $D$ is a "characteristic width" of the channel, defined as $D = 4$ (cross-sectional area) ÷ (perimeter $p$). For high-aspect ratio rectangular channels, $D$ is approximately equal to twice the channel width. The terms $\mu$, $k_f$, $\rho$, $C_p$, and $v$ denote respectively the viscosity, thermal conductivity, density, specific heat, and mean velocity of the coolant fluid. Noting that the characteristic width $D$ is likely to be small because the channels must be very close to the circuits to minimize $\theta_{cond}$, we tentatively assume laminar flow (a valid assumption when $\text{Re} \leq 2100$). For calculating Nu, we further assume that the flow is "fully-developed," i.e. invariant along the channel length (a good assumption if $\text{Pr} \geq 5$, as is the case for most liquids). Then Nu is a monotonically decreasing function of $x/(D \cdot \text{Re} \cdot \text{Pr})$, where $x$ is the distance from the entrance of the channel $(0 \leq x \leq L)$. Asymptotic formulas are:
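The definition of $D$ given above can be made concrete for a closed rectangular channel of width $w_c$ and depth $z$. The sketch below is illustrative only (the function name and the sample dimensions are assumptions) and shows how $D$ approaches $2w_c$ as the aspect ratio grows.

```python
def characteristic_width(w_c, z):
    """D = 4 * (cross-sectional area) / (perimeter) for a w_c-by-z rectangle."""
    area = w_c * z
    perimeter = 2.0 * (w_c + z)
    return 4.0 * area / perimeter


# Ratio D / (2 * w_c) for increasing aspect ratio z / w_c (w_c = 50 um).
for aspect in (1, 6, 100):
    d = characteristic_width(50.0, 50.0 * aspect)
    print(f"z/w_c = {aspect:>3}: D = {d:6.1f} um, D/(2 w_c) = {d / 100.0:.3f}")
```

For a square channel $D$ equals the side length, so the factor of two applies only in the high-aspect-ratio limit.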

$$\mathrm{Nu} \propto \left(\frac{x}{D \cdot \text{Re} \cdot \text{Pr}}\right)^{-1/3} \quad \text{for } x/(D \cdot \text{Re} \cdot \text{Pr}) \ll 0.02;$$

$\mathrm{Nu} \simeq \mathrm{Nu}_{\infty}$, a constant, for $x/(D \cdot \text{Re} \cdot \text{Pr}) \ge 0.02$ ("fully-developed temperature profile").

Not knowing *a priori* which region we are in, we conservatively assume that Nu has the minimum, asymptotic (large x) value  $Nu_{\infty}$ ; in any case the dependence of Nu on x is weak. The exact value of  $Nu_{\infty}$  depends on the shape of the channel cross section but is usually between 3 and 9.

Thus we approximate  $h=k_f \operatorname{Nu}_{\infty}/D$ , where  $\operatorname{Nu}_{\infty}$  is between 3 and 9. This result is consistent with an intuitive model for convection in which the heat is conducted through the fluid to the middle of the channel, where it is transported away by the flow. For a given coolant fluid, clearly the only way to significantly increase h is to reduce D. Achieving very high values of h therefore requires channels of microscopic width.
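To see how strongly $h$ depends on $D$, the sketch below evaluates $h = k_f \mathrm{Nu}_{\infty}/D$ and the resulting ideal-wall $\theta_{conv} = 1/h\alpha LW$ for two channel sizes. The water conductivity of 0.006 W/°C-cm is an assumed handbook-style value that the text does not state, and $\mathrm{Nu}_{\infty} = 6$ is simply a value inside the 3 to 9 range given above.

```python
K_F_WATER = 0.006  # W/(degC*cm); assumed value for water near room temperature
NU_INF = 6.0       # assumed, within the 3-to-9 range quoted in the text


def heat_transfer_coefficient(nu_inf, k_f, d_cm):
    """h = k_f * Nu_inf / D, in W/(cm^2*degC)."""
    return k_f * nu_inf / d_cm


def theta_conv_ideal(h, alpha, length_cm, width_cm):
    """theta_conv = 1 / (h * alpha * L * W) for infinitely conductive walls."""
    return 1.0 / (h * alpha * length_cm * width_cm)


# A 1 mm passage versus a 100 um passage, no area multiplication, 1 cm^2 chip.
for d_cm in (0.1, 0.01):
    h = heat_transfer_coefficient(NU_INF, K_F_WATER, d_cm)
    theta = theta_conv_ideal(h, 1.0, 1.0, 1.0)
    print(f"D = {d_cm * 1e4:6.0f} um: h = {h:.2f} W/cm^2-degC, theta_conv = {theta:.2f} degC/W")
```

Under these assumptions a tenfold reduction in $D$ gives a tenfold reduction in $\theta_{conv}$, before any benefit from $\alpha$ is counted.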

The only important lower limit on channel size is set by the coolant viscosity. For a given pump pressure, the volumetric flow rate decreases rapidly as D is reduced, resulting in an increase in  $\theta_{heat}$ . By assuming a practical limit on the available pressure, we can calculate an optimum channel size D which minimizes the sum of  $\theta_{conv}$  and  $\theta_{heat}$ . A more fundamental limit on channel size occurs when the pumping power becomes comparable to the circuit power dissipation (and hence viscous heating becomes significant), but this only occurs at impractically high pressures.

Increasing the channel aspect ratio (i.e., increasing $\alpha$) can further reduce $\theta_{conv}$. However, we had assumed infinitely conductive channel walls; for a substrate with finite thermal conductivity, there is little benefit in increasing $\alpha$ beyond the point at which thermal resistance due to conduction along the length of the walls becomes comparable to convective thermal resistance.

## DESIGN

Figure 1 is a diagram of a high-performance IC heat sink which embodies the principles just discussed. The front surface of the substrate (length $L$, width $W$) contains a planar heat source (the circuits), and the back surface contains deep rectangular channels of width $w_c$ and depth $z$ which carry the coolant, separated by walls of width $w_w$. Neglecting the heat transferred at the top and bottom of the channels, the surface-area multiplication factor due to the channels is $\alpha = 2z/(w_c + w_w)$. A cover plate is bonded to the back of the substrate to confine the coolant to the channels. We will neglect $\theta_{cond}$ in our discussion, because it can be made very small independently of $\theta_{conv}$ and $\theta_{heat}$ by making the substrate only slightly thicker than the channel depth $z$.
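As an illustration of the expression for $\alpha$, the sketch below applies it to the three channel geometries listed in Table 1. The dictionary layout and function name are illustrative choices.

```python
def area_multiplication(z_um, w_c_um, w_w_um):
    """alpha = 2 z / (w_c + w_w), neglecting channel tops and bottoms."""
    return 2.0 * z_um / (w_c_um + w_w_um)


# (w_c, w_w, z) in micrometres for experiments 1 to 3 of Table 1.
table_1 = {1: (56, 44, 320), 2: (55, 45, 287), 3: (50, 50, 302)}

alphas = {
    expt: area_multiplication(z, w_c, w_w)
    for expt, (w_c, w_w, z) in table_1.items()
}
for expt, alpha in alphas.items():
    print(f"expt {expt}: alpha = {alpha:.2f}")
```

All three devices therefore have roughly six times more wetted wall area than circuit area.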

Fig. 1. Schematic view of the compact heat sink incorporated into an integrated circuit chip. For a 1 cm<sup>2</sup> silicon IC using a water coolant, the optimum dimensions are approximately $w_w = w_c = 57\ \mu$m and $z = 365\ \mu$m.

Recall that $\theta_{conv} = 1/h\alpha LW = D/k_f \mathrm{Nu}_{\infty} \alpha LW$ for infinitely conductive walls. To account for a finite wall conductivity $k_w$ (which implies a nonuniform temperature up the walls), we can multiply by a correction factor $\eta^{-1}$, where $\eta$ is known as the "fin efficiency." Approximating $D$ as $2w_c$ for high-aspect ratio channels, we have

$$\theta_{\rm conv} = \frac{2}{k_f \, {\rm Nu}_\infty L W} \left( w_c \alpha^{-1} \eta^{-1} \right). \tag{1}$$

We can get an analytical approximation for  $\eta$  by assuming a constant heat-transfer coefficient h up the walls (a good assumption provided  $\eta$  is not too small) and modeling the heat flow in the walls as one-dimensional:

$$\eta = \frac{\tanh N}{N},$$

where

$$N = (2h/k_w w_w)^{1/2} z = (\mathrm{Nu}_{\infty} k_f / k_w)^{1/2} \frac{w_c + w_w}{2(w_c w_w)^{1/2}} \alpha. \tag{2}$$

$\eta$ is thus a monotonically decreasing function of $N$, with $\eta \simeq 1$ for $N \ll 1$ and $\eta \simeq N^{-1}$ for $N \gg 1$.
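The limiting behaviour of the fin efficiency is easy to confirm numerically. The sketch below implements $\eta = \tanh N / N$ and the second form of equation 2; the function names are illustrative, and the material values in the example ($k_f = 0.006$ W/°C-cm for water, $\mathrm{Nu}_{\infty} = 6$) are assumptions used only to exercise the formula.

```python
import math


def fin_efficiency(n):
    """eta = tanh(N) / N, with the N -> 0 limit handled explicitly."""
    if n == 0.0:
        return 1.0
    return math.tanh(n) / n


def fin_parameter(nu_inf, k_f, k_w, w_c, w_w, alpha):
    """N from equation 2; w_c and w_w only need to share the same units."""
    geometry = (w_c + w_w) / (2.0 * math.sqrt(w_c * w_w))
    return math.sqrt(nu_inf * k_f / k_w) * geometry * alpha


for n in (0.01, 1.0, 50.0):
    print(f"N = {n:5.2f}: eta = {fin_efficiency(n):.4f}, 1/N = {1.0 / n:.4f}")

# For a fixed alpha, equal wall and channel widths give the smallest N.
n_equal = fin_parameter(6.0, 0.006, 1.48, 50.0, 50.0, 6.0)
n_unequal = fin_parameter(6.0, 0.006, 1.48, 40.0, 60.0, 6.0)
print(f"N (50/50 um) = {n_equal:.3f}, N (40/60 um) = {n_unequal:.3f}")
```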

As discussed, there will probably be some maximum pressure P available to pump the coolant. The mean flow velocity v in our high-aspect ratio channels can then be calculated, assuming laminar flow between parallel plates:  $v = w_c^2 P/12\mu L$ . The total volume flow rate is easily seen to be  $f = \frac{1}{2}vWw_c\alpha$ , whence

$$\theta_{\rm heat} = \frac{1}{\rho C_p f} = \frac{24\mu L}{\rho C_p P W} \left(w_c^{-3} \alpha^{-1}\right). \tag{3}$$
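The chain from pressure to $\theta_{heat}$ can be sketched in a few lines. The example below uses the design values quoted later in this letter ($w_c = 57\ \mu$m, $\alpha = 6.4$, $P = 30$ psi, $L = W = 1$ cm) and an assumed water viscosity of 0.0093 poise, which the text does not state. Working in cgs units, the ratio $\mu/P$ comes out in seconds, so it combines directly with $\rho C_p$ in J/°C-cm<sup>3</sup>.

```python
PSI_TO_DYN_PER_CM2 = 6.895e4  # 1 psi expressed in dyn/cm^2

MU = 0.0093       # poise (dyn*s/cm^2); assumed viscosity of water near 23 degC
RHO_CP = 4.18     # J/(degC*cm^3), from the text
P = 30.0 * PSI_TO_DYN_PER_CM2
L = W = 1.0       # cm
W_C = 57e-4       # cm
ALPHA = 6.4


def mean_velocity(w_c, pressure, mu, length):
    """Laminar flow between parallel plates: v = w_c^2 P / (12 mu L)."""
    return w_c**2 * pressure / (12.0 * mu * length)


def flow_rate(v, width, w_c, alpha):
    """Total volume flow: f = (1/2) v W w_c alpha."""
    return 0.5 * v * width * w_c * alpha


def theta_heat_eq3(w_c, alpha, pressure, mu, length, width, rho_cp=RHO_CP):
    """Equation 3 in closed form."""
    return 24.0 * mu * length / (rho_cp * pressure * width) / (w_c**3 * alpha)


v_design = mean_velocity(W_C, P, MU, L)
f_design = flow_rate(v_design, W, W_C, ALPHA)
theta_a = 1.0 / (RHO_CP * f_design)
theta_b = theta_heat_eq3(W_C, ALPHA, P, MU, L, W)

print(f"v = {v_design:.0f} cm/s, f = {f_design:.1f} cm^3/s")
print(f"theta_heat = {theta_a:.4f} degC/W (closed form {theta_b:.4f})")
```

The strong $w_c^{-3}$ dependence is what eventually penalizes very narrow channels at fixed pressure.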

We seek an optimum choice of design variables  $w_w, w_c$ , and  $\alpha$  which minimizes the total thermal resistance  $\theta_{conv}(w_w, w_c, \alpha) + \theta_{heat}(w_c, \alpha)$ . Referring to equations 1 and 2, we see that for any  $w_c$  and  $\alpha$ , we can minimize thermal resistance by maximizing  $\eta$ , which means  $w_w = w_c$ .

IEEE ELECTRON DEVICE LETTERS, VOL. EDL-2, NO. 5, MAY 1981

Both $\theta_{conv}$ and $\theta_{heat}$ decrease monotonically with increasing $\alpha$, so there is no theoretical optimum value for $\alpha$. However, the fin efficiency $\eta$ rolls off as $\alpha^{-1}$ for large $\alpha$, hence $\theta_{conv}$ asymptotically approaches a lower limit $\theta_{\min} = w_c/k_{eff}LW$, where $k_{eff} = \sqrt{\frac{1}{4}\mathrm{Nu}_{\infty}k_wk_f}$ can be viewed as an effective thermal conductivity for the heat sink assembly. This result is significant, for it indicates the highest performance (lowest $\theta$) which we can expect to achieve with liquid cooling (within the framework of our model), given the channel width and substrate and coolant thermal conductivities. For a water-cooled silicon substrate with very high-aspect ratio channels, $k_{eff} = 0.13\ \text{W/°C-cm}$. The maximum allowable circuit power density is $(\dot{Q}/LW) = (\Delta T_{\max})k_{eff}/w_c$, which confirms that microscopically narrow channels are the key to efficient heat removal: if $w_c < 50\ \mu$m and $\Delta T_{\max} = 50^{\circ}$C, then over 1300 W/cm<sup>2</sup> can be dissipated!
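A rough numerical check of $k_{eff}$ and of the 1300 W/cm<sup>2</sup> figure follows. Two inputs are assumptions not given in the text: a water conductivity of 0.006 W/°C-cm, and $\mathrm{Nu}_{\infty} = 8.235$, the textbook fully developed laminar value for parallel plates with uniform heat flux on both walls, taken here to represent the very high-aspect ratio limit.

```python
import math

K_W = 1.48             # W/(degC*cm), silicon (from the text)
K_F = 0.006            # W/(degC*cm), water; assumed
NU_INF_PLATES = 8.235  # parallel-plate limit; assumed for very high aspect ratio


def effective_conductivity(nu_inf, k_w, k_f):
    """k_eff = sqrt(Nu_inf * k_w * k_f / 4)."""
    return math.sqrt(0.25 * nu_inf * k_w * k_f)


def max_power_density(delta_t_max, k_eff, w_c_cm):
    """(Q / LW) = delta_T_max * k_eff / w_c, in W/cm^2."""
    return delta_t_max * k_eff / w_c_cm


k_eff = effective_conductivity(NU_INF_PLATES, K_W, K_F)
q_max = max_power_density(50.0, 0.13, 50e-4)  # using the paper's k_eff = 0.13

print(f"k_eff = {k_eff:.3f} W/degC-cm")
print(f"max power density at w_c = 50 um: {q_max:.0f} W/cm^2")
```

With these assumed inputs $k_{eff}$ lands within a few percent of the quoted 0.13 W/°C-cm.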

For a practical design, we choose an aspect ratio  $\alpha_c = \sqrt{k_w/k_f N u_{\infty}}$ , for which N = 1,  $\eta = 0.76$ , and hence

$$\theta_{\rm conv} \bigg|_{\substack{w_w = w_c \\ \alpha = \alpha_c}} = 1.31\, \theta_{\rm min} = \frac{1.31}{LWk_{\rm eff}} w_c. \tag{4}$$
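The numbers attached to the practical aspect ratio can be reproduced as follows. This is a sketch under stated assumptions: $k_f = 0.006$ W/°C-cm for water is not given in the text, while $\mathrm{Nu}_{\infty} = 6$ is the value the authors report using for their design.

```python
import math

K_W = 1.48    # W/(degC*cm), silicon (from the text)
K_F = 0.006   # W/(degC*cm), water; assumed
NU_INF = 6.0  # value used by the authors for their design

# Practical aspect ratio: alpha_c = sqrt(k_w / (k_f * Nu_inf)).
alpha_c = math.sqrt(K_W / (K_F * NU_INF))

# Equation 2 with w_w = w_c reduces to N = sqrt(Nu_inf * k_f / k_w) * alpha.
n_at_alpha_c = math.sqrt(NU_INF * K_F / K_W) * alpha_c

eta = math.tanh(n_at_alpha_c) / n_at_alpha_c
penalty = 1.0 / eta  # theta_conv / theta_min at alpha = alpha_c

print(f"alpha_c = {alpha_c:.2f}, N = {n_at_alpha_c:.2f}")
print(f"eta = {eta:.2f}, theta_conv / theta_min = {penalty:.2f}")
```

Choosing $\alpha_c$ therefore costs about 31% relative to the infinite-aspect-ratio limit while keeping the channel depth modest.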

Further increases in $\alpha$ would provide only small reductions in $\theta_{conv}$ as it approaches $\theta_{min}$. Referring to equations 3 and 4 and setting $\alpha = \alpha_c$, we see that $\theta_{heat}$ varies as $w_c^{-3}$ and $\theta_{conv}$ varies as $w_c$, hence an optimum channel width exists which minimizes their sum, $\theta$:

$$w_c = 2.29 \sqrt[4]{\frac{\mu k_f L^2 \mathrm{Nu}_{\infty}}{\rho C_p P}},$$

for which

$$\theta = \frac{4}{3}\theta_{\rm conv} = \frac{8.01}{WL^{1/2}} \sqrt[4]{\frac{\mu}{k_f k_w^2 \rho C_p P\, \mathrm{Nu}_{\infty}}}.$$
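The optimum quoted above can be checked with a short derivation. As a sketch (the shorthand $A$ and $B$ is introduced here for convenience and does not appear in the original), write equations 4 and 3 at $w_w = w_c$ and $\alpha = \alpha_c$ as $\theta_{\rm conv} = A w_c$ and $\theta_{\rm heat} = B w_c^{-3}$:

$$
\begin{aligned}
A &= \frac{1.31}{LWk_{\rm eff}}, \qquad B = \frac{24\mu L}{\rho C_p P W \alpha_c}, \\
\frac{d\theta}{dw_c} &= A - 3Bw_c^{-4} = 0 \;\Rightarrow\; w_c^4 = \frac{3B}{A} = \frac{72}{1.31}\,\frac{\mu L^2}{\rho C_p P}\,\frac{k_{\rm eff}}{\alpha_c}, \\
\frac{k_{\rm eff}}{\alpha_c} &= \tfrac{1}{2}\mathrm{Nu}_{\infty} k_f \;\Rightarrow\; w_c = \left(\frac{36}{1.31}\right)^{1/4} \left(\frac{\mu k_f L^2 \mathrm{Nu}_{\infty}}{\rho C_p P}\right)^{1/4} \approx 2.29 \left(\frac{\mu k_f L^2 \mathrm{Nu}_{\infty}}{\rho C_p P}\right)^{1/4}, \\
\theta_{\rm heat} &= Bw_c^{-3} = \tfrac{1}{3}Aw_c = \tfrac{1}{3}\theta_{\rm conv} \;\Rightarrow\; \theta = \tfrac{4}{3}\theta_{\rm conv}.
\end{aligned}
$$

At the optimum, fluid heating therefore accounts for one quarter of the total thermal resistance.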

For a water-cooled silicon heat sink on a (1 cm) × (1 cm) substrate and a water pressure of $P = 30$ psi $= 2.07 \times 10^6$ dynes/cm<sup>2</sup>, our design procedure gives:

- $w_c = w_w = 57\ \mu$m;
- $\alpha_c = 6.4$, so $z = 365\ \mu$m, which conveniently is a typical IC silicon wafer thickness!
- $\theta = 0.086^{\circ}$C/W at $f = 11$ cm<sup>3</sup>/s.

(We have used  $Nu_{\infty} = 6$ , which is about right for this aspect ratio). Note that  $L/(D \cdot \text{Re} \cdot \text{Pr}) = 0.018$  and Re = 730 for our design, so our assumptions were self-consistent (laminar flow and an almost fully-developed temperature profile).
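The whole design procedure can be replayed numerically. The sketch below is not the authors' code; it evaluates the formulas of this section in cgs units with two assumed water properties that the text does not quote ($\mu = 0.0093$ poise and $k_f = 0.006$ W/°C-cm near 23°C) and an assumed density of 1 g/cm<sup>3</sup>. Because $\mu/P$ in cgs units is a time in seconds, it combines consistently with the watt-based conductivities.

```python
PSI_TO_DYN_PER_CM2 = 6.895e4  # 1 psi expressed in dyn/cm^2

MU = 0.0093    # poise; assumed viscosity of water near 23 degC
K_F = 0.006    # W/(degC*cm); assumed conductivity of water
RHO = 1.0      # g/cm^3; assumed density of water
K_W = 1.48     # W/(degC*cm), silicon (from the text)
RHO_CP = 4.18  # J/(degC*cm^3) (from the text)
PR = 6.4       # Prandtl number of water at 23 degC (from the text)
NU_INF = 6.0   # value used by the authors
P = 30.0 * PSI_TO_DYN_PER_CM2
L = W = 1.0    # cm

# Optimum channel width and practical aspect ratio.
w_c = 2.29 * (MU * K_F * L**2 * NU_INF / (RHO_CP * P)) ** 0.25
alpha_c = (K_W / (K_F * NU_INF)) ** 0.5
z = alpha_c * w_c  # from alpha = 2 z / (w_c + w_w) with w_w = w_c

# Total thermal resistance at the optimum.
theta = 8.01 / (W * L**0.5) * (MU / (K_F * K_W**2 * RHO_CP * P * NU_INF)) ** 0.25

# Flow quantities and self-consistency checks.
v = w_c**2 * P / (12.0 * MU * L)
f = 0.5 * v * W * w_c * alpha_c
d = 2.0 * w_c
reynolds = v * d * RHO / MU
x_star = L / (d * reynolds * PR)

print(f"w_c = {w_c * 1e4:.0f} um, alpha_c = {alpha_c:.1f}, z = {z * 1e4:.0f} um")
print(f"theta = {theta:.3f} degC/W at f = {f:.1f} cm^3/s")
print(f"Re = {reynolds:.0f}, L/(D Re Pr) = {x_star:.3f}")
```

With these inputs the results fall close to the quoted design values; small differences (for example in Re) presumably reflect the exact fluid properties the authors used.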

## EXPERIMENTS

Using the preceding parameters as guidelines, we have fabricated and tested several high-performance heat sinks. In a series of experiments, 50-$\mu$m wide channels with 50-$\mu$m wide walls were etched vertically using KOH (an orientation-dependent etch) [8] to a depth of about 300 $\mu$m in $\langle 110 \rangle$ silicon wafers of thickness 400 $\mu$m. A Pyrex cover plate was anodically bonded [9] over the channels and over a pair of etched manifolds at the ends of the channel array. Deionized water at approximately 23°C was fed into the input manifold through a hole in the cover plate at pressures up to 31 psi, and drained from the output manifold through a similar hole. Heat was supplied by a thin-film WSi<sub>2</sub> resistor approximately (1 cm) × (1 cm) in area and 1 $\mu$m thick, which was sputtered onto the thermally-oxidized front surface of the wafer. Thermocouples monitored the temperature of the input and output water and the heater resistor (the latter was measured near the downstream end, where the temperature is highest). We confirmed that the flow rate obeyed Poiseuille's equation and that the thermal resistance was independent of power level. Table 1 summarizes the results obtained for three different heat sinks having similar parameters; all had maximum (downstream) thermal resistances of about $0.1^{\circ}$C/W for a 1 cm<sup>2</sup> area, as expected. One device was tested to 790 W/cm<sup>2</sup>.

Fig. 2. Measured values of maximum (downstream) thermal resistance $\Theta$ as a function of inverse flow rate $1/f$ for heat sink no. 3 of Table 1. As predicted, the data fall on a straight line, implying fully-developed temperature profiles.

A further confirmation of the theory was obtained by examining the dependence of the maximum thermal resistance on water flow rate $f$ (cm<sup>3</sup>/s). $\theta_{\text{cond}}$, the thermal resistance due to conduction from the front of the wafer to the channel region, is clearly independent of $f$. The same is true for $\theta_{\text{conv}}$, provided we have the predicted fully-developed temperature profile. $\theta_{\text{heat}}$ will be inversely proportional to the flow rate. Thus a plot of $\theta = \theta_{\text{cond}} + \theta_{\text{conv}} + \theta_{\text{heat}}$ vs. $f^{-1}$ should yield a straight line, and experimentally this was indeed the case (Fig. 2).
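To put numbers on the flow-dependent term, the sketch below evaluates $\theta_{heat} = 1/\rho C_p f$ at the three flow rates listed in Table 1 and subtracts it from the tabulated $\Theta_{\max}$. The remainder is only an estimate of the flow-independent part ($\theta_{cond} + \theta_{conv}$) under the model above; it is not a quantity reported by the authors.

```python
RHO_CP_WATER = 4.18  # J/(degC*cm^3), from the text

# (experiment, flow rate in cm^3/s, measured theta_max in degC/W) from Table 1.
table_1 = [(1, 4.7, 0.110), (2, 6.5, 0.113), (3, 8.6, 0.090)]


def theta_heat(flow_cm3_per_s, rho_cp=RHO_CP_WATER):
    """Flow-dependent thermal resistance, in degC/W."""
    return 1.0 / (rho_cp * flow_cm3_per_s)


rows = []
for expt, flow, theta_max in table_1:
    th = theta_heat(flow)
    rows.append((expt, th, theta_max - th))
    print(f"expt {expt}: theta_heat = {th:.3f}, remainder = {theta_max - th:.3f} degC/W")

# Predicted slope of theta versus 1/f, in degC*cm^3/(W*s).
slope = 1.0 / RHO_CP_WATER
print(f"predicted slope = {slope:.3f}")
```

In this model the straight line of $\theta$ against $1/f$ has slope $1/\rho C_p$, and its intercept is the flow-independent part.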

Although a uniform thin-film resistor was used as a heat source in our experiments, in an actual IC the heat is generated in localized areas such as p-n junctions. This will result in an extra contribution to  $\theta$  due to thermal spreading resistance.

This term would be exceedingly small ( $\ll 0.01^{\circ}C/W$ ) in a VLSI circuit consisting of thousands of uniformly-distributed devices, but it may be important in specialized ICs consisting of only a few localized high-power heat sources.

The dramatic (forty-fold) improvement in practical, compact IC heat-sinking capability presented here offers a new degree of freedom for the system designer. For example, speed-power tradeoffs can now be resolved in favor of more speed, and in particular ECL circuitry may now be a more attractive candidate for high-speed VLSI. The low thermal resistance may also be useful for moderate-power ICs where the temperatures of different components must match closely or be held close to the coolant temperature. The incorporation of this very compact integral heat sink into a conventional IC package is relatively straightforward.

## ACKNOWLEDGMENTS

We would like to thank K. Bean of Texas Instruments, Inc. and P. Barth, J. Beaudouin, W. Kays, J. Plummer, K. Saraswat, J. Shott, and R. Swanson of Stanford University for their help. One of us (D.B.T.) was supported by the Fannie and John Hertz Foundation. This work was partially supported by the Joint Services Electronics Program.

## REFERENCES

- [1] R. K. Watts, W. Fichtner, E. N. Fuls, L. R. Thibault, and R. L. Johnston, "Electron Beam Lithography for Small MOSFETs," *IEDM Technical Digest*, pp. 772-775, 1980.
- [2] "Logic Packaging in the IBM 3081," *Electronic News*, p. 47, vol.17 Nov. 1980.
- [3] W. Anacker, "Liquid Cooling of Integrated Circuit Chips," *IBM Tech. Disclosure Bulletin*, vol. 20, pp. 3742-3743, 1978.
- [4] R. W. Keyes, "Physical Limits in Digital Electronics," *Proc. IEEE*, vol. 63, pp. 740-767, May 1975; "Fundamental Limits in Digital Information Processing," *Proc. IEEE*, vol. 69, pp. 267-278, Feb. 1981.
- [5] C. Y. Ho, R. W. Powell, and P. E. Liley, *J. Phys. Chem. Ref. Data*, vol. 3, Suppl. 1, I-588, 1974.
- [6] W. M. Kays and M. E. Crawford, *Convective Heat and Mass Transfer*. New York: McGraw-Hill, 1980, ch. 8.
- [7] W. M. Kays and A. L. London, *Compact Heat Exchangers*, 2nd ed. New York: McGraw-Hill, 1964, p. 14.
- [8] K. Bean, "Anisotropic Etching of Silicon," *IEEE Trans. Electron Devices*, vol. ED-25, no. 10, pp. 1185-1193, Oct. 1978.
- [9] G. Wallis and D. I. Pomerantz, "Field-Assisted Glass-Metal Sealing," *J. Appl. Phys.*, vol. 40, no. 10, pp. 3946-3949, Oct. 1969.