Engineering:Physics of failure

From HandWiki

Physics of failure is a technique under the practice of reliability design that leverages the knowledge and understanding of the processes and mechanisms that induce failure to predict reliability and improve product performance. Other definitions of Physics of Failure include:

  • A science-based approach to reliability that uses modeling and simulation to design-in reliability. It helps to understand system performance and reduce decision risk during design and after the equipment is fielded. This approach models the root causes of failure such as fatigue, fracture, wear, and corrosion.
  • An approach to the design and development of reliable product to prevent failure, based on the knowledge of root cause failure mechanisms. The Physics of Failure (PoF) concept is based on the understanding of the relationships between requirements and the physical characteristics of the product and their variation in the manufacturing processes, and the reaction of product elements and materials to loads (stressors) and interaction under loads and their influence on the fitness for use with respect to the use conditions and time.[1]

Overview

The concept of Physics of Failure, also known as Reliability Physics, involves the use of degradation algorithms that describe how physical, chemical, mechanical, thermal, or electrical mechanisms evolve over time and eventually induce failure. While the concept of Physics of Failure is common in many structural fields,[2] the specific branding evolved from an attempt to better predict the reliability of early generation electronic parts and systems.

The beginning

Within the electronics industry, the major driver for the implementation of Physics of Failure was the poor performance of military weapon systems during World War II.[3] During the subsequent decade, the United States Department of Defense funded an extensive amount of effort to especially improve the reliability of electronics,[4] with the initial efforts focused on after-the-fact or statistical methodology.[5] Unfortunately, the rapid evolution of electronics, with new designs, new materials, and new manufacturing processes, tended to quickly negate approaches and predictions derived from older technology. In addition, the statistical approach tended to lead to expensive and time-consuming testing. The need for different approaches led to the birth of Physics of Failure at the Rome Air Development Center (RADC).[6] Under the auspices of the RADC, the first Physics of Failure in Electronics Symposium was held in September 1962.[7] The goal of the program was to relate the fundamental physical and chemical behavior of materials to reliability parameters.[8]

Early history – integrated circuits

The initial focus of physics of failure techniques tended to be limited to degradation mechanisms in integrated circuits. This was primarily because the rapid evolution of the technology created a need to capture and predict performance several generations ahead of existing product.

One of the first major successes under predictive physics of failure was a formula[9] developed by James Black of Motorola to describe the behavior of electromigration. Electromigration occurs when collisions of electrons cause metal atoms in a conductor to dislodge and move downstream of current flow (proportional to current density). Black used this knowledge, in combination with experimental findings, to describe the failure rate due to electromigration as

[math]\displaystyle{ MTTF=A(J^{-n})e^{\frac{Ea}{kT}} }[/math]

where A is a constant based on the cross-sectional area of the interconnect, J is the current density, Ea is the activation energy (e.g. 0.7 eV for grain boundary diffusion in aluminum), k is the Boltzmann constant, T is the temperature and n is a scaling factor (usually set to 2 according to Black).

Physics of failure is typically designed to predict wearout, or an increasing failure rate, but this initial success by Black focused on predicting behavior during operational life, or a constant failure rate. This is because electromigration in traces can be designed out by following design rules, while electromigration at vias are primarily interfacial effects, which tend to be defect or process-driven.

Leveraging this success, additional physics-of-failure based algorithms have been derived for the three other major degradation mechanisms (time dependent dielectric breakdown [TDDB], hot carrier injection [HCI], and negative bias temperature instability [NBTI]) in modern integrated circuits (equations shown below). More recent work has attempted to aggregate these discrete algorithms into a system-level prediction.[10]

TDDB: τ = τo(T) exp[ G(T)/ εox][11] where τo(T) = 5.4*10-7 exp(-Ea / kT), G(T) = 120 + 5.8/kT, and εox is the permittivity.

HCI: λHCI = A3 exp(-β/VD)exp(-Ea / kT) [12] where λHCI is the failure rate of HCI, A3 is an empirical fitting parameter, β is an empirical fitting parameter, VD is the drain voltage, Ea is the activation energy of HCI, typically −0.2 to −0.1eV, k is Boltzmann's constant, and T is temperature in Kelvin.

NBTI: λ = A εoxm VTμp exp( -Ea / kT)[13] where A is determined empirically by normalizing the above equation, m = 2.9, VT is the thermal voltage, μP is the surface mobility constant, Ea is the activation energy of NBTI, k i s Boltzmann's constant, and T is the temperature in Kelvin

Next stage – Electronic packaging

The resources and successes with integrated circuits, and a review of some of the drivers of field failures, subsequently motivated the reliability physics community to initiate physics of failure investigations into package-level degradation mechanisms. An extensive amount of work was performed to develop algorithms that could accurately predict the reliability of interconnects. Specific interconnects of interest resided at 1st level (wire bonds, solder bumps, die attach), 2nd level (solder joints), and 3rd level (plated through holes).

Just as integrated circuit community had four major successes with physics of failure at the die-level, the component packaging community had four major successes arise from their work in the 1970s and 1980s. These were

Peck:[14] Predicts time to failure of wire bond / bond pad connections when exposed to elevated temperature / humidity

[math]\displaystyle{ TTF=A_0 (RH)^{-2.7} f(V) \exp\left(\frac{E_a}{K_B T}\right) }[/math]

where A is a constant, RH is the relative humidity, f(V) is a voltage function (often cited as voltage squared), Ea is the activation energy, KB is Boltzmann's constant, and T is temperature in Kelvin.

Engelmaier:[15] Predicts time to failure of solder joints exposed to temperature cycling

[math]\displaystyle{ N_f(50\%)=\frac{1}{2}\left[\frac{2\epsilon'_f}{\Delta D}\right]^{\frac{-1}{c} }\quad\Delta D(\text{leadless})=\left[\frac{F L_D \Delta(\alpha \Delta T)}{h}\right] }[/math]

where εf is a fatigue ductility coefficient, c is a time and temperature dependent constant, F is an empirical constant, LD is the distance from the neutral point, α is the coefficient of thermal expansion, ΔT is the change in temperature, and h is solder joint thickness.

Steinberg:[16] Predicts time to failure of solder joints exposed to vibration

[math]\displaystyle{ Z_0=\frac{9.8\times 3\sqrt{\pi/2\times PSD\times f_n\times Q} }{f_n^2}\quad Z_c=\frac{0.00022B}{chr\sqrt{L} } }[/math]

where Z is maximum displacement, PSD is the power spectral density (g2/Hz), fn is the natural frequency of the CCA, Q is transmissibility (assumed to be square root of natural frequency), Zc is the critical displacement (20 million cycles to failure), B is the length of PCB edge parallel to component located at the center of the board, c is a component packaging constant, h is PCB thickness, r is a relative position factor, and L is component length.

IPC-TR-579:[17] Predicts time to failure of plated through holes exposed to temperature cycling

[math]\displaystyle{ \sigma=\frac{(\alpha_E-\alpha_{Cu})\Delta T A_E E_E E_{Cu} }{A_E E_E+A_{Cu}E_{Cu} },\quad \text{for }\sigma\le S_Y }[/math]
[math]\displaystyle{ N_f^{-0.6}D_f^{0.75}+0.9\frac{S_u}{E} \left[ \frac{\exp(D_f)}{0.36}\right]^{0.1785\log\frac{10^5}{N_f} }-\Delta\epsilon=0 }[/math]

where a is coefficient of thermal expansion (CTE), T is temperature, E is elastic modules, h is board thickness, d is hole diameter, t is plating thickness, and E and Cu correspond to board and copper properties, respectively, Su being the ultimate tensile strength and Df being ductility of the plated copper, and De is the strain range.

Each of the equations above uses a combination of knowledge of the degradation mechanisms and test experience to develop first-order equations that allow the design or reliability engineer to be able to predict time to failure behavior based on information on the design architecture, materials, and environment.

Recent work

More recent work in the area of physics of failure has been focused on predicting the time to failure of new materials (i.e., lead-free solder,[18][19] high-K dielectric[20] ), software programs,[21] using the algorithms for prognostic purposes,[22] and integrating physics of failure predictions into system-level reliability calculations.[23]

Limitations

There are some limitations with the use of physics of failure in design assessments and reliability prediction. The first is physics of failure algorithms typically assume a ‘perfect design’. Attempting to understand the influence of defects can be challenging and often leads to Physics of Failure (PoF) predictions limited to end of life behavior (as opposed to infant mortality or useful operating life). In addition, some companies have so many use environments (think personal computers) that performing a PoF assessment for each potential combination of temperature / vibration / humidity / power cycling / etc. would be onerous and potentially of limited value.

See also

References

  1. JEDEC JEP148, April 2004, Reliability Qualification of Semiconductor Devices Based on Physics of Failure Risk and Opportunity Assessment
  2. http://www.iagtcommittee.com/downloads/08-3-1%20Prakash%20Patnaik%20-%20Life%20Evaluation%20and%20Extension%20Program.pdf, Gas Turbine Materials/Components Life Evaluation & Extension Programs, Dr. Prakash Patnaik, Director SMPL, National Research Council Canada, Institute for Aerospace Research, Ottawa, Canada, 21 October 2008
  3. http://theriac.org/DeskReference/PDFs/2011Q1/2011Q1-article2.pdf, A Short History of Reliability.
  4. R. Lusser, Unreliability of Electronics – Cause and Cure, Redstone Arsenal, Huntsville, AL, DTIC Document
  5. J. Spiegel and E.M. Bennett, Military System Reliability: Department of Defense Contributions, IRE Transactions on Reliability and Quality Control, Dec. 1960, Volume: RQC-9 Issue:3
  6. George H. Ebel, Reliability Physics in Electronics: A Historical View, IEEE TRANSACTIONS ON RELIABILITY, VOL 47, NO. 3-SP 1998 SEPTEMBER SP-379
  7. This would eventually evolve into the current International Reliability Physics of Symposium (IRPS)
  8. Vaccaro “Reliability and the physics of failure program at RADC”, Physics of Failure in Electronics, 1963, pp 4 -10; Spartan.
  9. James Black, Mass Transport of Aluminum by Momentum Exchange with Conducting Electrons, 6th Annual Reliability Physics Symposium, November 1967
  10. http://www.dfrsolutions.com/uploads/publications/ICWearout_Paper.pdf, E. Wyrwas, L. Condra, and A. Hava, Accurate Quantitative Physics-of-Failure Approach to Integrated Circuit Reliability, IPC APEX Expo, Las Vegas, NV, April 2011
  11. Schuegraf and Hu, "A Model for Gate Oxide Breakdown", IEEE Trans. Electron Dev., May 1994.
  12. Takeda, E. Suzuki, N. "An empirical model for device degradation due to Hot-Carrier Injection," IEEE Electron Device Letters, Vol 4, Num 4, 1983, p111-113.
  13. Chen, Y.F. Lin, M.H. Chou, C.H. Chang, W.C. Huang, S.C. Chang, Y.J. Fu, K.Y. "Negative Bias Temperature Instability (NBTI) in Deep Sub-micron p+-gate pMOSFETS," 2000 IRW Final Report, p98-101
  14. Peck, D.S.; "New concerns about integrated circuit reliability," Electron Devices, IEEE Transactions on, vol.26, no.1, pp. 38- 43, Jan 1979
  15. Engelmaier, W.; "Fatigue Life of Leadless Chip Carrier Solder Joints During Power Cycling," Components, Hybrids, and Manufacturing Technology, IEEE Transactions on, vol.6, no.3, pp. 232- 237, Sep 1983
  16. D. S. Steinberg, Vibration Analysis For Electronic Equipment, John Wiley & Sons Inc., New York, first ed.1973, second ed.1988, third ed. 2000
  17. IPC-TR-579, Round Robin Reliability Evaluation of Small Diameter Plated-Through Holes in Printed Wiring Boards, September 1988
  18. http://www.dfrsolutions.com/uploads/publications/2006_Blattau_IPC_working.pdf, N. Blattau and C. Hillman "An Engelmaier model for leadless ceramic chip devices with Pb-free solder", J. Reliab. Inf. Anal. Cntr., vol. First Quarter, p.7, 2007.
  19. O. Salmela, K. Andersson, A. Perttula, J. Sarkka and M. Tammenmaa "Modified Engelmaier's model taking account of different stress levels", Microelectron. Reliab., vol. 48, p.773, 2008
  20. Raghavan, N.; Prasad, K.; "Statistical outlook into the physics of failure for copper low-k intra-metal dielectric breakdown," Reliability Physics Symposium, 2009 IEEE International, vol., no., pp.819-824, 26–30 April 2009
  21. Bukowski, J.V.; Johnson, D.A.; Goble, W.M.; "Software-reliability feedback: a physics-of-failure approach," Reliability and Maintainability Symposium, 1992. Proceedings., Annual, vol., no., pp.285-289, 21-23 Jan 1992
  22. NASA.gov NASA Prognostic Center of Excellence
  23. http://www.dfrsolutions.com/uploads/publications/2010_01_RAMS_Paper.pdf, McLeish, J.G.; "Enhancing MIL-HDBK-217 reliability predictions with physics of failure methods," Reliability and Maintainability Symposium (RAMS), 2010 Proceedings - Annual, vol., no., pp.1-6, 25-28 Jan. 2010