Fourier amplitude sensitivity testing

From HandWiki

Fourier amplitude sensitivity testing (FAST) is a variance-based global sensitivity analysis method. The sensitivity value is defined based on conditional variances which indicate the individual or joint effects of the uncertain inputs on the output. FAST first represents conditional variances via coefficients from the multiple Fourier series expansion of the output function. Then the ergodic theorem is applied to transform the multi-dimensional integral to a one-dimensional integral in evaluation of the Fourier coefficients. A set of incommensurate frequencies is required to perform the transform and most frequencies are irrational. To facilitate computation a set of integer frequencies is selected instead of the irrational frequencies. The integer frequencies are not strictly incommensurate, resulting in an error between the multi-dimensional integral and the transformed one-dimensional integral. However, the integer frequencies can be selected to be incommensurate to any order so that the error can be controlled meeting any precision requirement in theory. Using integer frequencies in the integral transform, the resulted function in the one-dimensional integral is periodic and the integral only needs to evaluate in a single period. Next, since the continuous integral function can be recovered from a set of finite sampling points if the Nyquist–Shannon sampling theorem is satisfied, the one-dimensional integral is evaluated from the summation of function values at the generated sampling points.

FAST is more efficient to calculate sensitivities than other variance-based global sensitivity analysis methods via Monte Carlo integration. However the calculation by FAST is usually limited to sensitivities referred to as “main effects” or “first-order effects” due to the computational complexity in computing higher-order effects.

History

The FAST method originated in study of coupled chemical reaction systems in 1973[1][2] and the detailed analysis of the computational error was presented latter in 1975.[3] Only the first order sensitivity indices referring to “main effect” were calculated in the original method. A FORTRAN computer program capable of analyzing either algebraic or differential equation systems was published in 1982.[4] In 1990s, the relationship between FAST sensitivity indices and Sobol’s ones calculated from Monte-Carlo simulation was revealed in the general framework of ANOVA-like decomposition [5] and an extended FAST method able to calculate sensitivity indices referring to “total effect” was developed.[6]

Foundation

Variance-based sensitivity

Main page: Variance-based sensitivity analysis

Sensitivity indices of a variance-based method are calculated via ANOVA-like decomposition of the function for analysis. Suppose the function is [math]\displaystyle{ Y = f\left(\mathbf{X}\right)=f\left(X_1,X_2,\dots,X_n\right) }[/math] where [math]\displaystyle{ 0 \leq X_j \leq 1, j=1, \dots, n }[/math]. The ANOVA-like decomposition is

[math]\displaystyle{ f\left(X_1,X_2,\ldots,X_n\right)=f_0+\sum_{j=1}^nf_j\left(X_j\right)+\sum_{j=1}^{n-1}\sum_{k=j+1}^n f_{jk}\left(X_j,X_k\right)+ \cdots +f_{12 \dots n} }[/math]

provided that [math]\displaystyle{ f_0 }[/math] is a constant and the integral of each term in the sums is zero, i.e.

[math]\displaystyle{ \int_0^1 f_{j_1 j_2 \dots j_r}\left(X_{j_1},X_{j_2},\dots,X_{j_r}\right)dX_{j_k}=0, \text{ } 1 \leq k \leq r. }[/math]

The conditional variance which characterizes the contribution of each term to the total variance of [math]\displaystyle{ f\left(\mathbf{X}\right) }[/math] is

[math]\displaystyle{ V_{j_1 j_2 \dots j_r}=\int_0^1 \cdots \int_0^1 f_{j_1 j_2 \dots j_r}^2\left(X_{j_1},X_{j_2},\dots,X_{j_r}\right)dX_{j_1}dX_{j_2}\dots dX_{j_r}. }[/math]

The total variance is the sum of all conditional variances

[math]\displaystyle{ V = \sum_{j=1}^n V_j + \sum_{j=1}^{n-1} \sum_{k=j+1}^n V_{jk} + \cdots + V_{12\dots n}. }[/math]

The sensitivity index is defined as the normalized conditional variance as

[math]\displaystyle{ S_{j_1 j_2 \dots j_r} = \frac{V_{j_1 j_2 \dots j_r}}{V} }[/math]

especially the first order sensitivity

[math]\displaystyle{ S_j=\frac{V_j}{V} }[/math]

which indicates the main effect of the input [math]\displaystyle{ X_j }[/math].

Multiple Fourier series

One way to calculate the ANOVA-like decomposition is based on multiple Fourier series. The function [math]\displaystyle{ f\left(\mathbf{X}\right) }[/math] in the unit hyper-cube can be extended to a multiply periodic function and the multiple Fourier series expansion is

[math]\displaystyle{ f\left(X_1,X_2,\dots,X_n\right) = \sum_{m_1=-\infty}^{\infty} \sum_{m_2=-\infty}^{\infty} \cdots \sum_{m_n=-\infty}^{\infty} C_{m_1m_2...m_n}\exp\bigl[2\pi i\left( m_1X_1 + m_2X_2 + \cdots + m_nX_n \right) \bigr], \text{ for integers }m_1, m_2, \dots, m_n }[/math]

where the Fourier coefficient is

[math]\displaystyle{ C_{m_1m_2...m_n} = \int_0^1 \cdots \int_0^1 f\left(X_1,X_2,\dots,X_n\right) \exp\bigl[-2\pi i \left( m_1X_1+m_2X_2+\dots+m_nX_n \right) \bigr]. }[/math]

The ANOVA-like decomposition is

[math]\displaystyle{ \begin{align} f_0 &= C_{00 \dots 0} \\ f_j &= \sum_{m_j \neq 0} C_{0 \dots m_j \dots 0} \exp\bigl[2\pi i m_jX_j \bigr] \\ f_{jk} &= \sum_{m_j \neq 0} \sum_{m_k \neq 0} C_{0 \dots m_j \dots m_k \dots 0} \exp\bigl[2\pi i \left( m_jX_j + m_kX_k \right) \bigr] \\ f_{12 \dots n} &= \sum_{m_1 \neq 0} \sum_{m_2 \neq 0} \cdots \sum_{m_n \neq 0} C_{m_1 m_2 \dots m_n} \exp\bigl[ 2\pi i \left( m_1X_1+m_2X_2+\cdots+m_nX_n \right) \bigr]. \end{align} }[/math]

The first order conditional variance is

[math]\displaystyle{ \begin{align} V_j &= \int_0^1 f_j^2\left(X_j\right)dX_j\\ &= \sum_{ m_j \neq 0 } \left| C_{0 \dots m_j \dots 0} \right|^2\\ &= 2\sum_{m_j=1}^{\infty} \left( A_{m_j}^2+B_{m_j}^2 \right) \end{align} }[/math]

where [math]\displaystyle{ A_{m_j} }[/math] and [math]\displaystyle{ B_{m_j} }[/math] are the real and imaginary part of [math]\displaystyle{ C_{0 \dots m_j \dots 0} }[/math] respectively

[math]\displaystyle{ A_{m_j} = \int_0^1 \cdots \int_0^1 f \left(X_1, X_2, \dots, X_n\right) \cos\left(2\pi m_jX_j\right)dX_1dX_2 \dots dX_n }[/math]

[math]\displaystyle{ B_{m_j} = \int_0^1 \cdots \int_0^1 f \left(X_1, X_2, \dots, X_n\right) \sin\left(2\pi m_jX_j\right)dX_1dX_2 \dots dX_n }[/math]

Ergodic theorem

A multi-dimensional integral must be evaluated in order to calculate the Fourier coefficients. One way to evaluate this multi-dimensional integral is to transform it into a one-dimensional integral by expressing every input as a function of a new independent variable [math]\displaystyle{ s }[/math], as follows

[math]\displaystyle{ X_j \left( s \right) = \frac{1}{2\pi}\left(\omega_j s \text{ mod } 2\pi \right), j = 1,2,\dots,n }[/math]

where [math]\displaystyle{ \left\{\omega_j\right\} }[/math] is a set of incommensurate frequencies, i.e.

[math]\displaystyle{ \sum_{j=1}^n \gamma_j\omega_j = 0 }[/math]

for an integer set of [math]\displaystyle{ \left\{\gamma_j\right\} }[/math] if and only if [math]\displaystyle{ \gamma_j = 0 }[/math] for every [math]\displaystyle{ j }[/math]. Then the Fourier coefficients can be calculated by a one-dimensional integral according to the ergodic theorem [7]

[math]\displaystyle{ A_{m_j} = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^T f\bigl(X_1\left(s\right),X_2\left(s\right),\dots,X_n\left(s\right)\bigr)\cos\bigl(2\pi m_jX_j\left(s\right)\bigr)ds }[/math]

[math]\displaystyle{ B_{m_j} = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^T f\bigl(X_1\left(s\right),X_2\left(s\right),\dots,X_n\left(s\right)\bigr)\sin\bigl(2\pi m_jX_j\left(s\right)\bigr)ds }[/math]

Implementation

Integer frequencies

At most one of the incommensurate frequencies [math]\displaystyle{ \left\{\omega_j\right\} }[/math] can be rational with all others being irrational. Since the numerical value of an irrational number cannot be stored exactly in a computer, an approximation of the incommensurate frequencies by all rational numbers is required in implementation. Without loss of any generality the frequencies can be set as integers instead of any rational numbers. A set of integers [math]\displaystyle{ \left\{\omega_j\right\} }[/math] is approximately incommensurate to the order of [math]\displaystyle{ M }[/math] if

[math]\displaystyle{ \sum_{j=1}^n \gamma_j\omega_j \neq 0 }[/math]

for

[math]\displaystyle{ \sum_{j=1}^n \left| \gamma_j \right| \leq M + 1 }[/math]

where [math]\displaystyle{ M }[/math] is an integer. The exact incommensurate condition is an extreme case when [math]\displaystyle{ M \to \infty }[/math].

Using the integer frequencies the function in the transformed one-dimensional integral is periodic so only the integration over a period of [math]\displaystyle{ 2\pi }[/math] is required. The Fourier coefficients can be approximately calculated as

[math]\displaystyle{ \begin{align} A_{m_j} &\approx \frac{1}{2\pi} \int_{-\pi}^{\pi} f\bigl(X_1\left(s\right),X_2\left(s\right),\dots,X_n\left(s\right)\bigr)\cos\left(m_j\omega_j s\right)ds := \hat{A}_{m_j}\\ B_{m_j} &\approx \frac{1}{2\pi} \int_{-\pi}^{\pi} f\bigl(X_1\left(s\right),X_2\left(s\right),\dots,X_n\left(s\right)\bigr)\sin\left(m_j\omega_j s\right)ds := \hat{B}_{m_j} \end{align} }[/math]

The approximation of the incommensurate frequencies for a finite [math]\displaystyle{ M }[/math] results in a discrepancy error between the true Fourier coefficients [math]\displaystyle{ A_{m_j} }[/math], [math]\displaystyle{ B_{m_j} }[/math] and their estimates [math]\displaystyle{ \hat{A}_{m_j} }[/math], [math]\displaystyle{ \hat{B}_{m_j} }[/math]. The larger the order [math]\displaystyle{ M }[/math] is the smaller the error is but the more computational efforts are required to calculate the estimates in the following procedure. In practice [math]\displaystyle{ M }[/math] is frequently set to 4 and a table of resulted frequency sets which have up to 50 frequencies is available. (McRae et al., 1982)

Search curve

The transform, [math]\displaystyle{ X_j \left( s \right) = \frac{1}{2\pi}\left(\omega_j s \text{ mod } 2\pi \right) }[/math], defines a search curve in the input space. If the frequencies, [math]\displaystyle{ \omega_j, j = 1,\dots,n }[/math], are incommensurate, the search curve can pass through every point in the input space as [math]\displaystyle{ s }[/math] varies from 0 to [math]\displaystyle{ \infty }[/math] so the multi-dimensional integral over the input space can be accurately transformed to a one-dimensional integral along the search curve. However, if the frequencies are approximately incommensurate integers, the search curve cannot pass through every point in the input space. If fact the search is repeated since the transform function is periodic, with a period of [math]\displaystyle{ 2\pi }[/math]. The one-dimensional integral can be evaluated over a single period instead of the infinite interval for incommensurate frequencies; However, a computational error arises due to the approximation of the incommensuracy.

Sampling

The approximated Fourier can be further expressed as

[math]\displaystyle{ \hat{A}_{m_j}= \begin{cases} 0 & m_j \text{ odd} \\ \frac{1}{\pi}\int_{-\pi/2}^{\pi/2}f\bigl(\mathbf X(s)\bigr)\cos\left(m_j\omega_js\right)ds & m_j \text{ even} \end{cases} }[/math]

and

[math]\displaystyle{ \hat{B}_{m_j}= \begin{cases} \frac{1}{\pi}\int_{-\pi/2}^{\pi/2}f\bigl(\mathbf X(s)\bigr)\sin\left(m_j\omega_js\right)ds & m_j \text{ odd} \\ 0 & m_j \text{ even} \end{cases} }[/math]

The non-zero integrals can be calculated from sampling points

[math]\displaystyle{ \begin{align} \hat{A}_{m_j} &= \frac{1}{2q+1}\sum_{k=-q}^q f\bigl(\mathbf X(s_k)\bigr)\cos\left( m_j \omega_j s_k\right), m_j \text{ even}\\ \hat{B}_{m_j} &= \frac{1}{2q+1}\sum_{k=-q}^q f\bigl(\mathbf X(s_k)\bigr)\sin\left( m_j \omega_j s_k\right), m_j \text{ odd } \end{align} }[/math]

where the uniform sampling point in [math]\displaystyle{ \left[-\pi/2, \pi/2\right] }[/math] is

[math]\displaystyle{ s_k = \frac{\pi k}{2q+1}, k=-q,\dots,-1,0,1,\dots,q. }[/math]

The total number of sampling points is [math]\displaystyle{ 2q+1 }[/math] which should satisfy the Nyquist sampling criterion, i.e.

[math]\displaystyle{ 2q+1 \geq N\omega_{max}+1 }[/math]

where [math]\displaystyle{ \omega_{max} }[/math] is the largest frequency in [math]\displaystyle{ \left\{\omega_k\right\} }[/math] and [math]\displaystyle{ N }[/math] is the maximum order of the calculated Fourier coefficients.

Partial sum

After calculating the estimated Fourier coefficients, the first order conditional variance can be approximated by

[math]\displaystyle{ \begin{align} V_j &= 2\sum_{m_j=1}^{\infty} \left( A_{m_j}^2+B_{m_j}^2 \right) \\ &\approx 2\sum_{m_j=1}^{\infty} \left( \hat{A}_{m_j}^2+\hat{B}_{m_j}^2 \right) \\ &\approx 2\sum_{m_j=1}^{2} \left( \hat{A}_{m_j}^2+\hat{B}_{m_j}^2 \right) \\ &= 2\left( \hat{A}_{m_j=2}^2 + \hat{B}_{m_j=1}^2 \right) \end{align} }[/math]

where only the partial sum of the first two terms is calculated and [math]\displaystyle{ N=2 }[/math] for determining the number of sampling points. Using the partial sum can usually return an adequately good approximation of the total sum since the terms corresponding to the fundamental frequency and low order frequencies usually contribute most to the total sum. Additionally, the Fourier coefficient in the summation are just an estimate of the true value and adding more higher order terms will not help improve the computational accuracy significantly. Since the integer frequencies are not exactly incommensurate there are two integers [math]\displaystyle{ m_j }[/math] and [math]\displaystyle{ m_k }[/math] such that [math]\displaystyle{ m_j\omega_j = m_k\omega_k. }[/math] Interference between the two frequencies can occur if higher order terms are included in the summation.

Similarly the total variance of [math]\displaystyle{ f\left( \mathbf X \right) }[/math] can be calculated as

[math]\displaystyle{ V \approx \hat{A}_0\left[ f^2 \right] - \hat{A}_0\left[ f \right]^2 }[/math]

where [math]\displaystyle{ \hat{A}_0\left[ f^2 \right] }[/math] denotes the estimated Fourier coefficient of the function of [math]\displaystyle{ f^2 }[/math] inside the bracket and [math]\displaystyle{ \hat{A}_0\left[ f \right]^2 }[/math] is the squared Fourier coefficient of the function [math]\displaystyle{ f }[/math]. Finally the sensitivity referring to the main effect of an input can be calculated by dividing the conditional variance by the total variance.

References

  1. Cukier, R.I., C.M. Fortuin, K.E. Shuler, A.G. Petschek and J.H. Schaibly (1973). Study of the sensitivity of coupled reaction systems to uncertainties in rate coefficients. I Theory. Journal of Chemical Physics, 59, 3873–3878.
  2. Schaibly, J.H. and K.E. Shuler (1973). Study of the sensitivity of coupled reaction systems to uncertainties in rate coefficients. II Applications. Journal of Chemical Physics, 59, 3879–3888.
  3. Cukier, R.I., J.H. Schaibly, and K.E. Shuler (1975). Study of the sensitivity of coupled reaction systems to uncertainties in rate coefficients. III. Analysis of the approximations. Journal of Chemical Physics, 63, 1140–1149.
  4. McRae, G.J., J.W. Tilden and J.H. Seinfeld (1982). Global sensitivity analysis—a computational implementation of the Fourier Amplitude Sensitivity Test (FAST). Computers & Chemical Engineering, 6, 15–25.
  5. Archer G.E.B., A. Saltelli and I.M. Sobol (1997). Sensitivity measures, ANOVA-like techniques and the use of bootstrap. Journal of Statistical Computation and Simulation, 58, 99–120.
  6. Saltelli A., S. Tarantola and K.P.S. Chan (1999). A quantitative model-independent method for global sensitivity analysis of model output. Technometrics, 41, 39–56.
  7. Weyl, H. (1938). Mean motion. American Journal of Mathematics, 60, 889–896.