Observability Gramian


In control theory, we may need to find out whether or not a system such as [math]\displaystyle{ \begin{array}{c} \dot{\boldsymbol{x}}(t)\boldsymbol{=Ax}(t)+\boldsymbol{Bu}(t)\\ \boldsymbol{y}(t)=\boldsymbol{Cx}(t)+\boldsymbol{Du}(t) \end{array} }[/math]

is observable, where [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math], [math]\displaystyle{ \boldsymbol{C} }[/math] and [math]\displaystyle{ \boldsymbol{D} }[/math] are, respectively, [math]\displaystyle{ n\times n }[/math], [math]\displaystyle{ n\times p }[/math], [math]\displaystyle{ q\times n }[/math] and [math]\displaystyle{ q\times p }[/math] matrices.

One of the many ways to do so is by using the Observability Gramian.

Observability in LTI Systems

Linear Time Invariant (LTI) Systems are those systems in which the parameters [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math], [math]\displaystyle{ \boldsymbol{C} }[/math] and [math]\displaystyle{ \boldsymbol{D} }[/math] are invariant with respect to time.

One can determine whether the LTI system is observable simply by examining the pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math]. The following statements are equivalent:

1. The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable.

2. The [math]\displaystyle{ n\times n }[/math] matrix

[math]\displaystyle{ \boldsymbol{W_{o}}(t)=\int_{0}^{t}e^{\boldsymbol{A}^{T}\tau}\boldsymbol{C}^{T}\boldsymbol{C}e^{\boldsymbol{A}\tau}d\tau }[/math]

is nonsingular for any [math]\displaystyle{ t\gt 0 }[/math].

3. The [math]\displaystyle{ nq\times n }[/math] observability matrix

[math]\displaystyle{ \left[\begin{array}{c} \boldsymbol{C}\\ \boldsymbol{CA}\\ \boldsymbol{CA}^{2}\\ \vdots\\ \boldsymbol{CA}^{n-1} \end{array}\right] }[/math]

has rank n.

4. The [math]\displaystyle{ (n + q)\times n }[/math] matrix

[math]\displaystyle{ \left[\begin{array}{c} \boldsymbol{A}\boldsymbol{-\lambda}\boldsymbol{I}\\ \boldsymbol{C} \end{array}\right] }[/math]

has full column rank at every eigenvalue [math]\displaystyle{ \lambda }[/math] of [math]\displaystyle{ \boldsymbol{A} }[/math].
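
Statements 3 and 4 translate directly into a few lines of code. The following is a minimal NumPy sketch of the rank test and the PBH eigenvalue test; the matrices A and C at the bottom are illustrative examples, not taken from the text.

import numpy as np

def observability_matrix(A: np.ndarray, C: np.ndarray) -> np.ndarray:
    """Stack C, CA, CA^2, ..., CA^(n-1) into the (n*q) x n observability matrix."""
    n = A.shape[0]
    blocks = [C]
    for _ in range(n - 1):
        blocks.append(blocks[-1] @ A)
    return np.vstack(blocks)

def is_observable_rank(A, C) -> bool:
    """Statement 3: (A, C) is observable iff the observability matrix has rank n."""
    return np.linalg.matrix_rank(observability_matrix(A, C)) == A.shape[0]

def is_observable_pbh(A, C, tol=1e-9) -> bool:
    """Statement 4 (PBH test): [A - lambda*I; C] has full column rank at every eigenvalue of A."""
    n = A.shape[0]
    for lam in np.linalg.eigvals(A):
        M = np.vstack([A - lam * np.eye(n), C])
        if np.linalg.matrix_rank(M, tol=tol) < n:
            return False
    return True

# Illustrative stable pair: both tests agree.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
C = np.array([[1.0, 0.0]])
print(is_observable_rank(A, C), is_observable_pbh(A, C))  # True True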

If, in addition, all eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have negative real parts ([math]\displaystyle{ \boldsymbol{A} }[/math] is stable), then the pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable if and only if the unique solution of

[math]\displaystyle{ \boldsymbol{A^{T}}\boldsymbol{W}_{o}+\boldsymbol{W}_{o}\boldsymbol{A}=-\boldsymbol{C^{T}C} }[/math]

is positive definite. This solution is called the Observability Gramian and can be expressed as

[math]\displaystyle{ \boldsymbol{W_{o}}=\int_{0}^{\infty}e^{\boldsymbol{A}^{T}\tau}\boldsymbol{C^{T}C}e^{\boldsymbol{A}\tau}d\tau }[/math]

In the following section we are going to take a closer look at the Observability Gramian.

Observability Gramian

The Observability Gramian can be found as the solution of the Lyapunov equation given by

[math]\displaystyle{ \boldsymbol{A^{T}}\boldsymbol{W}_{o}+\boldsymbol{W}_{o}\boldsymbol{A}=-\boldsymbol{C^{T}C} }[/math]

In fact, if we take

[math]\displaystyle{ \boldsymbol{W_{o}}=\int_{0}^{\infty}e^{\boldsymbol{A^{T}}\tau}\boldsymbol{C^{T}C}e^{\boldsymbol{A}\tau}d\tau }[/math]

as a candidate solution, we find that:

[math]\displaystyle{ \begin{array}{ccl} \boldsymbol{A^{T}}\boldsymbol{W}_{o}+\boldsymbol{W}_{o}\boldsymbol{A} & = & \int_{0}^{\infty}\boldsymbol{A^{T}}e^{\boldsymbol{A^{T}}\tau}\boldsymbol{C^{T}C}e^{\boldsymbol{A}\tau}d\tau+\int_{0}^{\infty}e^{\boldsymbol{A^{T}}\tau}\boldsymbol{C^{T}C}e^{\boldsymbol{A}\tau}\boldsymbol{A}\,d\tau\\ & = & \int_{0}^{\infty}\frac{d}{d\tau}\left(e^{\boldsymbol{A^{T}}\tau}\boldsymbol{C}^{T}\boldsymbol{C}e^{\boldsymbol{A}\tau}\right)d\tau=\left.e^{\boldsymbol{A^{T}}\tau}\boldsymbol{C}^{T}\boldsymbol{C}e^{\boldsymbol{A}\tau}\right|_{\tau=0}^{\infty}\\ & = & \boldsymbol{0}-\boldsymbol{C^{T}C}\\ & = & -\boldsymbol{C^{T}C} \end{array} }[/math]

where we used the fact that [math]\displaystyle{ e^{\boldsymbol{A}\tau}\rightarrow 0 }[/math] as [math]\displaystyle{ \tau\rightarrow\infty }[/math] for stable [math]\displaystyle{ \boldsymbol{A} }[/math] (all its eigenvalues have negative real parts). This shows that [math]\displaystyle{ \boldsymbol{W}_{o} }[/math] is indeed the solution of the Lyapunov equation under analysis.
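
As a hedged illustration, the sketch below computes the Observability Gramian of an arbitrary stable pair (A, C) by solving the Lyapunov equation with SciPy's solve_continuous_lyapunov and cross-checks it against a numerically truncated version of the defining integral; the example matrices are illustrative, not from the text.

import numpy as np
from scipy.linalg import solve_continuous_lyapunov, expm

A = np.array([[0.0, 1.0], [-2.0, -3.0]])   # stable: eigenvalues -1 and -2
C = np.array([[1.0, 0.0]])

# Lyapunov route: solve_continuous_lyapunov(a, q) solves a X + X a^H = q,
# so passing a = A^T and q = -C^T C yields A^T W_o + W_o A = -C^T C.
W_o = solve_continuous_lyapunov(A.T, -C.T @ C)

# Integral route: approximate \int_0^T e^{A^T t} C^T C e^{A t} dt by a midpoint
# rule, with a horizon long enough for the stable modes to decay.
dt, T = 1e-3, 20.0
W_int = np.zeros_like(W_o)
for t in np.arange(0.0, T, dt) + dt / 2:
    E = expm(A * t)
    W_int += E.T @ C.T @ C @ E * dt

print(np.allclose(W_o, W_int, atol=1e-5))  # True, up to quadrature/truncation error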

Properties

Since [math]\displaystyle{ \boldsymbol{C^{T}C} }[/math] is a symmetric matrix, so is [math]\displaystyle{ \boldsymbol{W}_{o} }[/math].

We can again use the fact that [math]\displaystyle{ \boldsymbol{A} }[/math] is stable (all its eigenvalues have negative real parts) to show that [math]\displaystyle{ \boldsymbol{W}_{o} }[/math] is unique. To prove this, suppose we have two different solutions of

[math]\displaystyle{ \boldsymbol{A^{T}}\boldsymbol{W}_{o}+\boldsymbol{W}_{o}\boldsymbol{A}=-\boldsymbol{C^{T}C} }[/math]

and they are given by [math]\displaystyle{ \boldsymbol{W}_{o1} }[/math] and [math]\displaystyle{ \boldsymbol{W}_{o2} }[/math]. Then we have:

[math]\displaystyle{ \boldsymbol{A^{T}}\boldsymbol{(W}_{o1}-\boldsymbol{W}_{o2})+\boldsymbol{(W}_{o1}-\boldsymbol{W}_{o2})\boldsymbol{A}=\boldsymbol{0} }[/math]

Multiplying by [math]\displaystyle{ e^{\boldsymbol{A^{T}}t} }[/math] on the left and by [math]\displaystyle{ e^{\boldsymbol{A}t} }[/math] on the right leads to

[math]\displaystyle{ e^{\boldsymbol{A^{T}}t}\left[\boldsymbol{A^{T}}(\boldsymbol{W}_{o1}-\boldsymbol{W}_{o2})+(\boldsymbol{W}_{o1}-\boldsymbol{W}_{o2})\boldsymbol{A}\right]e^{\boldsymbol{A}t}=\frac{d}{dt}\left[e^{\boldsymbol{A^{T}}t}(\boldsymbol{W}_{o1}-\boldsymbol{W}_{o2})e^{\boldsymbol{A}t}\right]=\boldsymbol{0} }[/math]

Integrating from [math]\displaystyle{ 0 }[/math] to [math]\displaystyle{ \infty }[/math]:

[math]\displaystyle{ \left.e^{\boldsymbol{A^{T}}t}(\boldsymbol{W}_{o1}-\boldsymbol{W}_{o2})e^{\boldsymbol{A}t}\right|_{t=0}^{\infty}=\boldsymbol{0} }[/math]

using the fact that [math]\displaystyle{ e^{\boldsymbol{A}t}\rightarrow0 }[/math] as [math]\displaystyle{ t\rightarrow\infty }[/math]:

[math]\displaystyle{ \boldsymbol{0}-(\boldsymbol{W}_{o1}-\boldsymbol{W}_{o2})=\boldsymbol{0} }[/math]

In other words, [math]\displaystyle{ \boldsymbol{W}_{o} }[/math] has to be unique.

Also, we can see that

[math]\displaystyle{ \boldsymbol{x^{T}W_{o}x}=\int_{0}^{\infty}\boldsymbol{x}^{T}e^{\boldsymbol{A^{T}}t}\boldsymbol{C^{T}C}e^{\boldsymbol{A}t}\boldsymbol{x}dt=\int_{0}^{\infty}\left\Vert \boldsymbol{Ce^{\boldsymbol{A}t}\boldsymbol{x}}\right\Vert _{2}^{2}dt }[/math]

is positive for any nonzero [math]\displaystyle{ \boldsymbol{x} }[/math] (assuming the non-degenerate case where [math]\displaystyle{ \boldsymbol{C}e^{\boldsymbol{A}t}\boldsymbol{x} }[/math] is not identically zero), and that makes [math]\displaystyle{ \boldsymbol{W}_{o} }[/math] a positive definite matrix.
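
Both properties derived in this section, symmetry and positive definiteness, are easy to verify numerically. Below is a minimal check that could be applied, for instance, to a Gramian computed as in the earlier sketch.

import numpy as np

def check_gramian(W_o: np.ndarray) -> None:
    """Verify that a candidate Observability Gramian is symmetric positive definite."""
    assert np.allclose(W_o, W_o.T), "W_o should be symmetric"
    # A symmetric matrix is positive definite iff it has a Cholesky factorization
    # (np.linalg.cholesky raises LinAlgError otherwise).
    np.linalg.cholesky(W_o)
    print("W_o is symmetric and positive definite")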

More properties of observable systems, as well as the proofs of the other statements equivalent to "The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable" presented in the section Observability in LTI Systems, can be found in [1].

Discrete Time Systems

For discrete-time systems of the form

[math]\displaystyle{ \begin{array}{c} \boldsymbol{x}[k+1]\boldsymbol{=Ax}[k]+\boldsymbol{Bu}[k]\\ \boldsymbol{y}[k]=\boldsymbol{Cx}[k]+\boldsymbol{Du}[k] \end{array} }[/math]

One can check that there are statements equivalent to "The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable", much like in the continuous-time case.

We are interested in the equivalence stating that, if the pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable and all the eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have magnitude less than [math]\displaystyle{ 1 }[/math] ([math]\displaystyle{ \boldsymbol{A} }[/math] is stable), then the unique solution of

[math]\displaystyle{ \boldsymbol{A^{T}}\boldsymbol{W}_{do}\boldsymbol{A}-W_{do}=-\boldsymbol{C^{T}C} }[/math]

is positive definite and given by

[math]\displaystyle{ \boldsymbol{W}_{do}=\sum_{m=0}^{\infty}(\boldsymbol{A}^{T})^{m}\boldsymbol{C}^{T}\boldsymbol{C}\boldsymbol{A}^{m} }[/math]

[math]\displaystyle{ \boldsymbol{W}_{do} }[/math] is called the discrete Observability Gramian. The correspondence with the continuous-time case is clear: if [math]\displaystyle{ \boldsymbol{W}_{do} }[/math] is positive definite and all eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have magnitude less than [math]\displaystyle{ 1 }[/math], then the pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{C}) }[/math] is observable. More properties and proofs can be found in [2].
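
The discrete-time computation mirrors the continuous-time one. Below is a hedged sketch, assuming an illustrative Schur-stable pair (A, C): it solves the discrete Lyapunov equation with SciPy and compares the result with a truncation of the series above.

import numpy as np
from scipy.linalg import solve_discrete_lyapunov

A = np.array([[0.5, 0.1], [0.0, 0.3]])   # Schur stable: eigenvalues 0.5 and 0.3
C = np.array([[1.0, 1.0]])

# solve_discrete_lyapunov(a, q) solves a X a^H - X + q = 0, so passing a = A^T and
# q = C^T C yields A^T W_do A - W_do = -C^T C.
W_do = solve_discrete_lyapunov(A.T, C.T @ C)

# Truncated series \sum_{m=0}^{M} (A^T)^m C^T C A^m; it converges quickly for stable A.
W_sum = sum(np.linalg.matrix_power(A.T, m) @ C.T @ C @ np.linalg.matrix_power(A, m)
            for m in range(200))

print(np.allclose(W_do, W_sum))  # True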

Linear Time Variant Systems

Linear time-variant (LTV) systems are those of the form:

[math]\displaystyle{ \begin{array}{c} \dot{\boldsymbol{x}}(t)\boldsymbol{=A}(t)\boldsymbol{x}(t)+\boldsymbol{B}(t)\boldsymbol{u}(t)\\ \boldsymbol{y}(t)=\boldsymbol{C}(t)\boldsymbol{x}(t) \end{array} }[/math]

That is, the matrices [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math] and [math]\displaystyle{ \boldsymbol{C} }[/math] have entries that vary with time. As in the continuous-time and discrete-time cases above, one may want to determine whether the system given by the pair [math]\displaystyle{ (\boldsymbol{A}(t),\boldsymbol{C}(t)) }[/math] is observable. This can be done in a way very similar to the preceding cases.

The system [math]\displaystyle{ (\boldsymbol{A}(t),\boldsymbol{C}(t)) }[/math] is observable at time [math]\displaystyle{ t_{0} }[/math] if and only if there exists a finite [math]\displaystyle{ t_{1}\gt t_{0} }[/math] such that the [math]\displaystyle{ n\times n }[/math] matrix

[math]\displaystyle{ \boldsymbol{W}_{o}(t_{0},t_{1})=\int_{t_{0}}^{^{t_{1}}}\boldsymbol{\Phi}^{T}(\tau, t_{0})\boldsymbol{C}^{T}(\tau)\boldsymbol{C}(\tau)\boldsymbol{\Phi}(\tau, t_0)d\tau }[/math]

where [math]\displaystyle{ \boldsymbol{\Phi}(t,\tau) }[/math] is the state transition matrix of [math]\displaystyle{ \boldsymbol{\dot{x}}=\boldsymbol{A}(t)\boldsymbol{x} }[/math], is nonsingular. This matrix is also called the Observability Gramian.
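
Numerically, one way to obtain this Gramian is to propagate the state transition matrix from [math]\displaystyle{ \frac{d}{d\tau}\boldsymbol{\Phi}(\tau,t_{0})=\boldsymbol{A}(\tau)\boldsymbol{\Phi}(\tau,t_{0}) }[/math], [math]\displaystyle{ \boldsymbol{\Phi}(t_{0},t_{0})=\boldsymbol{I} }[/math], while accumulating the integrand, all in one augmented ODE. The following is a rough sketch with illustrative A(t) and C(t), not a definitive implementation.

import numpy as np
from scipy.integrate import solve_ivp

n = 2
A = lambda t: np.array([[0.0, 1.0], [-2.0 - 0.1 * np.sin(t), -3.0]])
C = lambda t: np.array([[1.0 + 0.5 * np.cos(t), 0.0]])

def rhs(tau, z):
    Phi = z[:n * n].reshape(n, n)
    dPhi = A(tau) @ Phi                        # transition-matrix ODE
    dW = Phi.T @ C(tau).T @ C(tau) @ Phi       # Gramian integrand
    return np.concatenate([dPhi.ravel(), dW.ravel()])

def observability_gramian_ltv(t0, t1):
    """Integrate Phi and the Gramian integrand jointly from t0 to t1."""
    z0 = np.concatenate([np.eye(n).ravel(), np.zeros(n * n)])
    sol = solve_ivp(rhs, (t0, t1), z0, rtol=1e-8, atol=1e-10)
    return sol.y[n * n:, -1].reshape(n, n)

W = observability_gramian_ltv(0.0, 5.0)
print(np.linalg.matrix_rank(W) == n)   # nonsingular => observable on [0, 5]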

Again, this gives a method, analogous to the previous cases, for determining whether a system is observable.

Properties of [math]\displaystyle{ \boldsymbol{W}_{o}(t_{0},t_{1}) }[/math]

The Observability Gramian [math]\displaystyle{ \boldsymbol{W}_{o}(t_{0},t_{1}) }[/math] has the following property:

[math]\displaystyle{ \boldsymbol{W}_{o}(t_{0},t_{1})=\boldsymbol{W}_{o}(t_{0},t)+\boldsymbol{\Phi}^{T}(t,t_{0})\boldsymbol{W}_{o}(t,t_{1})\boldsymbol{\Phi}(t,t_{0}) }[/math]

which follows directly from the definition of [math]\displaystyle{ \boldsymbol{W}_{o}(t_{0},t_{1}) }[/math] and from the composition property of the state transition matrix:

[math]\displaystyle{ \boldsymbol{\Phi}(t_{1},t_{0})=\boldsymbol{\Phi}(t_{1},\tau)\boldsymbol{\Phi}(\tau,t_{0}) }[/math]

More about the Observability Gramian can be found in [3].

References

  1. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 156. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 
  2. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 171. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 
  3. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 179. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 
