Controllability Gramian

From HandWiki

In control theory, we may need to find out whether or not a system such as [math]\displaystyle{ \begin{array}{c} \dot{\boldsymbol{x}}(t)\boldsymbol{=Ax}(t)+\boldsymbol{Bu}(t)\\ \boldsymbol{y}(t)=\boldsymbol{Cx}(t)+\boldsymbol{Du}(t) \end{array} }[/math]

is controllable, where [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math], [math]\displaystyle{ \boldsymbol{C} }[/math] and [math]\displaystyle{ \boldsymbol{D} }[/math] are, respectively, [math]\displaystyle{ n\times n }[/math], [math]\displaystyle{ n\times p }[/math], [math]\displaystyle{ q\times n }[/math] and [math]\displaystyle{ q\times p }[/math] matrices for a system with [math]\displaystyle{ p }[/math] inputs, [math]\displaystyle{ n }[/math] state variables and [math]\displaystyle{ q }[/math] outputs.

One of the many ways one can achieve such goal is by the use of the Controllability Gramian.

Controllability in LTI Systems

Linear Time Invariant (LTI) Systems are those systems in which the parameters [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math], [math]\displaystyle{ \boldsymbol{C} }[/math] and [math]\displaystyle{ \boldsymbol{D} }[/math] are invariant with respect to time.

One can observe if the LTI system is or is not controllable simply by looking at the pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math]. Then, we can say that the following statements are equivalent:

1. The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math] is controllable.

2. The [math]\displaystyle{ n\times n }[/math] matrix

[math]\displaystyle{ \boldsymbol{W_{c}}(t)=\int_{0}^{t}e^{\boldsymbol{A}\tau}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}\tau}d\tau=\int_{0}^{t}e^{\boldsymbol{A}(t-\tau)}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}(t-\tau)}d\tau }[/math]

is nonsingular for any [math]\displaystyle{ t\gt 0 }[/math].

3. The [math]\displaystyle{ n\times np }[/math] controllability matrix

[math]\displaystyle{ {\mathcal{C}}=[\begin{array}{ccccc} \boldsymbol{B} & \boldsymbol{AB} & \boldsymbol{A^{2}B} & ... & \boldsymbol{A^{n-1}B}\end{array}] }[/math]

has rank n.

4. The [math]\displaystyle{ n\times(n+p) }[/math] matrix

[math]\displaystyle{ [\begin{array}{cc} \boldsymbol{A}\boldsymbol{-\lambda}\boldsymbol{I} & \boldsymbol{B}\end{array}] }[/math]

has full row rank at every eigenvalue [math]\displaystyle{ \lambda }[/math] of [math]\displaystyle{ \boldsymbol{A} }[/math].

If, in addition, all eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have negative real parts ([math]\displaystyle{ \boldsymbol{A} }[/math] is stable), and the unique solution of the Lyapunov equation

[math]\displaystyle{ \boldsymbol{A}\boldsymbol{W}_{c}+\boldsymbol{W}_{c}\boldsymbol{A^{T}}=-\boldsymbol{BB^{T}} }[/math]

is positive definite, the system is controllable. The solution is called the Controllability Gramian and can be expressed as

[math]\displaystyle{ \boldsymbol{W_{c}}=\int_{0}^{\infty}e^{\boldsymbol{A}\tau}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}\tau}d\tau }[/math]

In the following section we are going to take a closer look at the Controllability Gramian.

Controllability Gramian

The controllability Gramian can be found as the solution of the Lyapunov equation given by

[math]\displaystyle{ \boldsymbol{A}\boldsymbol{W}_{c}+\boldsymbol{W}_{c}\boldsymbol{A^{T}}=-\boldsymbol{BB^{T}} }[/math]

In fact, we can see that if we take

[math]\displaystyle{ \boldsymbol{W_{c}}=\int_{0}^{\infty}e^{\boldsymbol{A}\tau}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}\tau}d\tau }[/math]

as a solution, we are going to find that:

[math]\displaystyle{ \begin{array}{ccccc} \boldsymbol{A}\boldsymbol{W}_{c}+\boldsymbol{W}_{c}\boldsymbol{A^{T}} & = & \int_{0}^{\infty}\boldsymbol{A}e^{\boldsymbol{A}\tau}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}\tau}d\tau & + & \int_{0}^{\infty}e^{\boldsymbol{A}\tau}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}\tau}\boldsymbol{A^{T}}d\tau\\ & = & \int_{0}^{\infty}\frac{d}{d\tau}(e^{\boldsymbol{A}\tau}\boldsymbol{B}\boldsymbol{B}^{T}e^{\boldsymbol{A}^{T}\tau})d\tau & = & e^{\boldsymbol{A}t}\boldsymbol{B}\boldsymbol{B}^{T}e^{\boldsymbol{A}^{T}t}|_{t=0}^{\infty}\\ & = & \boldsymbol{0}-\boldsymbol{BB^{T}}\\ & = & \boldsymbol{-BB^{T}} \end{array} }[/math]

Where we used the fact that [math]\displaystyle{ e^{\boldsymbol{A}t}=0 }[/math] at [math]\displaystyle{ t=\infty }[/math] for stable [math]\displaystyle{ \boldsymbol{A} }[/math] (all its eigenvalues have negative real part). This shows us that [math]\displaystyle{ \boldsymbol{W}_{c} }[/math] is indeed the solution for the Lyapunov equation under analysis.

Properties

We can see that [math]\displaystyle{ \boldsymbol{BB^{T}} }[/math] is a symmetric matrix, therefore, so is [math]\displaystyle{ \boldsymbol{W}_{c} }[/math].

We can use again the fact that, if [math]\displaystyle{ \boldsymbol{A} }[/math] is stable (all its eigenvalues have negative real part) to show that [math]\displaystyle{ \boldsymbol{W}_{c} }[/math] is unique. In order to prove so, suppose we have two different solutions for

[math]\displaystyle{ \boldsymbol{A}\boldsymbol{W}_{c}+\boldsymbol{W}_{c}\boldsymbol{A^{T}}=-\boldsymbol{BB^{T}} }[/math]

and they are given by [math]\displaystyle{ \boldsymbol{W}_{c1} }[/math] and [math]\displaystyle{ \boldsymbol{W}_{c2} }[/math]. Then we have:

[math]\displaystyle{ \boldsymbol{A}\boldsymbol{(W}_{c1}-\boldsymbol{W}_{c2})+\boldsymbol{(W}_{c1}-\boldsymbol{W}_{c2})\boldsymbol{A^{T}}=\boldsymbol{0} }[/math]

Multiplying by [math]\displaystyle{ e^{\boldsymbol{A}t} }[/math] by the left and by [math]\displaystyle{ e^{\boldsymbol{A}^{T}t} }[/math] by the right, would lead us to

[math]\displaystyle{ e^{\boldsymbol{A}t}[\boldsymbol{A}\boldsymbol{(W}_{c1}-\boldsymbol{W}_{c2})+\boldsymbol{(W}_{c1}-\boldsymbol{W}_{c2})\boldsymbol{A^{T}}]e^{\boldsymbol{A^{T}}t}=\frac{d}{dt}[e^{\boldsymbol{A}t}[(\boldsymbol{W}_{c1}-\boldsymbol{W}_{c2})e^{\boldsymbol{A^{T}}t}]=\boldsymbol{0} }[/math]

Integrating from [math]\displaystyle{ 0 }[/math] to [math]\displaystyle{ \infty }[/math]:

[math]\displaystyle{ [e^{\boldsymbol{A}t}[(\boldsymbol{W}_{c1}-\boldsymbol{W}_{c2})e^{\boldsymbol{A^{T}}t}]|_{t=0}^{\infty}=\boldsymbol{0} }[/math]

using the fact that [math]\displaystyle{ e^{\boldsymbol{A}t}\rightarrow0 }[/math] as [math]\displaystyle{ t\rightarrow\infty }[/math]:

[math]\displaystyle{ \boldsymbol{0}-(\boldsymbol{W}_{c1}-\boldsymbol{W}_{c2})=\boldsymbol{0} }[/math]

In other words, [math]\displaystyle{ \boldsymbol{W}_{c} }[/math] has to be unique.

Also, we can see that

[math]\displaystyle{ \boldsymbol{x^{T}W_{c}x}=\int_{0}^{\infty}\boldsymbol{x}^{T}e^{\boldsymbol{A}t}\boldsymbol{BB^{T}}e^{\boldsymbol{A}^{T}t}\boldsymbol{x}dt=\int_{0}^{\infty}\left\Vert \boldsymbol{B^{T}e^{\boldsymbol{A}^{T}t}\boldsymbol{x}}\right\Vert _{2}^{2}dt }[/math]

is positive for any t (assuming the non-degenerate case where [math]\displaystyle{ \left\Vert \boldsymbol{B^{T}e^{\boldsymbol{A}^{T}t}\boldsymbol{x}}\right\Vert }[/math] is not identically zero). This makes [math]\displaystyle{ \boldsymbol{W}_{c} }[/math] a positive definite matrix.

More properties of controllable systems can be found in,[1] as well as the proof for the other equivalent statements of “The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math] is controllable” presented in section Controllability in LTI Systems.

Discrete Time Systems

For discrete time systems as

[math]\displaystyle{ \begin{array}{c} \boldsymbol{x}[k+1]\boldsymbol{=Ax}[k]+\boldsymbol{Bu}[k]\\ \boldsymbol{y}[k]=\boldsymbol{Cx}[k]+\boldsymbol{Du}[k] \end{array} }[/math]

One can check that there are equivalences for the statement “The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math] is controllable” (the equivalences are much alike for the continuous time case).

We are interested in the equivalence that claims that, if “The pair [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math] is controllable” and all the eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have magnitude less than [math]\displaystyle{ 1 }[/math] ([math]\displaystyle{ \boldsymbol{A} }[/math] is stable), then the unique solution of

[math]\displaystyle{ W_{dc}-\boldsymbol{A}\boldsymbol{W}_{dc}\boldsymbol{A^{T}}=\boldsymbol{BB^{T}} }[/math]

is positive definite and given by

[math]\displaystyle{ \boldsymbol{W}_{dc}=\sum_{m=0}^{\infty}\boldsymbol{A}^{m}\boldsymbol{BB}^{T}(\boldsymbol{A}^{T})^{m} }[/math]

That is called the discrete Controllability Gramian. We can easily see the correspondence between discrete time and the continuous time case, that is, if we can check that [math]\displaystyle{ \boldsymbol{W}_{dc} }[/math] is positive definite, and all eigenvalues of [math]\displaystyle{ \boldsymbol{A} }[/math] have magnitude less than [math]\displaystyle{ 1 }[/math], the system [math]\displaystyle{ (\boldsymbol{A},\boldsymbol{B}) }[/math] is controllable. More properties and proofs can be found in.[2]

Linear Time Variant Systems

Linear time variant (LTV) systems are those in the form:

[math]\displaystyle{ \begin{array}{c} \dot{\boldsymbol{x}}(t)\boldsymbol{=A}(t)\boldsymbol{x}(t)+\boldsymbol{B}(t)\boldsymbol{u}(t)\\ \boldsymbol{y}(t)=\boldsymbol{C}(t)\boldsymbol{x}(t) \end{array} }[/math]

That is, the matrices [math]\displaystyle{ \boldsymbol{A} }[/math], [math]\displaystyle{ \boldsymbol{B} }[/math] and [math]\displaystyle{ \boldsymbol{C} }[/math] have entries that varies with time. Again, as well as in the continuous time case and in the discrete time case, one may be interested in discovering if the system given by the pair [math]\displaystyle{ (\boldsymbol{A}(t),\boldsymbol{B}(t)) }[/math] is controllable or not. This can be done in a very similar way of the preceding cases.

The system [math]\displaystyle{ (\boldsymbol{A}(t),\boldsymbol{B}(t)) }[/math] is controllable at time [math]\displaystyle{ t_{0} }[/math] if and only if there exists a finite [math]\displaystyle{ t_{1}\gt t_{0} }[/math] such that the [math]\displaystyle{ n\times n }[/math] matrix, also called the Controllability Gramian, given by

[math]\displaystyle{ \boldsymbol{W}_{c}(t_{0},t_{1})=\int_{t_{0}}^{t_{1}}\boldsymbol{\Phi}(t_{1},\tau)\boldsymbol{B}(\tau)\boldsymbol{B}^{T}(\tau)\boldsymbol{\Phi}^{T}(t_{1},\tau)d\tau, }[/math]

where [math]\displaystyle{ \boldsymbol{\Phi}(t,\tau) }[/math] is the state transition matrix of [math]\displaystyle{ \boldsymbol{\dot{x}}=\boldsymbol{A}(t)\boldsymbol{x} }[/math], is nonsingular.

Again, we have a similar method to determine if a system is or is not a controllable system.

Properties of Wc(t0,t1)

We have that the Controllability Gramian [math]\displaystyle{ \boldsymbol{W}_{c}(t_{0},t_{1}) }[/math] have the following property:

[math]\displaystyle{ \boldsymbol{W}_c(t_0, t_1)=\boldsymbol{W}_c(t, t_1)+\boldsymbol{\Phi}(t_1,t)\boldsymbol{W}_c(t_0, t)\boldsymbol{\Phi}^T(t_1,t) }[/math]

that can easily be seen by the definition of [math]\displaystyle{ \boldsymbol{W}_{c}(t_{0},t_{1}) }[/math] and by the property of the state transition matrix that claims that:

[math]\displaystyle{ \boldsymbol{\Phi}(t_1,\tau)=\boldsymbol{\Phi}(t_{1},t)\boldsymbol{\Phi}(t,\tau) }[/math]

More about the Controllability Gramian can be found in.[3]

See also

References

  1. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 145. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 
  2. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 169. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 
  3. Chen, Chi-Tsong (1999). Linear System Theory and Design Third Edition. New York, New York: Oxford University Press. p. 176. ISBN 0-19-511777-8. https://archive.org/details/linearsystemtheo00chen. 

External links