solution
Since J\left ( \mathbf{u}\right ) is a convex function J:\Re ^{n}\rightarrow \Re , by the definition of a convex function we write J\left ( \left ( 1-\lambda \right ) \mathbf{u}^{1}+\lambda \mathbf{u}^{2}\right ) \leq \left ( 1-\lambda \right ) J\left ( \mathbf{u}^{1}\right ) +\lambda J\left ( \mathbf{u}^{2}\right ) where \lambda \in \left ( 0,1\right ) . Rewriting the above as follows\begin{align*} J\left ( \mathbf{u}^{1}-\lambda \mathbf{u}^{1}+\lambda \mathbf{u}^{2}\right ) & \leq J\left ( \mathbf{u}^{1}\right ) -\lambda J\left ( \mathbf{u}^{1}\right ) +\lambda J\left ( \mathbf{u}^{2}\right ) \\ J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) & \leq \lambda \left ( J\left ( \mathbf{u}^{2}\right ) -J\left ( \mathbf{u}^{1}\right ) \right ) \end{align*}
Dividing both sides by \lambda >0, which preserves the direction of the inequality, gives \frac{J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda }\leq J\left ( \mathbf{u}^{2}\right ) -J\left ( \mathbf{u}^{1}\right ) Taking the limit \lambda \rightarrow 0 (the right-hand side does not depend on \lambda ) results in \lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda }\leq J\left ( \mathbf{u}^{2}\right ) -J\left ( \mathbf{u}^{1}\right ) But \lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda }=\left . \frac{\partial J\left ( \mathbf{u}\right ) }{\partial \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) }\right \vert _{\mathbf{u}^{1}}=\left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) (the appendix below shows how this comes about). Therefore the above becomes\begin{align*} \left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) & \leq J\left ( \mathbf{u}^{2}\right ) -J\left ( \mathbf{u}^{1}\right ) \\ J\left ( \mathbf{u}^{2}\right ) & \geq J\left ( \mathbf{u}^{1}\right ) +\left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \end{align*}
QED.
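As a sanity check on this inequality, here is a small MATLAB sketch (my own construction, not part of the original solution) that verifies J\left ( u^{2}\right ) \geq J\left ( u^{1}\right ) +\left [ \nabla J\left ( u^{1}\right ) \right ] ^{T}\left ( u^{2}-u^{1}\right ) for a hypothetical convex quadratic J\left ( u\right ) =u^{T}Qu with Q positive semidefinite:
\begin{verbatim}
% Check the first-order convexity condition for J(u) = u'*Q*u
rng(1);
n  = 4;
A  = randn(n);  Q = A'*A;          % positive semidefinite by construction
J  = @(u) u'*Q*u;
gJ = @(u) 2*Q*u;                   % gradient of u'*Q*u
u1 = randn(n,1);  u2 = randn(n,1);
fprintf('J(u2) = %.4f >= %.4f = J(u1) + grad''*(u2-u1)\n', ...
        J(u2), J(u1) + gJ(u1)'*(u2 - u1));
\end{verbatim}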
More details are given here on why \lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda }=\left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) Let \mathbf{u}^{2}-\mathbf{u}^{1}=\mathbf{d}. This is a direction vector pointing from \mathbf{u}^{1} toward \mathbf{u}^{2}. Evaluating \lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda } is the same as writing \begin{align*} \left . \frac{\partial J\left ( \mathbf{u}\right ) }{\partial \mathbf{d}}\right \vert _{\mathbf{u}^{1}} & =\lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda }\\ & =\left . \frac{d}{d\lambda }J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) \right \vert _{\lambda =0} \end{align*}
Using the chain rule gives\begin{align*} \left . \frac{d}{d\lambda }J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) \right \vert _{\lambda =0} & =\left . \left [ \nabla J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) \right ] ^{T}\frac{d}{d\lambda }\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) \right \vert _{\lambda =0}\\ & =\left . \left [ \nabla J\left ( \mathbf{u}^{1}+\lambda \mathbf{d}\right ) \right ] ^{T}\mathbf{d}\right \vert _{\lambda =0}\\ & =\left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\mathbf{d} \end{align*}
Replacing \mathbf{u}^{2}-\mathbf{u}^{1}=\mathbf{d}, the above becomes\begin{align*} \lim _{\lambda \rightarrow 0}\frac{J\left ( \mathbf{u}^{1}+\lambda \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \right ) -J\left ( \mathbf{u}^{1}\right ) }{\lambda } & =\left . \frac{\partial J\left ( \mathbf{u}\right ) }{\partial \left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) }\right \vert _{\mathbf{u}^{1}}\\ & =\left [ \nabla J\left ( \mathbf{u}^{1}\right ) \right ] ^{T}\left ( \mathbf{u}^{2}-\mathbf{u}^{1}\right ) \end{align*}
Where \nabla J\left ( \mathbf{u}^{1}\right ) is the gradient vector of J\left ( \mathbf{u}\right ) evaluated at \mathbf{u}=\mathbf{u}^{1}.
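The directional-derivative identity can likewise be spot-checked numerically; the following sketch (using the same hypothetical quadratic as above) compares the finite-difference quotient against \left [ \nabla J\left ( u^{1}\right ) \right ] ^{T}\mathbf{d} as \lambda \rightarrow 0:
\begin{verbatim}
% Finite-difference check of (J(u1+lam*d)-J(u1))/lam -> grad(J(u1))'*d
rng(2);
n  = 4;
A  = randn(n);  Q = A'*A;
J  = @(u) u'*Q*u;
gJ = @(u) 2*Q*u;
u1 = randn(n,1);  d = randn(n,1);
for lam = [1e-2 1e-4 1e-6]
    fd = (J(u1 + lam*d) - J(u1))/lam;
    fprintf('lam=%.0e  quotient=%.8f  grad''*d=%.8f\n', lam, fd, gJ(u1)'*d);
end
\end{verbatim}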
solution
Since each m_{ij}\left ( q\right ) is a convex function of q, we have\begin{equation} m_{ij}\left ( \left ( 1-\alpha \right ) q^{1}+\alpha q^{2}\right ) \leq \left ( 1-\alpha \right ) m_{ij}\left ( q^{1}\right ) +\alpha m_{ij}\left ( q^{2}\right ) \tag{1} \end{equation} for \alpha \in \left [ 0,1\right ] . We also know, by the Rayleigh quotient theorem for symmetric matrices, that the largest eigenvalue of a symmetric matrix M is given by \lambda _{\max }=\max _{x\in \Re ^{n},\left \Vert x\right \Vert =1}x^{T}Mx Therefore, evaluated at the point q^{\alpha }=\left ( 1-\alpha \right ) q^{1}+\alpha q^{2}, this becomes\begin{equation} \lambda _{\max }\left ( \left ( 1-\alpha \right ) q^{1}+\alpha q^{2}\right ) =\max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}m_{ij}\left ( \left ( 1-\alpha \right ) q^{1}+\alpha q^{2}\right ) x_{i}x_{j}\tag{2} \end{equation} Applying (1) to the right-hand side of (2) replaces the equality with an inequality; the last step below uses the fact that the maximum of a sum is at most the sum of the maxima:\begin{align} \lambda _{\max }\left ( \left ( 1-\alpha \right ) q^{1}+\alpha q^{2}\right ) & \leq \max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}\left ( \left ( 1-\alpha \right ) m_{ij}\left ( q^{1}\right ) +\alpha m_{ij}\left ( q^{2}\right ) \right ) x_{i}x_{j}\nonumber \\ & =\max _{\left \Vert x\right \Vert =1}\left ( \sum _{i,j}^{n}\left ( 1-\alpha \right ) m_{ij}\left ( q^{1}\right ) x_{i}x_{j}+\sum _{i,j}^{n}\alpha m_{ij}\left ( q^{2}\right ) x_{i}x_{j}\right ) \nonumber \\ & \leq \left ( 1-\alpha \right ) \left ( \max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}m_{ij}\left ( q^{1}\right ) x_{i}x_{j}\right ) +\alpha \left ( \max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}m_{ij}\left ( q^{2}\right ) x_{i}x_{j}\right ) \tag{3} \end{align}
Since \max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}m_{ij}\left ( q^{1}\right ) x_{i}x_{j}=\lambda _{\max }\left ( q^{1}\right ) and \max _{\left \Vert x\right \Vert =1}\sum _{i,j}^{n}m_{ij}\left ( q^{2}\right ) x_{i}x_{j}=\lambda _{\max }\left ( q^{2}\right ) , (3) becomes \lambda _{\max }\left ( \left ( 1-\alpha \right ) q^{1}+\alpha q^{2}\right ) \leq \left ( 1-\alpha \right ) \lambda _{\max }\left ( q^{1}\right ) +\alpha \lambda _{\max }\left ( q^{2}\right ) This is the definition of a convex function; therefore \lambda _{\max } is a convex function of q.
Note: I also tried to reduce this to a problem where I could argue that the pointwise maximum of convex functions is itself a convex function, but I could not find a clean way to do so, so I solved it as above. I hope I did not violate the cardinal rule by using \lambda _{\max }=\max _{x\in \Re ^{n},\left \Vert x\right \Vert =1}x^{T}Mx.
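For what it is worth, the claimed convexity is easy to probe numerically. The sketch below uses a hypothetical affine family M\left ( q\right ) =M_{0}+qM_{1} (whose entries m_{ij}\left ( q\right ) are trivially convex in q) and checks the convexity inequality for \lambda _{\max } along a grid of \alpha values:
\begin{verbatim}
% Check lmax((1-a)q1 + a*q2) <= (1-a)*lmax(q1) + a*lmax(q2)
rng(3);
n  = 4;
S0 = randn(n);  M0 = (S0 + S0')/2;   % symmetric M0
S1 = randn(n);  M1 = (S1 + S1')/2;   % symmetric M1
lmax = @(q) max(eig(M0 + q*M1));
q1 = -2;  q2 = 3;
for a = 0:0.25:1
    qa = (1-a)*q1 + a*q2;
    fprintf('a=%.2f  %.4f <= %.4f\n', a, lmax(qa), ...
            (1-a)*lmax(q1) + a*lmax(q2));
end
\end{verbatim}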
solution
To show U is bounded, a proof by induction is used. From the definition of U, U=\left \{ x\in \Re ^{n}:x=\sum _{i=1}^{m}\lambda _{i}u^{i}\right \} where \sum _{i=1}^{m}\lambda _{i}=1 and \lambda _{i}\geq 0.
For m=1, x=\lambda _{1}u^{1} with \lambda _{1}=1, so U contains just the one element u^{1}. Since u^{1} is given and bounded, U is a closed and bounded set with one element, hence compact. Now assume U is compact for m=k-1; we need to show it is compact for m=k. In other words, we assume that each x^{\ast }\in U generated using x^{\ast }=\sum _{i=1}^{k-1}\lambda _{i}u^{i} is such that \left \Vert x^{\ast }\right \Vert <\infty and x^{\ast }\in U. We now need to show that U is bounded when the generator contains k elements. Now\begin{align*} x & =\sum _{i=1}^{k}\lambda _{i}u^{i}\\ & =\lambda _{1}u^{1}+\lambda _{2}u^{2}+\cdots +\lambda _{k-1}u^{k-1}+\lambda _{k}u^{k} \end{align*}
Assume \lambda _{k}\neq 1 (if \lambda _{k}=1 then x=u^{k}, which is already in U). Multiplying and dividing by \left ( 1-\lambda _{k}\right ) \begin{align*} x & =\left ( 1-\lambda _{k}\right ) \left ( \frac{\lambda _{1}u^{1}}{\left ( 1-\lambda _{k}\right ) }+\frac{\lambda _{2}}{\left ( 1-\lambda _{k}\right ) }u^{2}+\cdots +\frac{\lambda _{k-1}u^{k-1}}{\left ( 1-\lambda _{k}\right ) }+\frac{\lambda _{k}}{\left ( 1-\lambda _{k}\right ) }u^{k}\right ) \\ & =\left ( 1-\lambda _{k}\right ) \left ( \sum _{i=1}^{k-1}\frac{\lambda _{i}}{\left ( 1-\lambda _{k}\right ) }u^{i}+\frac{\lambda _{k}}{\left ( 1-\lambda _{k}\right ) }u^{k}\right ) \\ & =\left ( 1-\lambda _{k}\right ) \left ( \sum _{i=1}^{k-1}\frac{\lambda _{i}}{\left ( 1-\lambda _{k}\right ) }u^{i}\right ) +\lambda _{k}u^{k} \end{align*}
But \sum _{i=1}^{k-1}\frac{\lambda _{i}}{\left ( 1-\lambda _{k}\right ) }u^{i}=x^{\ast } is a convex combination of k-1 generators, since its weights are nonnegative and sum to \frac{1}{1-\lambda _{k}}\sum _{i=1}^{k-1}\lambda _{i}=\frac{1-\lambda _{k}}{1-\lambda _{k}}=1; by the induction hypothesis x^{\ast }\in U. Hence the above becomes x=\left ( 1-\lambda _{k}\right ) x^{\ast }+\lambda _{k}u^{k} Since u^{k} is an element of the generator set G and is in U by definition, the above is a convex combination of two elements of U. Hence x is also in U (it lies on the line segment between x^{\ast } and u^{k}, both in U), and \left \Vert x\right \Vert \leq \left ( 1-\lambda _{k}\right ) \left \Vert x^{\ast }\right \Vert +\lambda _{k}\left \Vert u^{k}\right \Vert <\infty . Therefore U is closed and bounded for any m in the generator set. Hence U is compact.
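A small sampling experiment (my own illustration, with hypothetical random generators) agrees with the boundedness claim: every convex combination stays inside the ball of radius \max _{i}\left \Vert u^{i}\right \Vert :
\begin{verbatim}
% Random convex combinations of the generators never exceed the
% largest generator norm (triangle inequality, weights summing to 1)
rng(4);
n = 3;  m = 5;
U = randn(n, m);                          % columns are the generators u^i
r = max(vecnorm(U));
for trial = 1:1000
    lam = rand(m,1);  lam = lam/sum(lam); % random convex weights
    assert(norm(U*lam) <= r + 1e-12);
end
disp('all sampled convex combinations satisfy ||x|| <= max ||u^i||')
\end{verbatim}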
The extreme points of P are a subset of G; they are the points used to generate P. The set P is compact (by problem 3) and convex (by construction, since it consists of convex combinations of its extreme points). If we can show that J^{\ast } is attained at an extreme point of P, then we are done, since an extreme point of P is in G.
Let u^{\ast }\in P be the point where J\left ( u\right ) is maximum. u^{\ast } is a convex combination of the extreme points of P (these are a subset of G, but they can be the whole set G if there are no redundant generators). Therefore u^{\ast }=\sum _{i=1}^{k}\lambda _{i}v^{i} where k\leq N and v^{i}\in G. If it happens that all points in G are extreme points of P, then k=N. Therefore J^{\ast }=J\left ( u^{\ast }\right ) =J\left ( \sum _{i=1}^{k}\lambda _{i}v^{i}\right ) where \sum _{i=1}^{k}\lambda _{i}=1 and \lambda _{i}\geq 0. But J is a convex function (given). Hence by the definition of a convex function \begin{equation} J^{\ast }=J\left ( \sum _{i=1}^{k}\lambda _{i}v^{i}\right ) \leq \sum _{i=1}^{k}\lambda _{i}J\left ( v^{i}\right ) \tag{1} \end{equation} This is the generalization (Jensen's inequality) of J\left ( \left ( 1-\lambda \right ) u^{1}+\lambda u^{2}\right ) \leq \left ( 1-\lambda \right ) J\left ( u^{1}\right ) +\lambda J\left ( u^{2}\right ) to convex mixtures. Now we look at the J\left ( v^{i}\right ) terms in the above. There must be a point in G where J\left ( v\right ) is largest; call this value J_{G}^{\ast }, the maximum of J over the generator elements v^{i},i=1\cdots k. Eq (1) becomes J^{\ast }\leq \sum _{i=1}^{k}\lambda _{i}J_{G}^{\ast } where we replaced each J\left ( v^{i}\right ) by the maximum value J_{G}^{\ast }. But J_{G}^{\ast } does not depend on i, so we can take it outside the sum: J^{\ast }\leq J_{G}^{\ast }\left ( \sum _{i=1}^{k}\lambda _{i}\right ) But \sum _{i=1}^{k}\lambda _{i}=1 by definition. Therefore J^{\ast }\leq J_{G}^{\ast } We now see that the maximum of J\left ( u\right ) over P is no larger than the maximum of J\left ( u\right ) over the generator set G. Hence a maximum occurs at one of the extreme points v^{i}, since these are by definition taken from G, which is what we were asked to show.
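The same conclusion can be probed numerically; the sketch below (with the hypothetical convex choice J\left ( u\right ) =\left \Vert u\right \Vert ^{2} and random generators) confirms that no sampled point of the hull beats the generator maximum:
\begin{verbatim}
% Max of a convex J over sampled hull points never exceeds the
% max of J over the generators themselves
rng(5);
n = 3;  m = 5;
G = randn(n, m);                          % columns are the generators
J = @(u) norm(u)^2;
JG = max(arrayfun(@(i) J(G(:,i)), 1:m));  % max over generators
Jhull = 0;
for trial = 1:1000
    lam = rand(m,1);  lam = lam/sum(lam);
    Jhull = max(Jhull, J(G*lam));
end
fprintf('sampled hull max %.4f <= generator max %.4f\n', Jhull, JG)
\end{verbatim}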
solution
Let us look at the closed loop system. Setting v=0 and using u\left ( t\right ) =kx\left ( t\right ) gives\begin{align*} \dot{x} & =Ax+Bkx\\ & =\left ( A+Bk\right ) x\\ & =A_{c}x \end{align*}
where A_{c} is the closed loop system matrix. Since J\left ( k\right ) =\int _{0}^{\infty }\left ( x^{T}\left ( t\right ) x\left ( t\right ) +\lambda u^{T}\left ( t\right ) u\left ( t\right ) \right ) dt, where u\left ( t\right ) =kx\left ( t\right ) , then\begin{align*} J\left ( k\right ) & =\int _{0}^{\infty }\left ( x^{T}x+\lambda \left ( kx\right ) ^{T}\left ( kx\right ) \right ) dt\\ & =\int _{0}^{\infty }\left ( x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x\right ) dt \end{align*}
Let us find a matrix P, if possible, such that \frac{d}{dt}\left ( x^{T}Px\right ) =-\left ( x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x\right ) Can we find such a P? Since \frac{d}{dt}\left ( x^{T}Px\right ) =x^{T}P\dot{x}+\dot{x}^{T}Px we need to solve\begin{align*} x^{T}P\dot{x}+\dot{x}^{T}Px & =-\left ( x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x\right ) \\ x^{T}P\left ( A_{c}x\right ) +\left ( A_{c}x\right ) ^{T}Px & =-\left ( x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x\right ) \\ x^{T}P\left ( A_{c}x\right ) +\left ( x^{T}A_{c}^{T}\right ) Px & =-\left ( x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x\right ) \end{align*}
Bringing everything to the left-hand side,\begin{align*} x^{T}x+\lambda x^{T}\left ( k^{T}k\right ) x+x^{T}P\left ( A_{c}x\right ) +\left ( x^{T}A_{c}^{T}\right ) Px & =0\\ x^{T}\left ( I+\lambda k^{T}k+PA_{c}+A_{c}^{T}P\right ) x & =0 \end{align*} Since this must hold for all x, the matrix of the quadratic form must vanish, giving \lambda \left ( k^{T}k\right ) +PA_{c}+A_{c}^{T}P=-I
Hence the Lyapunov equation to solve for P is \fbox{$\lambda \left ( k^Tk\right ) +PA_c+A_c^TP=-I$} where I is the identity matrix. This is the equation that determines the matrix P. Without loss of generality, we take P to be a symmetric matrix. Using this P, we now write\begin{align*} J\left ( k\right ) & =\int _{0}^{\infty }\left ( x^{T}x+\lambda \left ( kx\right ) ^{T}\left ( kx\right ) \right ) dt\\ & =-\int _{0}^{\infty }\frac{d}{dt}\left ( x^{T}Px\right ) dt\\ & =\left . x^{T}Px\right \vert _{\infty }^{0}\\ & =x^{T}\left ( 0\right ) Px\left ( 0\right ) -x^{T}\left ( \infty \right ) Px\left ( \infty \right ) \end{align*}
For a stable system, x\left ( \infty \right ) \rightarrow 0 (since we set v=0 there is no external input, so a stable system must decay to the zero state). In part (b) we find the range of k for which the closed loop poles lie in the left half plane. Therefore J\left ( k\right ) =x^{T}\left ( 0\right ) P\left ( k\right ) x\left ( 0\right ) with P\left ( k\right ) the solution of the Lyapunov equation found above.
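This relation is easy to verify numerically. The following sketch (assuming the Control System Toolbox's lyap, and using as a test point the stabilizing gain found later in part (b)) compares x^{T}\left ( 0\right ) Px\left ( 0\right ) against direct integration of the cost:
\begin{verbatim}
% Verify J(k) = x0'*P*x0 by integrating the cost along the closed loop
lambda = 1;  k = [-1 -1];
A  = [0 1; 0 0];  B = [0; 1];
Ac = A + B*k;
Q  = eye(2) + lambda*(k'*k);       % integrand is x'*Q*x
P  = lyap(Ac', Q);                 % solves Ac'*P + P*Ac = -Q
x0 = [1; 0];
% augmented state: [x; running cost]
f = @(t, z) [Ac*z(1:2); z(1:2)'*Q*z(1:2)];
[~, z] = ode45(f, [0 60], [x0; 0]);
fprintf('x0''*P*x0 = %.5f   integral = %.5f\n', x0'*P*x0, z(end,3));
\end{verbatim}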
For k=\begin{bmatrix} k_{1} & k_{2}\end{bmatrix} , x\left ( 0\right ) =\begin{bmatrix} 1\\ 0 \end{bmatrix} and the plant y^{\prime \prime }=u, we have x_{1}^{\prime }=x_{2},x_{2}^{\prime }=u. Since u=\begin{bmatrix} k_{1} & k_{2}\end{bmatrix}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} the system \dot{x}=Ax+Bu becomes \begin{align*} x^{\prime } & =Ax+Bu\\ & =Ax+Bkx\\ & =\left ( A+Bk\right ) x\\\begin{bmatrix} x_{1}^{\prime }\\ x_{2}^{\prime }\end{bmatrix} & =\left ( \overset{A}{\overbrace{\begin{bmatrix} 0 & 1\\ 0 & 0 \end{bmatrix} }}+\overset{B}{\overbrace{\begin{bmatrix} 0\\ 1 \end{bmatrix} }}\overset{k}{\overbrace{\begin{bmatrix} k_{1} & k_{2}\end{bmatrix} }}\right ) \begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \\ & =\left ( \begin{bmatrix} 0 & 1\\ 0 & 0 \end{bmatrix} +\begin{bmatrix} 0 & 0\\ k_{1} & k_{2}\end{bmatrix} \right ) \begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \\ & =\overset{A_{c}}{\overbrace{\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix} }}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \end{align*}
For a stable system we need k_{1},k_{2}<0, from the roots of the characteristic equation \left \vert \lambda I-A_{c}\right \vert =\lambda ^{2}-k_{2}\lambda -k_{1}=0 (both coefficients -k_{2} and -k_{1} must be positive). Now we solve the Lyapunov equation.\begin{align*} \lambda \left ( k^{T}k\right ) +PA_{c}+A_{c}^{T}P & =-I\\ \lambda \begin{bmatrix} k_{1} & k_{2}\end{bmatrix} ^{T}\begin{bmatrix} k_{1} & k_{2}\end{bmatrix} +\begin{bmatrix} p_{11} & p_{12}\\ p_{21} & p_{22}\end{bmatrix}\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix} +\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix} ^{T}\begin{bmatrix} p_{11} & p_{12}\\ p_{21} & p_{22}\end{bmatrix} & =\begin{bmatrix} -1 & 0\\ 0 & -1 \end{bmatrix} \\ \lambda \begin{bmatrix} k_{1}\\ k_{2}\end{bmatrix}\begin{bmatrix} k_{1} & k_{2}\end{bmatrix} +\begin{bmatrix} p_{11} & p_{12}\\ p_{21} & p_{22}\end{bmatrix}\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix} +\begin{bmatrix} 0 & k_{1}\\ 1 & k_{2}\end{bmatrix}\begin{bmatrix} p_{11} & p_{12}\\ p_{21} & p_{22}\end{bmatrix} & =\begin{bmatrix} -1 & 0\\ 0 & -1 \end{bmatrix} \\ \lambda \begin{bmatrix} k_{1}^{2} & k_{1}k_{2}\\ k_{1}k_{2} & k_{2}^{2}\end{bmatrix} +\begin{bmatrix} k_{1}p_{12} & p_{11}+k_{2}p_{12}\\ k_{1}p_{22} & p_{21}+k_{2}p_{22}\end{bmatrix} +\begin{bmatrix} k_{1}p_{21} & k_{1}p_{22}\\ p_{11}+k_{2}p_{21} & p_{12}+k_{2}p_{22}\end{bmatrix} & =\begin{bmatrix} -1 & 0\\ 0 & -1 \end{bmatrix} \\\begin{bmatrix} k_{1}\left ( p_{12}+p_{21}+\lambda k_{1}\right ) & p_{11}+k_{1}p_{22}+k_{2}p_{12}+\lambda k_{1}k_{2}\\ p_{11}+k_{1}p_{22}+k_{2}p_{21}+\lambda k_{1}k_{2} & \lambda k_{2}^{2}+2p_{22}k_{2}+p_{12}+p_{21}\end{bmatrix} & =\begin{bmatrix} -1 & 0\\ 0 & -1 \end{bmatrix} \end{align*}
Hence we have 4 equations to solve for p_{11},p_{12},p_{21},p_{22} (knowing also that p_{12}=p_{21} since P is symmetric). Setting \lambda =1 per the problem, the four equations from above are\begin{align*} k_{1}^{2}+k_{1}p_{12}+k_{1}p_{21} & =-1\\ p_{11}+k_{1}k_{2}+k_{1}p_{22}+k_{2}p_{12} & =0\\ p_{11}+k_{1}k_{2}+k_{1}p_{22}+k_{2}p_{21} & =0\\ k_{2}^{2}+2p_{22}k_{2}+p_{12}+p_{21} & =-1 \end{align*}
The solution is found using MATLAB syms, as sketched below.
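A minimal reconstruction of the symbolic solve, assuming MATLAB's Symbolic Math Toolbox (the original script is not included here; the variable names are my own):
\begin{verbatim}
% Solve the four Lyapunov equations symbolically for the entries of P
syms k1 k2 p11 p12 p21 p22 real
eqs = [ k1^2 + k1*p12 + k1*p21        == -1;
        p11 + k1*k2 + k1*p22 + k2*p12 ==  0;
        p11 + k1*k2 + k1*p22 + k2*p21 ==  0;
        k2^2 + 2*k2*p22 + p12 + p21   == -1 ];
S = solve(eqs, [p11 p12 p21 p22]);
P = simplify([S.p11 S.p12; S.p21 S.p22])
\end{verbatim}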
This gives
P=\begin{bmatrix} -\frac{k_{1}-k_{1}^{2}+k_{1}^{3}-k_{2}^{2}}{2k_{1}k_{2}} & -\frac{k_{1}^{2}+1}{2k_{1}}\\ -\frac{k_{1}^{2}+1}{2k_{1}} & -\frac{k_{1}+k_{1}k_{2}^{2}-k_{1}^{2}-1}{2k_{1}k_{2}}\end{bmatrix} Hence\begin{align*} J\left ( k\right ) & =x^{T}\left ( 0\right ) P\left ( k\right ) x\left ( 0\right ) \\ & =\begin{bmatrix} 1 & 0 \end{bmatrix}\begin{bmatrix} -\frac{k_{1}-k_{1}^{2}+k_{1}^{3}-k_{2}^{2}}{2k_{1}k_{2}} & -\frac{k_{1}^{2}+1}{2k_{1}}\\ -\frac{k_{1}^{2}+1}{2k_{1}} & -\frac{k_{1}+k_{1}k_{2}^{2}-k_{1}^{2}-1}{2k_{1}k_{2}}\end{bmatrix}\begin{bmatrix} 1\\ 0 \end{bmatrix} \end{align*}
Therefore \fbox{$J\left ( k\right ) =-\frac{1}{2k_1k_2}\left ( k_1^3-k_1^2+k_1-k_2^2\right ) $ } For k_{1}=k_{2}=k, the above becomes\begin{align*} J\left ( k\right ) & =-\frac{\left ( k^{3}-2k^{2}+k\right ) }{2k^{2}}\\ & =-\frac{\left ( k^{2}-2k+1\right ) }{2k} \end{align*}
Or \fbox{$J\left ( k\right ) =-\frac{1}{2k}\left ( k-1\right ) ^2$} Now we find the optimal J^{\ast }. Since \frac{dJ\left ( k\right ) }{dk}=\frac{\left ( k-1\right ) ^{2}}{2k^{2}}-\frac{\left ( 2k-2\right ) }{2k} setting \frac{dJ\left ( k\right ) }{dk}=0 gives k=1,-1 Since k must be negative for a stable system, we pick \fbox{$k^\ast =-1$} And \frac{d^{2}J\left ( k\right ) }{dk^{2}}=\frac{k-1}{k^{3}}-\frac{1}{k^{2}}=-\frac{1}{k^{3}} At k^{\ast }=-1, \frac{d^{2}J\left ( k\right ) }{dk^{2}}=1>0 Hence this is a minimum. Therefore J^{\ast }=\left . -\frac{1}{2k}\left ( k-1\right ) ^{2}\right \vert _{k=-1} Hence \fbox{$J^\ast =2$} J^{\ast } does not reach zero (the same as in the class problem we did without the \lambda u^{T}u term; I thought we would get J^{\ast }=0 now, since I thought that was the reason for adding \lambda u^{T}u. I hope I did not make a mistake, but I do not see where if I did). Below is a plot of J\left ( k\right ).
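The plot itself is not reproduced in this text; a short MATLAB sketch that regenerates it over the stabilizing range k<0:
\begin{verbatim}
% Plot J(k) = -(k-1)^2/(2k) for k < 0; the minimum J* = 2 is at k* = -1
k = linspace(-5, -0.1, 400);
J = -(k - 1).^2 ./ (2*k);
plot(k, J), grid on
xlabel('k'), ylabel('J(k)')
\end{verbatim}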
At k=1 we have J\left ( 1\right ) =0, but we cannot use k=1 since it makes the system unstable. Using k^{\ast }=-1, the system becomes\begin{align*} \begin{bmatrix} x_{1}^{\prime }\\ x_{2}^{\prime }\end{bmatrix} & =\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \\ & =\begin{bmatrix} 0 & 1\\ -1 & -1 \end{bmatrix}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \end{align*}
To verify stability: since \left \vert \lambda I-A_{c}\right \vert =\lambda ^{2}+\lambda +1 the roots are -\frac{1}{2}\pm \frac{1}{2}i\sqrt{3} Hence the system is stable, since the real parts of the roots are negative. If we had used k=1, the roots would be -0.618,1.618, and the system would have been unstable.
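As a quick numeric confirmation (a sketch using MATLAB's built-in eig):
\begin{verbatim}
% Closed loop eigenvalues for the two candidate gains
eig([0 1; -1 -1])   % k = -1: -0.5 +/- 0.866i, stable
eig([0 1;  1  1])   % k = +1: -0.618 and 1.618, unstable
\end{verbatim}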
From the last part, we obtained P P=\begin{bmatrix} -\frac{k_{1}-k_{1}^{2}+k_{1}^{3}-k_{2}^{2}}{2k_{1}k_{2}} & -\frac{k_{1}^{2}+1}{2k_{1}}\\ -\frac{k_{1}^{2}+1}{2k_{1}} & -\frac{k_{1}+k_{1}k_{2}^{2}-k_{1}^{2}-1}{2k_{1}k_{2}}\end{bmatrix} When k_{1}=k_{2}=k the above becomes P=\begin{bmatrix} \frac{-k+2k^{2}-k^{3}}{2k^{2}} & -\frac{k^{2}+1}{2k}\\ -\frac{k^{2}+1}{2k} & \frac{1-k-k^{3}+k^{2}}{2k^{2}}\end{bmatrix} Now, since x\left ( 0\right ) is a random variable,\begin{align*} J\left ( k\right ) & =E\left ( x^{T}\left ( 0\right ) Px\left ( 0\right ) \right ) \nonumber \\ & =E\left ( \begin{bmatrix} x_{1}\left ( 0\right ) & x_{2}\left ( 0\right ) \end{bmatrix}\begin{bmatrix} \frac{-k+2k^{2}-k^{3}}{2k^{2}} & -\frac{k^{2}+1}{2k}\\ -\frac{k^{2}+1}{2k} & \frac{1-k-k^{3}+k^{2}}{2k^{2}}\end{bmatrix}\begin{bmatrix} x_{1}\left ( 0\right ) \\ x_{2}\left ( 0\right ) \end{bmatrix} \right ) \nonumber \\ & =E\left ( -\frac{1}{2k^{2}}\left ( k^{3}x_{1}^{2}\left ( 0\right ) +2k^{3}x_{1}\left ( 0\right ) x_{2}\left ( 0\right ) +k^{3}x_{2}^{2}\left ( 0\right ) -2k^{2}x_{1}^{2}\left ( 0\right ) -k^{2}x_{2}^{2}\left ( 0\right ) +kx_{1}^{2}\left ( 0\right ) +2kx_{1}\left ( 0\right ) x_{2}\left ( 0\right ) +kx_{2}^{2}\left ( 0\right ) -x_{2}^{2}\left ( 0\right ) \right ) \right ) \tag{1} \end{align*}
Taking expectations term by term: E\left ( x_{1}\left ( 0\right ) \right ) =0 and E\left ( x_{2}\left ( 0\right ) \right ) =0, and since x_{1}\left ( 0\right ) ,x_{2}\left ( 0\right ) are i.i.d., E\left ( x_{1}\left ( 0\right ) x_{2}\left ( 0\right ) \right ) =E\left ( x_{1}\left ( 0\right ) \right ) E\left ( x_{2}\left ( 0\right ) \right ) =0. For the second moments, E\left ( x_{1}^{2}\left ( 0\right ) \right ) =\int _{-1}^{1}x^{2}p\left ( x\right ) dx=\frac{1}{2}\int _{-1}^{1}x^{2}dx=\frac{1}{2}\left ( \frac{x^{3}}{3}\right ) _{-1}^{1}=\frac{1}{3} and similarly E\left ( x_{2}^{2}\left ( 0\right ) \right ) =\frac{1}{3}. Using these values, Eq (1) becomes J\left ( k\right ) =-\frac{1}{2k^{2}}\left ( k^{3}\frac{1}{3}+k^{3}\frac{1}{3}-2k^{2}\frac{1}{3}-k^{2}\frac{1}{3}+k\frac{1}{3}+k\frac{1}{3}-\frac{1}{3}\right ) Or\begin{equation} \fbox{$J\left ( k\right ) =\frac{-2k^3+3k^2-2k+1}{6k^2}$}\tag{2} \end{equation} To find the optimum: \frac{dJ\left ( k\right ) }{dk}=-\frac{1}{3}-\frac{1}{3k^{3}}+\frac{1}{3k^{2}} Setting \frac{dJ\left ( k\right ) }{dk}=0 (equivalently, after multiplying through by -3k^{3}, solving k^{3}-k+1=0) gives 3 roots. The only one which is real and negative (the other two are complex) is \fbox{$k^\ast =-1.325$} At this k^{\ast } we check \frac{d^{2}J\left ( k\right ) }{dk^{2}}=\frac{1}{k^{4}}-\frac{2}{3k^{3}} and find it is 0.611>0, hence J is a minimum at k^{\ast }. Substituting k^{\ast } into (2) gives \fbox{$J^\ast =1.28817$}
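A quick numeric check of the stationary point and optimal cost (a MATLAB sketch applying roots to the cubic above):
\begin{verbatim}
% dJ/dk = 0  <=>  k^3 - k + 1 = 0 (after multiplying by -3k^3)
r = roots([1 0 -1 1]);          % one real root and a complex pair
kstar = r(imag(r) == 0)         % -1.3247
Jstar = (-2*kstar^3 + 3*kstar^2 - 2*kstar + 1)/(6*kstar^2)   % 1.28817
\end{verbatim}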
We now check that the system is stable (it should be, since k^{\ast }<0). The system is now\begin{align*} \begin{bmatrix} x_{1}^{\prime }\\ x_{2}^{\prime }\end{bmatrix} & =\begin{bmatrix} 0 & 1\\ k_{1} & k_{2}\end{bmatrix}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \\ & =\begin{bmatrix} 0 & 1\\ -1.325 & -1.325 \end{bmatrix}\begin{bmatrix} x_{1}\\ x_{2}\end{bmatrix} \end{align*}
Hence \left \vert \lambda I-A_{c}\right \vert =\lambda ^{2}+1.325\lambda +1.325 The roots are -0.6625\pm i0.941 The system is stable since the real parts of the roots are negative. The following is the step response of the system from part (b) and from part (c), for comparison.
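The step-response figure is not reproduced in this text; a sketch that regenerates the comparison, assuming the Control System Toolbox (ss, step), with input v and output y=x_{1}:
\begin{verbatim}
% Step responses of the two closed loops: part (b) k = [-1 -1],
% part (c) k = [-1.325 -1.325]; input v enters through B, output y = x1
B = [0; 1];  C = [1 0];  D = 0;
sysb = ss([0 1; -1     -1    ], B, C, D);
sysc = ss([0 1; -1.325 -1.325], B, C, D);
step(sysb, sysc, 15), grid on
legend('part (b): k = -1', 'part (c): k = -1.325')
\end{verbatim}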