Optimal Control Formulation using Calculus of Variations

(1)

Lecture – 25

Optimal Control Formulation using Calculus of Variations

Dr. Radhakant Padhi

Asst. Professor

Dept. of Aerospace Engineering

(2)

Topics

Optimal Control Formulation

• Objective & Selection of Performance Index

• Necessary Conditions of Optimality and Two-Point Boundary Value Problem (TPBVP) Formulation

• Boundary/Transversality Conditions

(3)

Optimal Control Formulation:

Objective & Selection of Performance Index


(4)

Objective

To find an "admissible" time history of the control variable $U(t)$, $t \in [t_0, t_f]$, which:

1) Causes the system governed by $\dot{X} = f(t, X, U)$ to follow an admissible trajectory

2) Optimizes (minimizes/maximizes) a "meaningful" performance index

\[ J = \phi(t_f, X_f) + \int_{t_0}^{t_f} L(t, X, U)\, dt \]

3) Forces the system to satisfy "proper boundary conditions".

[ our focus: $X(t_0) = X_0$ (given), $t_f$: fixed ]

(5)

Meaningful Performance Index

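1) Minimize the operational time

\[ J = (t_f - t_0) = \int_{t_0}^{t_f} 1\, dt, \qquad [\,\phi = 0,\ L = 1\,] \]

2) Minimize the control effort

\[ J = \frac{1}{2}\int_{t_0}^{t_f} U^T R\, U\, dt, \quad R > 0, \qquad \left[\,\phi = 0,\ L = \tfrac{1}{2} U^T R\, U\,\right] \]

3) Minimize deviation of state from a fixed value $C$ with minimum control effort

\[ J = \frac{1}{2}\int_{t_0}^{t_f} \left[ (X - C)^T Q\, (X - C) + U^T R\, U \right] dt, \quad Q \ge 0,\ R > 0 \]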

(6)

Meaningful Performance Index

4) Minimize deviation of state from origin with minimum control effort

\[ J = \frac{1}{2}\int_{t_0}^{t_f} \left[ X^T Q\, X + U^T R\, U \right] dt, \quad Q \ge 0,\ R > 0 \]

5) Minimize the control effort, while the final state $X_f$ reaches close to a constant $C$

\[ J = \frac{1}{2}(X_f - C)^T S_f\, (X_f - C) + \frac{1}{2}\int_{t_0}^{t_f} U^T R\, U\, dt, \quad S_f \ge 0,\ R > 0 \]
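As a rough illustration of how such an index is evaluated for a given trajectory, the sketch below approximates the scalar form of item 4, $J = \frac{1}{2}\int (q x^2 + r u^2)\, dt$, with the trapezoidal rule. The function name `quadratic_cost`, the weights `q = r = 1`, and the sampled trajectory $x(t) = 1 - t$, $u(t) = -1$ on $[0, 1]$ are assumptions made only for this example.

```python
def quadratic_cost(xs, us, dt, q=1.0, r=1.0):
    """Trapezoidal approximation of J = 1/2 * integral(q*x^2 + r*u^2) dt."""
    integrand = [0.5 * (q * x * x + r * u * u) for x, u in zip(xs, us)]
    total = 0.0
    for left, right in zip(integrand, integrand[1:]):
        total += 0.5 * (left + right) * dt
    return total


# Hypothetical sampled trajectory: x(t) = 1 - t, u(t) = -1 on [0, 1].
n = 1000
dt = 1.0 / n
ts = [i * dt for i in range(n + 1)]
xs = [1.0 - t for t in ts]
us = [-1.0 for _ in ts]

J = quadratic_cost(xs, us, dt)
print(J)  # analytic value for this trajectory is 2/3
```

For this trajectory the integral can be done by hand, $\frac{1}{2}\left(\frac{1}{3} + 1\right) = \frac{2}{3}$, which gives a quick sanity check on the quadrature.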

(7)

Optimal Control Using Calculus of

Variations:

Hamiltonian Formulation


(8)

Optimal Control Problem

• Performance Index (to minimize / maximize):

\[ J = \phi(t_f, X_f) + \int_{t_0}^{t_f} L(t, X, U)\, dt \]

• Path Constraint:

\[ \dot{X} = f(t, X, U) \]

• Boundary Conditions:

\[ X(0) = X_0 \text{ : Specified}, \qquad t_f \text{ : Fixed}, \qquad X(t_f) \text{ : Free} \]

(9)

Necessary Conditions of

Optimality

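• Augmented PI

\[ \bar{J} = \phi + \int_{t_0}^{t_f} \left[ L + \lambda^T (f - \dot{X}) \right] dt \]

• Hamiltonian

\[ H \triangleq L + \lambda^T f \]

• First Variation

\[ \delta\bar{J} = \delta\phi + \delta \int_{t_0}^{t_f} \left( H - \lambda^T \dot{X} \right) dt = \delta\phi + \int_{t_0}^{t_f} \delta\left( H - \lambda^T \dot{X} \right) dt \]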

(10)

Necessary Conditions of

Optimality

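• First Variation

\[ \delta\bar{J} = \delta\phi + \int_{t_0}^{t_f} \left( \delta H - \delta\lambda^T \dot{X} - \lambda^T \delta\dot{X} \right) dt \]

• Individual terms

\[ \delta\phi(t_f, X_f) = (\delta X_f)^T \left( \frac{\partial \phi}{\partial X_f} \right) \]

\[ \delta H(t, X, U, \lambda) = (\delta X)^T \left( \frac{\partial H}{\partial X} \right) + (\delta U)^T \left( \frac{\partial H}{\partial U} \right) + (\delta\lambda)^T \left( \frac{\partial H}{\partial \lambda} \right) \]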

(11)

Necessary Conditions of

Optimality

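\[
\begin{aligned}
\int_{t_0}^{t_f} \lambda^T \delta\dot{X}\, dt
&= \int_{t_0}^{t_f} \lambda^T \left( \frac{d(\delta X)}{dt} \right) dt \\
&= \left[ \lambda^T \delta X \right]_{t_0}^{t_f} - \int_{t_0}^{t_f} \left( \frac{d\lambda}{dt} \right)^T \delta X\, dt \\
&= \left[ \lambda_f^T\, \delta X_f - \underbrace{\lambda_0^T\, \delta X_0}_{=\,0} \right] - \int_{t_0}^{t_f} \delta X^T \dot{\lambda}\, dt \\
&= \lambda_f^T\, \delta X_f - \int_{t_0}^{t_f} \delta X^T \dot{\lambda}\, dt
\end{aligned}
\]

(The term at $t_0$ vanishes because $X_0$ is specified, so $\delta X_0 = 0$.)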

(12)

Necessary Conditions of

Optimality

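• First Variation

\[
\begin{aligned}
\delta\bar{J} ={}& (\delta X_f)^T \left( \frac{\partial \phi}{\partial X_f} \right) - \delta X_f^T\, \lambda_f \\
&+ \int_{t_0}^{t_f} \left[ (\delta X)^T \left( \frac{\partial H}{\partial X} \right) + (\delta U)^T \left( \frac{\partial H}{\partial U} \right) + (\delta\lambda)^T \left( \frac{\partial H}{\partial \lambda} \right) \right] dt \\
&+ \int_{t_0}^{t_f} \delta X^T \dot{\lambda}\, dt - \int_{t_0}^{t_f} \delta\lambda^T \dot{X}\, dt
\end{aligned}
\]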

(13)

Necessary Conditions of

Optimality

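• First Variation

\[
\begin{aligned}
\delta\bar{J} ={}& (\delta X_f)^T \left[ \frac{\partial \phi}{\partial X_f} - \lambda_f \right] \\
&+ \int_{t_0}^{t_f} (\delta X)^T \left[ \frac{\partial H}{\partial X} + \dot{\lambda} \right] dt + \int_{t_0}^{t_f} (\delta U)^T \left[ \frac{\partial H}{\partial U} \right] dt \\
&+ \int_{t_0}^{t_f} (\delta\lambda)^T \left[ \frac{\partial H}{\partial \lambda} - \dot{X} \right] dt \\
={}& 0
\end{aligned}
\]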

(14)

Necessary Conditions of

Optimality: Summary

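• State Equation

\[ \dot{X} = \frac{\partial H}{\partial \lambda} = f(t, X, U) \]

• Costate Equation

\[ \dot{\lambda} = -\left( \frac{\partial H}{\partial X} \right) \]

• Optimal Control Equation

\[ \frac{\partial H}{\partial U} = 0 \]

• Boundary Condition

\[ \lambda_f = \frac{\partial \phi}{\partial X_f}, \qquad X(t_0) = X_0 \text{ (specified)} \]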

(15)

Necessary Conditions of

Optimality: Some Comments

• State and Costate equations are dynamic equations

• Optimal control equation is a stationary equation

• Boundary conditions are split: it leads to Two-Point-Boundary-Value Problem (TPBVP)

• State equation develops forward whereas Costate equation develops backwards

• Traditionally, TPBVPs demand computationally-intensive iterative numerical procedures

• These iterative numerical procedures lead to …
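To make the split boundary conditions concrete, here is a minimal single-shooting sketch. The scalar problem is an assumption chosen for illustration, not one from the lecture: $\dot{x} = u$, $J = \frac{s}{2}(x(t_f) - c)^2 + \frac{1}{2}\int_0^{t_f} u^2\, dt$, so that $H = \frac{u^2}{2} + \lambda u$, $u = -\lambda$, $\dot{\lambda} = 0$ and $\lambda(t_f) = s\,(x(t_f) - c)$. The unknown initial costate is guessed, both equations are integrated forward, and the guess is corrected by a secant iteration until the terminal condition holds. All names and parameter values are hypothetical.

```python
def terminal_residual(lam0, x0=1.0, tf=2.0, s=4.0, c=0.0, n=2000):
    """Integrate state and costate forward (Euler) from a guessed lambda(0).

    Returns lambda(tf) - s*(x(tf) - c), which is zero when the
    terminal boundary condition is satisfied.
    """
    dt = tf / n
    x, lam = x0, lam0
    for _ in range(n):
        u = -lam         # optimal control equation: dH/du = u + lam = 0
        x += dt * u      # state equation: x_dot = u
        lam += dt * 0.0  # costate equation: lam_dot = -dH/dx = 0
    return lam - s * (x - c)


def shoot(guess_a=0.0, guess_b=1.0, tol=1e-10, max_iter=20):
    """Secant iteration on the unknown initial costate."""
    fa, fb = terminal_residual(guess_a), terminal_residual(guess_b)
    for _ in range(max_iter):
        if abs(fb) < tol:
            break
        step = fb * (guess_b - guess_a) / (fb - fa)
        guess_a, guess_b = guess_b, guess_b - step
        fa, fb = fb, terminal_residual(guess_b)
    return guess_b


lam0 = shoot()
print(lam0)  # analytic value: s*(x0 - c) / (1 + s*tf) = 4/9
```

Because this toy problem is linear, the secant step lands on the answer almost immediately; nonlinear problems generally need many such forward integrations, which is the computational burden the comments above refer to.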

(16)

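A Useful Theorem

Theorem:

If the Hamiltonian $H$ is not an explicit function of time, then $H$ is 'constant' along the optimal path.

Proof:

\[
\begin{aligned}
\frac{dH}{dt} &= \frac{\partial H}{\partial t} + \dot{X}^T \left( \frac{\partial H}{\partial X} \right) + \dot{U}^T \left( \frac{\partial H}{\partial U} \right) + \dot{\lambda}^T \left( \frac{\partial H}{\partial \lambda} \right) \\
&= \frac{\partial H}{\partial t} + \left( \frac{\partial H}{\partial X} + \dot{\lambda} \right)^T \dot{X} + \dot{U}^T \left( \frac{\partial H}{\partial U} \right) \qquad \left( \because\ \frac{\partial H}{\partial \lambda} = \dot{X} \right)
\end{aligned}
\]

On the optimal path, $\dot{\lambda} = -\partial H / \partial X$ and $\partial H / \partial U = 0$, so

\[ \frac{dH}{dt} = \frac{\partial H}{\partial t} = 0 \quad \text{if } H \text{ is not an explicit function of } t. \]

Hence, the result!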
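The theorem is easy to check numerically on a small case. The sketch below assumes a double integrator, $\dot{x}_1 = x_2$, $\dot{x}_2 = u$, with $L = \frac{u^2}{2}$, so $H = \frac{u^2}{2} + \lambda_1 x_2 + \lambda_2 u$. Solving the necessary conditions gives $\lambda_1 = c_1$, $\lambda_2 = -c_1 t + c_2$, $u = -\lambda_2$ and $x_2 = \frac{c_1 t^2}{2} - c_2 t + c_3$; the constants used here are arbitrary illustrative values, not taken from any particular boundary condition.

```python
def hamiltonian(t, c1, c2, c3):
    """H = u^2/2 + lam1*x2 + lam2*u evaluated on an extremal path."""
    lam1 = c1
    lam2 = -c1 * t + c2
    u = -lam2                               # from dH/du = u + lam2 = 0
    x2 = 0.5 * c1 * t ** 2 - c2 * t + c3    # integral of u
    return 0.5 * u ** 2 + lam1 * x2 + lam2 * u


# Arbitrary illustrative constants; sample H on t in [0, 4].
values = [hamiltonian(0.1 * k, c1=2.0, c2=3.0, c3=0.5) for k in range(41)]
spread = max(values) - min(values)
print(values[0], spread)
```

Expanding the expression by hand gives $H = c_1 c_3 - \frac{c_2^2}{2}$, which has no $t$ in it, so the sampled spread should be at the level of floating-point round-off.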

(17)

General

Boundary/Transversality Condition

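General condition:

\[ \left[ \frac{\partial \phi}{\partial X} - \lambda \right]^T_{t_f} \delta X_f + \left[ \frac{\partial \phi}{\partial t} + H \right]_{t_f} \delta t_f = 0 \]

Special Cases (with $(t_0, X_0)$ fixed):

1) $t_f$: fixed, $X(t_f)$: free

\[ \left[ \frac{\partial \phi}{\partial X} - \lambda \right]^T_{t_f} \delta X_f = 0 \ \Rightarrow\ \lambda_f = \frac{\partial \phi}{\partial X_f} \]

2) $t_f$: free, $X(t_f)$: fixed

\[ \left[ \frac{\partial \phi}{\partial t} + H \right]_{t_f} \delta t_f = 0 \ \Rightarrow\ H_{t_f} = -\frac{\partial \phi}{\partial t_f} \]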

(18)

Example – 1: A Toy Problem


(19)

Example

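Problem:

\[ \begin{bmatrix} \dot{x}_1 \\ \dot{x}_2 \end{bmatrix} = \begin{bmatrix} x_2 \\ -x_2 + u \end{bmatrix} \]

\[ J = \frac{1}{2}\left( x_1(t_f) - 5 \right)^2 + \frac{1}{2}\left( x_2(t_f) - 2 \right)^2 + \frac{1}{2}\int_{t_0}^{t_f} u^2\, dt \]

\[ t_0 = 0, \qquad t_f = 2, \qquad X(0) = 0 \]

Solution:

\[ H = \frac{u^2}{2} + \lambda_1 x_2 + \lambda_2 (-x_2 + u) \]

Costate Eq.

\[ \begin{bmatrix} \dot{\lambda}_1 \\ \dot{\lambda}_2 \end{bmatrix} = \begin{bmatrix} -\partial H / \partial x_1 \\ -\partial H / \partial x_2 \end{bmatrix} = \begin{bmatrix} 0 \\ -\lambda_1 + \lambda_2 \end{bmatrix} \]

Optimal control Eq.

\[ \frac{\partial H}{\partial u} = u + \lambda_2 = 0 \ \Rightarrow\ u = -\lambda_2 \]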

(20)

Example

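Boundary Conditions

\[ \begin{bmatrix} x_1(0) \\ x_2(0) \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}, \qquad \begin{bmatrix} \lambda_1(2) \\ \lambda_2(2) \end{bmatrix} = \begin{bmatrix} x_1(2) - 5 \\ x_2(2) - 2 \end{bmatrix} \]

Define

\[ Z = \begin{bmatrix} x_1 & x_2 & \lambda_1 & \lambda_2 \end{bmatrix}^T, \qquad \dot{Z} = A Z, \qquad A = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & -1 & 0 & -1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & -1 & 1 \end{bmatrix} \]

Solution

\[ Z(t) = e^{At} C, \qquad C = \begin{bmatrix} c_1 & c_2 & c_3 & c_4 \end{bmatrix}^T \]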
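The matrix exponential $e^{At}$ in this solution can be evaluated numerically. The sketch below is one simple way to do it, a truncated Taylor series combined with scaling and squaring, written in plain Python; the helper names `mat_mul` and `mat_exp` and the truncation settings are assumptions for illustration. The matrix $A$ is the one obtained from $\dot{x}_1 = x_2$, $\dot{x}_2 = -x_2 - \lambda_2$, $\dot{\lambda}_1 = 0$, $\dot{\lambda}_2 = -\lambda_1 + \lambda_2$.

```python
def mat_mul(P, Q):
    """Product of two square matrices stored as lists of rows."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_exp(A, t, squarings=10, terms=20):
    """exp(A*t) via a Taylor series on A*t/2^squarings, then repeated squaring."""
    n = len(A)
    scale = t / (2 ** squarings)
    M = [[A[i][j] * scale for j in range(n)] for i in range(n)]
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = mat_mul(term, M)                      # M^k / (k-1)!
        term = [[v / k for v in row] for row in term]  # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


# Z = [x1, x2, lam1, lam2]^T with u = -lam2 substituted.
A = [[0, 1, 0, 0],
     [0, -1, 0, -1],
     [0, 0, 0, 0],
     [0, 0, -1, 1]]

E = mat_exp(A, 2.0)
for row in E:
    print([round(v, 2) for v in row])
```

Evaluating at $t = 2$ gives the transition matrix needed to apply the terminal boundary conditions; its entries can be cross-checked against closed forms such as $1 - e^{-2}$, $\sinh 2$ and $e^{2}$.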

(21)

Example

Use the boundary condition at $t = 0$:

\[ \begin{bmatrix} c_1 \\ c_2 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix} \]

and at $t_f = 2$:

\[ \begin{bmatrix} x_1(2) \\ x_2(2) \\ x_1(2) - 5 \\ x_2(2) - 2 \end{bmatrix} = e^{2A} \begin{bmatrix} 0 \\ 0 \\ c_3 \\ c_4 \end{bmatrix} = \begin{bmatrix} 1 & 0.86 & 1.63 & -2.76 \\ 0 & 0.14 & 2.76 & -3.63 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & -6.39 & 7.39 \end{bmatrix} \begin{bmatrix} 0 \\ 0 \\ c_3 \\ c_4 \end{bmatrix} \]

(22)

Example

Four equations and four unknowns:

\[ \begin{bmatrix} 1 & 0 & -1.63 & 2.76 \\ 0 & 1 & -2.76 & 3.63 \\ 1 & 0 & -1 & 0 \\ 0 & 1 & 6.39 & -7.39 \end{bmatrix} \begin{bmatrix} x_1(2) \\ x_2(2) \\ c_3 \\ c_4 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 5 \\ 2 \end{bmatrix} \]

\[ \begin{bmatrix} x_1(2) \\ x_2(2) \\ c_3 \\ c_4 \end{bmatrix} = \begin{bmatrix} 1 & 0 & -1.63 & 2.76 \\ 0 & 1 & -2.76 & 3.63 \\ 1 & 0 & -1 & 0 \\ 0 & 1 & 6.39 & -7.39 \end{bmatrix}^{-1} \begin{bmatrix} 0 \\ 0 \\ 5 \\ 2 \end{bmatrix} = \begin{bmatrix} 2.30 \\ 1.33 \\ -2.70 \\ -2.42 \end{bmatrix} \]
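The four-by-four linear system can be solved with a few lines of elimination. The sketch below uses a small Gauss-Jordan routine in plain Python (the helper name `solve_linear` is an assumption) and the two-decimal coefficients quoted above, so its output can differ from the quoted solution in the second decimal place.

```python
def solve_linear(M, b):
    """Gauss-Jordan elimination with partial pivoting for a square system."""
    n = len(M)
    aug = [[float(v) for v in row] + [float(rhs)] for row, rhs in zip(M, b)]
    for col in range(n):
        # Bring the largest remaining entry of this column onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


# Unknowns ordered as [x1(2), x2(2), c3, c4]; coefficients rounded to 2 decimals.
M = [[1, 0, -1.63, 2.76],
     [0, 1, -2.76, 3.63],
     [1, 0, -1.0, 0.0],
     [0, 1, 6.39, -7.39]]
rhs = [0, 0, 5, 2]

x1_f, x2_f, c3, c4 = solve_linear(M, rhs)
print(round(x1_f, 2), round(x2_f, 2), round(c3, 2), round(c4, 2))
```

Using unrounded entries of $e^{2A}$ instead of the two-decimal values would tighten the agreement with the tabulated solution.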

(23)

Example

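• Solution for State and Costate

\[ \begin{bmatrix} x_1(t) \\ x_2(t) \\ \lambda_1(t) \\ \lambda_2(t) \end{bmatrix} = e^{At} \begin{bmatrix} 0 \\ 0 \\ -2.70 \\ -2.42 \end{bmatrix}, \qquad \text{where } A = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & -1 & 0 & -1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & -1 & 1 \end{bmatrix} \]

• Solution for Optimal Control

\[ u(t) = -\lambda_2(t) \]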

(24)

Example – 2: Double Integrator Problem

(Relevance: Satellite Attitude Control Problem)

Dr. Radhakant Padhi

Asst. Professor

Dept. of Aerospace Engineering

Indian Institute of Science - Bangalore

(25)

Double Integrator Problem

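[Figure: block diagram of the double integrator, $u = \dot{x}_2 \to \int \to x_2 = \dot{x}_1 \to \int \to x_1 = y$]

Consider a double integrator problem as shown in the above figure. Find $u(t)$ such that the system initial values $X(0) = \begin{bmatrix} 10 & 0 \end{bmatrix}^T$ are driven to the origin by minimizing

\[ J = t_f^2 + \frac{1}{2}\int_0^{t_f} u^2\, dt \]

Note: (1) $t_f$: unspecified (2) …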

(26)

Double Integrator Problem

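System dynamics

\[ \dot{X} = \begin{bmatrix} \dot{x}_1 \\ \dot{x}_2 \end{bmatrix} = \underbrace{\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}}_{A} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} + \underbrace{\begin{bmatrix} 0 \\ 1 \end{bmatrix}}_{B} u = AX + BU \]

\[ y = \underbrace{\begin{bmatrix} 1 & 0 \end{bmatrix}}_{C} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = CX \qquad \text{(not required)} \]

Boundary Condition

\[ X(0) = \begin{bmatrix} 10 \\ 0 \end{bmatrix}, \qquad X(t_f) = \begin{bmatrix} 0 \\ 0 \end{bmatrix} \]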

(27)

Double Integrator Problem

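Controllability Matrix

\[ M = \begin{bmatrix} B & AB \end{bmatrix} = \left[ \begin{bmatrix} 0 \\ 1 \end{bmatrix} \ \ \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} \begin{bmatrix} 0 \\ 1 \end{bmatrix} \right] = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} \]

\[ |M| = -1 \neq 0 \]

Hence, the system is controllable.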
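The rank test is small enough to reproduce in a few lines. This sketch builds $M = [B \ \ AB]$ for the double integrator with plain Python lists and evaluates the $2 \times 2$ determinant directly; the variable names are chosen for this example only.

```python
# Double integrator: x1_dot = x2, x2_dot = u.
A = [[0, 1],
     [0, 0]]
B = [0, 1]

# AB is the matrix-vector product A @ B.
AB = [sum(A[i][k] * B[k] for k in range(2)) for i in range(2)]

# Controllability matrix M = [B  AB], with B and AB as columns.
M = [[B[0], AB[0]],
     [B[1], AB[1]]]

det_M = M[0][0] * M[1][1] - M[0][1] * M[1][0]
print(M, det_M)
```

A nonzero determinant means $M$ has full rank 2, which is the controllability condition for this two-state system.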

(28)

Necessary Conditions of Optimality

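\[ H = \frac{1}{2} u^2 + \lambda^T (AX + Bu) \]

(1) State Eq:

\[ \dot{X} = \frac{\partial H}{\partial \lambda} = AX + Bu \]

(2) Optimal Control Eq:

\[ \frac{\partial H}{\partial u} = 0 \ \Rightarrow\ u + B^T \lambda = 0 \ \Rightarrow\ u = -B^T \lambda \]

(3) Costate Eq:

\[ \dot{\lambda} = -\frac{\partial H}{\partial X} = -A^T \lambda \]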

(29)

Optimal Control Solution

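\[ \begin{bmatrix} \dot{\lambda}_1 \\ \dot{\lambda}_2 \end{bmatrix} = -A^T \lambda = -\begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} \lambda_1 \\ \lambda_2 \end{bmatrix} = \begin{bmatrix} 0 \\ -\lambda_1 \end{bmatrix} \]

\[ \dot{\lambda}_1 = 0 \ \Rightarrow\ \lambda_1 = c_1 \]

\[ \dot{\lambda}_2 = -\lambda_1 = -c_1 \ \Rightarrow\ \lambda_2 = -c_1 t + c_2 \]

\[ u = -B^T \lambda = -\lambda_2 = c_1 t - c_2 \]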

(30)

Optimal State Solution

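\[ \begin{bmatrix} \dot{x}_1 \\ \dot{x}_2 \end{bmatrix} = \begin{bmatrix} x_2 \\ u \end{bmatrix} \]

However, $u = c_1 t - c_2$. Hence

\[ x_2 = c_1 \frac{t^2}{2} - c_2 t + c_3 \]

\[ x_1 = \int x_2\, dt = c_1 \frac{t^3}{6} - c_2 \frac{t^2}{2} + c_3 t + c_4 \]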

(31)

Optimal State Solution

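Using the B.C. at $t = 0$:

\[ X(0) = \begin{bmatrix} x_1(0) \\ x_2(0) \end{bmatrix} = \begin{bmatrix} c_4 \\ c_3 \end{bmatrix} = \begin{bmatrix} 10 \\ 0 \end{bmatrix} \]

\[ \therefore\ \begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} = \begin{bmatrix} c_1 \dfrac{t^3}{6} - c_2 \dfrac{t^2}{2} + 10 \\[2mm] c_1 \dfrac{t^2}{2} - c_2 t \end{bmatrix} \]

Using the B.C. at $t = t_f$:

\[ \begin{bmatrix} x_1(t_f) \\ x_2(t_f) \end{bmatrix} = \begin{bmatrix} c_1 \dfrac{t_f^3}{6} - c_2 \dfrac{t_f^2}{2} + 10 \\[2mm] c_1 \dfrac{t_f^2}{2} - c_2 t_f \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix} \]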

(32)

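Transversality Conditions (t_f: free)

\[ \frac{\partial \phi}{\partial t_f} = -H_{t_f} \]

\[
\begin{aligned}
2 t_f &= -\left[ \frac{1}{2} u^2 + \lambda^T (AX + Bu) \right]_{t_f} \\
&= -\left[ \frac{1}{2} u^2 + \begin{bmatrix} \lambda_1 & \lambda_2 \end{bmatrix} \begin{bmatrix} x_2 \\ u \end{bmatrix} \right]_{t_f} \\
&= -\left[ \frac{1}{2} (c_1 t_f - c_2)^2 + c_1 \underbrace{x_2(t_f)}_{0\ \text{(B.C.)}} - (c_1 t_f - c_2)^2 \right] \\
&= \frac{1}{2} (c_1 t_f - c_2)^2
\end{aligned}
\]

\[ 4 t_f = c_1^2 t_f^2 - 2 c_1 c_2 t_f + c_2^2 \]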

(33)

Transversality Conditions (t_f: free)

In summary, we have to solve for $c_1$, $c_2$ and $t_f$ from:

\[ c_1 t_f^3 - 3 c_2 t_f^2 + 60 = 0 \]

\[ c_1 t_f - 2 c_2 = 0 \]

\[ c_1^2 t_f^2 - 2 c_1 c_2 t_f + c_2^2 - 4 t_f = 0 \]

At this point, one can solve for $c_1$, $c_2$ from the first two equations in terms of $t_f$ and substitute them in the third equation. Then the resulting nonlinear equation in $t_f$ can be solved (preferably in closed form). However, one must discard unrealistic solutions (e.g. $t_f \le 0$ is unrealistic).

One may use nu…
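The substitution described above can be carried out by hand and then checked numerically. From $c_1 t_f - 2 c_2 = 0$ and $c_1 t_f^3 - 3 c_2 t_f^2 + 60 = 0$ one gets $c_1 = 120 / t_f^3$ and $c_2 = 60 / t_f^2$; the third equation then reduces to $c_2^2 = 4 t_f$, that is $t_f^5 = 900$. This reduction is worked out here as an illustration, and the sketch below simply evaluates it and confirms that all three equations are satisfied.

```python
# Closed-form reduction: tf^5 = 900, taking the only real positive root.
tf = 900.0 ** 0.2
c1 = 120.0 / tf ** 3
c2 = 60.0 / tf ** 2

# Residuals of the three original equations (should all be ~0).
residuals = (
    c1 * tf ** 3 - 3.0 * c2 * tf ** 2 + 60.0,
    c1 * tf - 2.0 * c2,
    c1 ** 2 * tf ** 2 - 2.0 * c1 * c2 * tf + c2 ** 2 - 4.0 * tf,
)

print(round(c1, 3), round(c2, 3), round(tf, 3))
print(residuals)
```

The values come out near $c_1 \approx 2.03$, $c_2 \approx 3.95$ and $t_f \approx 3.90$; small differences in the third decimal place depend on where intermediate results are rounded.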

(34)

Transversality Conditions (t_f: free)

Finally,

\[ \begin{bmatrix} c_1 \\ c_2 \end{bmatrix} = \begin{bmatrix} 2.025 \\ 3.95 \end{bmatrix}, \qquad t_f = c_2^2 / 4 = 3.901 \]

Hence, the optimal solution is given by:

\[ u = c_1 t - c_2 = 2.025\, t - 3.95 \]

and it will take $t_f = 3.901$ time units to reach $X(t_f) = \begin{bmatrix} 0 & 0 \end{bmatrix}^T$ starting from $X(0) = \begin{bmatrix} 10 & 0 \end{bmatrix}^T$.

Note: (1) It is an open-loop control law
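Because the control is open-loop, it can be checked by simply integrating the double integrator forward under $u(t) = c_1 t - c_2$. The sketch below does this with a classical fourth-order Runge-Kutta step, using the rounded constants $c_1 = 2.025$, $c_2 = 3.95$, $t_f = 3.901$; the function name `simulate` and the step count are assumptions for this example. With rounded constants the final state is expected to land near, not exactly at, the origin.

```python
def simulate(c1, c2, tf, x0=(10.0, 0.0), n=1000):
    """Integrate x1_dot = x2, x2_dot = c1*t - c2 from t = 0 to tf with RK4."""
    def f(t, x1, x2):
        return x2, c1 * t - c2

    dt = tf / n
    t, x1, x2 = 0.0, x0[0], x0[1]
    for _ in range(n):
        k1 = f(t, x1, x2)
        k2 = f(t + dt / 2, x1 + dt / 2 * k1[0], x2 + dt / 2 * k1[1])
        k3 = f(t + dt / 2, x1 + dt / 2 * k2[0], x2 + dt / 2 * k2[1])
        k4 = f(t + dt, x1 + dt * k3[0], x2 + dt * k3[1])
        x1 += dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        x2 += dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t += dt
    return x1, x2


x1_f, x2_f = simulate(2.025, 3.95, 3.901)
print(x1_f, x2_f)
```

Any disturbance or modelling error would not be corrected by this law, since $u$ depends only on time and not on the measured state.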
