Semi-smooth Newton methods for time-optimal control for a class of odes

(1)

Time-Optimal Control for a Class of ODEs

Kazufumi Ito 1

Karl Kunish 2

February 12, 2010

1

Departmentof Mathematis,North CarolinaStateUniversity,Raleigh,North

Carolina, 27695-8205, USA; researh partially supported by the Army Researh

OÆe under DAAD19-02-1-0394 and the Air Fore OÆe of Sienti Researh

under FA9550-09-1-0226

2

Institutfur Mathematik,Karl-Franzens-Universitat Graz, A-8010 Graz,

Aus-tria, supported in part by the Fonds zur Forderung der wissenshaftlihen

Forshung under SFB 32, "Mathematial Optimization and Appliations in

(2)

Time optimalontrol problems for a lass of linear multi-inputsystems are

onsidered. Theproblems are regularizedand theasymptotiand monotone

behavior of the regularisation proedureis investigated. For the regularised

problems the appliabilityofsemi-smoothNewton methodsisveried. First

numerialtestsarepresented whihshowthattheproposedapproah,

dier-entlyfromothermethods,doesnotrelya-prioryinformationoftheswithing

(3)

This paper addresses time optimal ontrol for a lass of linear multi-input

ontrols systems for ordinary dierential equations. Due to their pratial

relevane and inherent strutural diÆulties, time optimal ontrol has been

reeiving a onsiderable amount of attention for deades. Muh of the

lit-erature up to the late sixties is overed in [HL℄. Many reent results an

be found or are referened in [BPW , KLM, MO℄. Time optimal ontrol for

innite dimensionalsystems isonsidered in [Fa℄, for example.

The optimalitysystem assoiatedto time optimalontrolproblems with

pointwiseonstraintsontheontrolsisompliatedduetolakofsmoothness

of the optimal ontrols. In fat, the rst order optimality system for time

optimalontrolproblemsontainsamultivaluedoperationwhihimpedesthe

use of fastnumerialmethods. Forthis reasonwe introduea regularization

to the time optimal problem. In setion 2 the behavior of the solutions of

the regularized problems as the regularization parameter " tends to zero is

investigated. In partiularmonotoni struture ofthe solutionswith respet

to " is shown. An optimality system for the regularized problems is derived

under a ondition whih is stronger than ontrollability and weaker than

normality. TheoptimalontrolsoftheregularizedproblemsareW 1;1

regular

and onverge to a minimum norm solution of the original problem as the

regularization parametertends to zero.

The optimalitysystem of the regularized problems is stillnot C 1

sothat

seondordermethodswithloalquadrationvergeneorderarenotdiretly

appliable. However, suÆientonditionswillbeobtainedinsetion3whih

imply that semi-smooth Newton methods [IK2℄ are wellposed and loally

superlinearly onvergent.

Setion 4 ontains a brief desription of numerial results. We ompare

the hosen regularizationto an alternative one, whih has stronger

regular-ization properties. Sine the optimal ontrols of the original time optimal

problems are typially not ontinuous, it appears that our hoie of

regu-larization whih leads to W 1;1

regularized ontrols is preferable over other

regularizationstrategieswhihprovidesmootherontrols. More detailed

nu-merial tests are availablein [XK℄.

Let us note that the approah that we propose for solving time optimal

problemsdeviatesfromtraditionalapproahes, whiharefrequentlygrouped

intodiret andindiretmethods. Indiretmethodsbasedonmultiple

(4)

unknowns,inludingthe swithingfuntion,theshootingmethodisreported

to onverge fast and to generate very aurate solutions. The methods that

weproposealsooriginatesfromtherst orderondition,butdierentlyfrom

the shooting methoditdoes not requireaurate informationonthe

swith-ingstruture inadvane.

Diretmethods on the other hand, onsider time optimal problems as a

genuinenonlinear programmingproblems. Theyare usedinseveral variants,

whihfrequently involvereparametrization ofthe ontrols asthe unknowns.

Thenewunknownsanbetheswithingtimesasin[MB℄ortheardurations

as in[KN℄.

2 The time-optimal problem and its

regularization

Consider the time-optimalontrolproblemfor the linearmulti-inputsystem

(P) 8

>

<

>

: min

0 R

0 dt

subjet to

d

dt

x(t)=Ax(t)+Bu(t); ju(t)j

` 1

1; x(0)=x

0

;x()=x

1 ;

where A 2 R nn

; B 2 R nm

; x

0 2 R

n

; x

1 2 R

n

are given, u(t) 2 R m

, u is

measurable, and jj

`

1 denotes the innity-norm on R m

. The olumns of B

are denoted by b

i

. It is assumed that x

1

an be reahed in nite time by an

admissible ontrol. Then (P) admits a solution with optimal time denoted

by

, and assoiated state x

and ontrolu

.

The rst order optimality system for (P) an be expressed in terms of

the adjointp and the Hamiltonian

H(x;u;p

0

;p)=p

0 +p

T

(5)

(2.1)

8

>

<

>

: _

x=Ax+Bu; x(0) =x

0

; x()=x

1 ;

_ p=A

T

p;

u=argmin

jvj

` 11

H(x;v;p

0

;p); a.e. in(0;);

p

0

+p(t) T

(Ax(t)+Bu(t))=0; p

0 0;

where the supersript T denotes transposition,see e.g. [MS ℄, hapter V, pg.

109, 110. Further p is not identially 0, so that there exists a nontrivial

vetor q2R n

suh that

p(t)=exp(A T

( t))q:

Due to the speial struture of H the optimal ontrol an be expressed

as

(2.2) u

i

= (b

T

i

p)= b T

i

exp( A T

( t))q

;

where denotes the oordinate-wise operation

(s) 2 8

>

<

>

:

1 if s <0

[ 1;1℄ if s =0

1 if s >0: (2.3)

The last equation in (2.1) holds everywhere rather than a.e. on [0;℄. In

fat, p and xare ontinuous and p(t) T

Bu(t)= P

m

i=1 jp(t)

T

b

i j.

Let us reall the notions of ontrollability and normality, whih will be

referred to below.

(2.4)

(

The pair (A;B)is alled ontrollable if

rank fB;AB;:::;A n 1

Bg=n

(2.5)

(

The pair (A;B)is allednormal if (A;b

i )

isontrollable for allolumnsb

i of B:

Normality of (A;B) implies ontrollability. Moreover, if (A;B) is normal,

then the optimalontrolu

to (P)is unique, it is bang bang, and pieewise

(6)

(2.6) p

0 >0

is referred to as strit transversality. In this ase it an be assumed that

p

0

=1,whihanbeahievedbysalingq. Ifstrittransversalityholdsthen

(x

;u

;

) is a strit loal minimum, in the sense that there exists Æ > 0

suh that x

1

is not inthe attainable set for t2(

Æ;

), [HL℄, pg.89.

With(2.6) holding,we an express the optimality ondition as

(2.7)

8

>

<

>

: _

x=Ax+Bu; x(0) =x

0

; x() =x

1 ;

_ p=A

T

p;

u=argmin

jvj

` 11

H(x;v;p); a.e. in(0;);

1+p() T

(Ax()+Bu())=0:

Here we eliminatethe variable p

0

fromthe notation forH sine it wasxed

to be 1.

Introduing the transformation ^

t= t

and setting

^ x(

^

t)=x( ^

t )=x(t); p(^ ^

t )=p( ^

t)=p(t); u(^ ^

t ) =u( ^

t)=u(t);

we obtain the following equivalent system to (2.7), where for the ease of

presentation we omit the supersripts ^

:

(2.8)

8

>

<

>

: _

x=(Ax+Bu); x(0)=x

0

; x(1) =x

1 ;

_ p=A

T

p;

u=argmin

jvj

` 11

H(x;v;p); a.e. in(0;);

1+p(1) T

(Ax(1)+Bu(1))=0:

The non-dierentiable operation involved in haraterizing the optimal

ontrol,

u= (B T

p);

ompare (2.2), prohibits the use of Newton-type methods for solving (2.8)

(7)

(P " ) 8 > > > < > > > : min 0 R 0 (1+ " 2 ju(t)j 2 )dt subjet to d dt

x(t)=Ax(t)+Bu(t); ju(t)j

` 1

1; x(0)=x

0

; x()=x

1 ;

with " >0 is onsidered. The norm jj used in the ost-funtional denotes

theEulidean norm. Itisstraightforwardtoarguetheexisteneofasolution

(u " ;x " ; " ).

Convergeneofthesolutions(x

" ; p " ;u " ; " )of(P

"

)toasolution(x ;p ; u ; )

of (P) is onsiderednext. Notethat

isunique.

Proposition 2.1. For every 0 < "

0 < "

1

and any solution (

; u

) of (P)

we have (2.9) " 0 " 1 (1+ " 1 2 ); (2.10) ju " 1 j L 2 (0; " 1 ) ju " 0 j L 2 (0; " 0 ) ju j L 2 (0; ) : If u

is a bang-bang solution, then

(2.11) 0ju j 2 L 2 (0; ) ju " j 2 L 2 (0;")

meas ft2[0;

℄:ju

"

(t)j<1g

for every ">0.

Proof. From the denition of and " wehave "

for every ">0;

and " + " 2 Z " 0 ju " j 2 dt + " 2 Z 0 ju j 2 dt; hene ju " j L 2 (0;") ju j L 2 (0; ) ; and " (1+ " 2 ):

For 0<"

(8)

" 0 " 0 " 0 1 2 (" 1 " 0 ) R " 1 0 ju "1 j 2

dt on both sides implies that

(2.12) "0 + " 1 2 Z " 1 0 ju "1 j 2 dt+ " 0 2 Z " 0 0 ju "0 j 2 dt Z " 1 0 ju "1 j 2 dt Z " 1 0 (1+ " 1 2 ju " 1 j 2 )dt " 0 + " 1 2 Z " 0 0 ju " 0 j 2 dt:

Estimatingthe rst by the last expression in (2.12)implies that

" 1 Z " 1 0 ju " 1 j 2 dt Z " 0 0 ju " 0 j 2 dt " 0 Z " 1 0 ju " 1 j 2 dt Z " 0 0 ju " 0 j 2 dt ; and hene (2.13) ju "1 j L 2 (0; " 1 ) ju "0 j L 2 (0; " 0 ) :

Estimatingthe rst by the seond expression in(2.12) we obtain

" 0 + " 0 2 ju " 0 j 2 L 2 (0; " 0 ) " 1 + " 0 2 ju " 1 j 2 L 2 (0; " 1 )

and by (2.13)

" 0 " 1 :

These estimates imply (2.9) and (2.10).

Ifu

is bang-bang,then

0ju j 2 L 2 (0; ) ju j 2 L 2 (0;") R ft2(0; ):ju"(t)j<1g (1 ju " (t)j 2

)dt measft 2(0;

):ju

"

(t)j<1g;

so that (2.11) holds.

Theorem 2.1. For " ! 0 +

we have

"

!

and every onvergent

subse-queneofsolutionsf(u

" ; x " )g ">0 to(P "

)onvergesinL 2 (0; ;R m )W 1;2 (0; ; R n )

to a solution (u

; x

) of (P), where u

(9)

Here onvergene of u

"

to u isdened as

Z 1 0 ju " ( " t) u ( t)j 2

dt !0

and analogously forfx

"

g, and for weak onvergene.

Proof. The rst laim follows from Proposition 2.1. Sine fu

" ( " )g ">0 and fx " ( " )g ">0

areboundedinL 2

(0; 1;R m

)andW 1;2

(0; 1;R n

),thereexistweak

aumulation points u 2 L 2 (0; ;R m

); andx 2 W 1;2 (0; ; R n ).

Subse-quently we avoid subsequential indies. Passing to the limit in x_

" ( " ) = " (Ax " ( "

)+Bu

" (

"

)) and x

"

(0) = x

0 ; x

" (

" ) = x

1

it follows that x

is

admissible. Due toweaklosedness of fu2L 2

(0; 1;R m

): ju(x)j

` 1

1a:e:g

we have that u

isadmissibleas well. Sine

lim "!0 + " + " 2 Z " 0 ju " j 2 dt= ;

the triple (

; u

; x

)is optimalfor(P). By Proposition 2.1andweaklower

semi-ontinuity of norms

(2.14) lim "!0 sup ju " j L 2 (0; " ) ju j L 2 (0; ) lim "!0 inf ju " j L 2 (0; " )

and hene lim

"!0 ju " j L 2 (0;";R m ) = ju j L 2 (0; ;R m )

. As a onsequene u

" and

x

"

onverge strongly in L 2

(0;

"

),respetively W 1;2

(0;

" ;R

n

), tou

and x

.

Letu^denoteanother optimalontrolfor(P)withj^uj<ju

j. Thenby (2.10)

and (2.14) lim "!0 sup ju " j L 2 (0;";R m ) j^uj L 2 (0; ;R m ) <ju j L 2 (0; ;R m ) lim "!0 inf ju " j L 2 (0;";R m ) ;

whihis aontradition. Consequently (P)has a minimalnorm ontroland

the laimedstrong onvergene properties hold.

Corollary 2.1. If (2.5) holds, then the solution u

to (P) is unique, it is

bang-bang, and u

" !u

in L 2

as "!0 +

.

Proof. (2.5) implies that the solution to (P) is unique and it is bang-bang.

(10)

"

" (s) 2

8

>

<

>

:

1 if s "

s

"

if jsj< "

1 if s ": (2.15)

If

"

is appliedto avetor, then it ats oordinate-wise.

We shall use aontrollabilityassumption whih isstronger than

ontrol-lability and weaker than normality.

(H1) Thereexists i

suh that (A;b

i )

is ontrollable:

Theorem 2.2. Assume that (H1) holds and let (x

" ;u

" ;

"

) be a solution of

(P

"

). If there exist >0 and an interval I

i

(0;1)suh that

(2.16) j(^u

" )

i

(t)j

` 1

1 for a.e. t 2I

i ;

then there exists an adjoint state p

"

suh that

(2.17)

8

>

<

>

: _ x

" =Ax

" +Bu

" ; x

"

(0)=x

0 ; x

" (

" )=x

1

_ p

" =A

T

p

"

u

"

=

" (B

T

p

" )

1+

2 ju

" (

" )j

2

R m

+p

" (

" )

T

(Ax

" (

"

)+Bu

" (

"

))=0:

Proof. We use a Lagrange multiplier argument for the reparameterized

for-mulationof (P

"

) whih isgiven by

(2.18)

8

>

<

>

: min

0 R

1

0 ( +

"

2 j^u(t)j

2

)dt

subjet to

d

dt ^

x (t)=(A^x (t)+Bu(t));^ u^2C; x (0)^ =x

0

; x(1)^ =x

1 ;

where C = f^u 2 L 2

(0;1;R m

) : juj^

` 1

1g. Here (^u;) are treated as

(11)

0

onstraint e(^u;)=x (1)^ x

1

=0. The resulting Lagrangianis

L(^u;;

0 )=

Z

1

0 ( +

"

2 j^u(t)j

2

)dt+ T

0

(^x (1) x

1 );

where x (1)^ isdened through the dierentialequation and the initial

ondi-tion.

We now argue that e : C R L 2

(0;1;R m

)R ! R n

satises the

regular point ondition in the sense of Maurer-Zowe [MZ, IK2℄. Thus we

have toverify that

(2.19) 02intfe

0

(^u

" ;

"

) (C u^

" )R

g;

where e 0

(^u

" ;

"

) denotes the linearisation of e at(^u

" (

" );

" ).

Consideringe 0

(^u

" ;

"

) indiretions Æ =0and Æu satisfying(Æu)

i

=0for

i6=i

and (Æu)

i

=0 in(0;1)n( ;+Æ),with ( ;+Æ):=I

i

, we nd

e 0

(^u

" ;

"

)(Æu u^

" ;0)=

R

+Æ

e

A(1 t)

b

i

(Æu(t) u^

" (t))

i dt

= R

Æ

0 e

A(Æ t)

~

b

i

(Æu(t+ ) u^

"

(t+ ))

i dt;

where ~

b

i

=e

A(1 Æ )

b

i .

Then

e 0

(^u

" ;

"

)(Æu u^

" ;0)=

Z

Æ

0 e

A(Æ t)

~

b

i (Æ~

u(t) u~

" (t))

i

dt;

where Æu~

i (t)

=Æu

i (t

+ );u~

";i (t)

=u^

";i (t

+ ). Note that by (2.16)

f(Æu~ u~

" )

i

:[0;Æ℄!R 1

j j(Æu)~

i j

1gS :=fv :[0;Æ℄!R 1

; jvj

2 g:

Observe that ontrollabilityof (A;b

i

) impliesthat (A; ~

b

i

) isontrollableas

well. Controllabilityof the singleinput system (A; ~

b

i )

implies that

(2.20) 02intf

Z

Æ

0 e

A(Æ t)

~

b

i v

dtjv 2Sg:

In fattheset ontherightof (2.20) ontains 0andithas nonemptyinterior,

see e.g. [LM℄, page 77, 133. Moreover, if 0 was a boundary point of this

set, then the orresponding ontrol v = 0 is an extremal ontrol, whih is

(12)

arity properties (2.21) L (^u " ; " ; 0 )=0;

L u (^u " ; " ; 0

)(Æu u^

"

)0for allÆu with jÆuj

` 1

1:

From the seondproperty in (2.21)we have,

Z

1

0 ("u^

" +B T e A T (1 t) 0

)(Æu u^

"

)dt0:

Setting

(2.22) p(t)=e

A T

t

q with q=e A T 0 ; this implies Z 1 0 ("^u " +B T ^ p "

)(Æu u^

"

)dt 0;

for all Æu as in (2.21). The seond and third laim in (2.17) follow with

p

"

(t)=p^

" (

1

t).

Exploitingthe rst property in (2.21) impliesthat

L (^u " ; " ; 0

)=1+ " 2 R 1 0 j^u " j 2 dt + T 0 Ae A x 0 + R 1 0 e A(1 t)

Bu^

"

(t)dt+ R

1

0

A(1 t)e A(1 t)

Bu^

" (t)dt

=1+ " 2 R 1 0 j^u " j 2 dt+ T 0 (Ae

((1 t)+t)A

x 0 + R 1 0 e A(1 t)

Bu^

"

(t)dt+ R 1 0 Ae A(1 t) R t 0 e A(t s)

Bu^

"

(s))dsdt=0:

This implies (2.23) L (^u " ; " ; 0

)=1+ " 2 Z 1 0 j^u " j 2 dt+ Z 1 0 p T

(t) Ax^

"

(t)+Bu^

" (t)

dt =0:

From u

" = " (B T p "

) weonlude that u

" 2W 1;1 (0; " ;R m ).

We introdue the Hamiltonianfor (P

" )as

H

"

(x;u;p)=1+ " 2 juj 2 R m +p T

(13)

on (0;1)

d

dt H

" (^x

" ;u^

" ;p^

" )="^u

T

" d

dt ^ u

" +

1

" d

dt ^ p T

" d

dt ^ x

" +p^

T

" A

d

dt ^ x

" +p^

T

" B

d

dt ^ u

"

=("^u

" +B

T

^ p

" )

T d

dt ^ u

" =0:

Combined with (2.23)this impliesthat

1+ "

2 ju^

" j

2

R m

+p^ T

" (A^x

" +Bu^

"

)=0on [0;1℄:

This implies the laim.

The proof revealed extra regularity of u

" :

Corollary2.2. UndertheassumptionsofTheorem2.2wehaveu

" 2W

1;1

(0;

" ;R

m

).

Remark 2.1. Condition (2.16) requires that the modulus of atleast one of

the oordinates of u^

"

is not almost everywhere equalto1. One it isknown

from Corollary 2.2 that u^

"

is ontinuous this amounts to requiring that at

least one of the oordinates of u

swithes from1 to 1 or vieversa.

UndertheassumptionsofTheorem2.2therstorderneessaryoptimality

ondition for (P

"

) afterthe transformation t! t

isgiven by

(2.24)

8

>

<

>

: _

x=(Ax+Bu); x(0)=x

0

; x(1)=x

1

_ p=A

T

p

u=

" (B

T

p)

1+

2 ju(1)j

2

+p(1) T

(Ax(1)+Bu(1))=0;

where for onveniene of notationthe dependene on " and the supersript

hat were dropped.

Inthefollowingsetionweshallinvestigatesemi-smoothNewtonmethods

for solving (2.24).

Welose this setionwith asimpleexample whihillustratessomeof the

(14)

simple ontrolsystem

(

_ x

1 =u

1

_ x

2 =u

2 ;

with so that A is the zero, and B the identity matrix, with initialondition

(1; 1

2

)and terminalonditionthe origin. Thissystem isontrollablebut itis

not normal. The optimal time is

= 1, the rst oordinate of an optimal

ontrol is uniquely determined u

1

= 1 , with assoiate state x

1

= 1 t.

There are innitely many hoies for optimalsolutions u

2

of bang-bang and

non bang-bang type. The assoiated onstant adjoints are (p

1 ;p

2

) = (1;0).

They satisfy

u= (p)2

1

[ 1;1℄

:

The transversality ondition 1+p T

Bu=0 issatised.

For the regularized problem we nd

"

= 1. Dierently from the

un-regularized problem the solution to the regularized problem is unique. The

optimal ontrol and trajetory are given by

(u

1 ;u

2

)=( 1; :5) with (x

1 ;x

2

)=(1 t;:5(1 t)):

In this partiular example the solution of the regularized problemdoes not

depend on ". Note that this solution is also one of the minimum norm

solutions of the unregularized problem. The adjoint is p

"

= (1+ 3"

8 ;

"

2 ). It

satises

u=

" (p

T

" )=

1

2

and the transversality ondition 1+ "

2 juj

2

+p T

"

Bu=0:

3 Semi-smooth Newton method

In this setion the semi-smooth Newton method for solving the regularized

optimalitysystem (2.24) is desribed and analyzed. It willallowthat (2.24)

an besolved eÆiently inspite of the fat that

"

(15)

(x

" ; u

" ;

"

) 2 W 1;2

(0;1) L 2

(0;1)R a solution to (P

"

) with assoiated

adjoint p

" 2W

1;2

(0; 1). It is assumed that

(H2) there exists s2(0;1) suhthat j 1

" b

T

i

p

"

(s)j=j(u

" )

i

(s)j<1;

and

(H3) jb

T

i p

"

(1)j6="; for alli=1;:::;m:

Assumption (H2)orresponds to(2.16),wherewe nowuse thefat thatasa

onsequeneofTheorem2.2theontrolu

"

isontinuous. With(H2)and(H3)

holdingthereexists aneighborhoodU

p" ofp

" inW

1;2

(0;1;R n

);

t2(0;1),and

a nontrivialinterval( ;+Æ)(0;1)suh that for p2U

p"

we have

jb T

i

p(t)j6=" for allt 2[

t; 1℄; and i=1;:::;m

and

(3.1) jb

T

i

p(t)j<" for t2( ;+Æ):

We set U =fu 2 L 2

(0;1;R m

) :uj[

t; 1℄ 2 W 1;2

(

t;1;R m

)g endowed with the

norm

juj

U

=(juj 2

L 2

(0;1) +juj_

2

L 2

(

t ;1) )

1

2

;

and introdue

F :D

F

X !L 2

(0;1;R n

)L 2

(0;1;R n

)U R n

R

where

D

F =W

1;2

(0;1)U

p"

U R;

X =W 1;2

(0;1;R n

)W 1;2

(0;1;R n

)U R;

and

(3.2) F(x;p; u; )= 0

B

_

x Ax Bu

_ p A

T

p

u+

" (B

T

p)

x(1) x

1

1+ "

2 ju(1)j

2

+p(1) T

(Ax(1)+Bu(1)) 1

C

(16)

1 5 1 2

F

3

. For F

4 ;F

5

it follows from the fat that W 1;2

(0;1) embeds ontinuously

into C(0;1). Morevover F(x

" ;p

" ; u

" ;

"

) = 0: We shall keep x

"

(0) = x

0 as

an expliit onstraint.

Remark 3.1. The need for introduing U in suh a way that its elements

are more regular at 1 is due to the fat that we use here the point-wise

transversality ondition rather than the integrated form (2.23). - Condition

(H3)willbeneeded toprovesuperlinearonvergeneoftheNewtoniteration.

ApplyingNewton'smethodtoF =0isimpededbythenon-dierentiability

of

"

. We use

G

" (s):=

(

1

"

if jsj< "

0 if jsj " (3.3)

as a generalized derivative and argue that the resultingNewton iterationis

semi-smooth and hene loallysuperlinearly onvergent. The Newton

itera-tion step isgiven by

(3.4) DF(x;p; u; )(Æx; Æp;Æu; Æ)= F(x; p;u; )

whereÆx(0)=0andDF denotes the Frehet-derivativeinalltermsofF

ex-eptforp!

" (B

T

p),forwhihthegeneralizedderivativeistaken aording

to (3.3). Forfurther referene wegive the detailedform of (3.4):

(3.5)

8

>

<

>

: d

dt

Æx AÆx BÆu Æ(Ax+Bu)= F

1

; Æx(0)=0

d

dt

Æp A T

Æp ÆA

T

p= F

2

Æu+G

" (B

T

p)B T

Æp= F

3

Æx(1) = F

4

p(1) T

(AÆx(1)+BÆu(1))+Æp(1) T

(Ax(1)+Bu(1))

+"u(1) T

Æu(1)= F

5 ;

where the oordinatesof G

" (B

T

p)B T

Æpare given byG

" ((B

T

p)

i ))(B

T

Æp)

i .

Apossibleinitializationmayonsistinhoosing((u)

0 ;

0

),setting(x)

0 as

the linear interpolation between x

0

and x

1

, and determining (p)

0

suh that

(17)

whih are relevant for this paper. Let X and Z be Banah spaes and let

F :D

F

X !Z be a nonlinear mappingwith open domain D

F .

Denition3.1. ThemappingF :D

F

X !Z isalledNewton-dierentiable

on an open subset U D

F

, if for eah x 2 U there exists a generalized

derivative DF(x)2L(X; Z) and

(3.6) lim

h!0 1

jhj

X

jF(x+h) F(x) DF(x+h)hj

Z =0:

Theorem 3.1. Suppose thatx

2U isa solution to F(x)=0and that F is

Newton-dierentiableinanopensetU ontainingx

. IffurtherfkDF(x) 1

k:

x2 Ug is bounded, then the Newton-iteration x

k+1 =x

k

DF(x

k )

1

F(x

k )

onverges q-superlinearly tox

, provided thatjx

0 x

j

X

issuÆiently small.

For the statement and proof of superlinear onvergene of the time-optimal

ontrolproblem,some furthernotationisrequired. For(x; p; u;)2D

F we

dene A2R

(n+1) (n+1)

by

A= A

11 A

12

A

21 0

!

;

where

(3.7) A

11 ="

1

Z

1

0 e

A(1 t)

B

I B

T

e A

T

(1 t)

dt2R nn

;

(3.8)

A

12 ="

1

R

1

0 e

A(1 t)

B

I B

T R

1

t e

A T

(t s)

A T

p(s)dsdt

R

1

0 e

A(1 t)

(Ax+Bu)dt2R n

;

(3.9) A

21

=(Ax(1)+Bu(1)) T

(p T

(1)B+"u T

(1))G

" (B

T

p(1))B T

2(R n

) T

;

with

I

=diag(

I1

;:::;

Im

) and

I

i

the harateristi funtionof the set

I

i =I

i

(p)=ft:j(B T

p)

i j<

1

"

(18)

whih is nonempty for p 2 U

p

"

and i = i . The ontrollability assumption

(H1) together with (H2) imply that the symmetri matrix A

11

is invertible

with uniformly bounded inverse with respet to p 2 U

p"

and in ompat

subsets of (0;1). In fat,sine I

i

(p)( ; +Æ) weobtain forsome >0

A 11 =" 1 Z 1 0 e A(1 t) m X i=1 b i I i b T i e A T (1 t) dt " 1 Z +Æ e A(1 t) b i b T i e A T (1 t) dt =" 1 e A(1 ) Z Æ 0 e At b i b T i e A T t dte A T (1 )

dt>;

where we use that the ontrollabilityGramian

Z Æ 0 e At b i b T i e A T t dt;

is uniformlypositive denitefor inompat subsets of (0;1).

Forour analysis we shall utilize the fat that the Shur omplement

A 21 A 1 11 A 12 2R

of A for (x; p;u; ) in a neighborhood of (x

" ; p " ;u " ; "

) is nontrivial. If

A 21 A 1 11 A 12

6=0 at (x

" ; p " ; u " ; "

),weannotonludethatA

21 A 1 11 A 12 6=0

for (x;p; u; ) in a neighborhood of (x

" ; p " ; u " ; "

), sine, while with (H2)

holding, A

21

is ontinuous with respet to (x;p; u; ) 2 X, this is not the

ase for A 1

11

and A

12

due to the term

I

. We thereforeassume that

(H4) 8 > > > < > > > :

there exists a bounded neighborhood

U D

F

X of (x

" ; p " ; u " ; "

) and >0 suh that

jA 21 A 1 11 A 12

jfor all (x;p; u; )2U:

Theorem 3.2. If (H1){(H4) hold and (x

" ; u

" ;

"

) denotes a solution to

(P

"

) withassoiated adjoint p

"

, then thesemi-smooth Newton algorithm

on-verges superlinearly, provided that the initialization is suÆiently lose to

(19)

Lemma 3.1. Suppose that (H1) { (H4) hold. Then there exists a onstant

C suh that for every (x;p; u; )2U, and F 2L 2

(0; 1)L 2

(0;1)U R

DF(x; p; u; )(Æx; Æp;Æu; Æ)= F

admits a unique solution (Æx;Æp;Æu; Æ)2X and

(3.10) j(Æx; Æp;Æu;Æ)j

X

C jFj

L 2

UR :

Proof. Let (x;p; u;)2U and note that system (3.5) is equivalent to

(3.11)

8

>

<

>

:

Æx(t)= R

t

0 e

A(t s)

( F

1

+bÆu+Æ(Ax+Bu))ds

Æp(t)=e A

T

(1 t)

Æp(1)+ R

1

t e

A T

(t s)

(ÆA T

p F

2 )ds

Æu+G

" (B

T

p)B T

Æp= F

3

Æx(1) = F

4

p(1) T

( AF

4

+BÆu(1))+Æp(1) T

(Ax(1)+Bu(1))+"Æu(1)= F

5 :

On I

i

we have (G

" (B

T

p))

i = "

1

. The third equation in (3.11) an be

expressed as

(3.12) Æu= F

3 "

1

I B

T

Æp a.e. in (0;1):

Let usset

^

F

1 =

Z

1

0 e

A(1 t)

F

1 (t)dt;

^

F

2 ="

1

Z

1

0 Z

1

t e

A(1 t)

B

I B

T

e A

T

(t s)

F

2

(s)dsdt;

^

F

3

= B Z

1

0 e

A(1 t)

F

3 (t)dt:

From (3.12), and the rst and fourth equation in(3.11) we have

(3.13)

F

4

=Æx(1)

= " 1

R

1

0 e

A(1 t)

B

I B

T

Æpdt+Æ R

1

0 e

A(1 t)

(Ax+Bu)dt+ ^

F

1 +

^

F

(20)

F

4 = "

1

Z

1

0 e

A(1 t)

B

I B

T

e A

T

(1 t)

Æp(1)dt

" 1

Æ Z

1

0 e

A(1 t)

B

I B

T Z

1

t e

A T

(t s)

A T

p(s)dsdt+ ^

F

2

+Æ Z

1

0 e

A(1 t)

(Ax+Bu)dt+ ^

F

1 +

^

F

3 ;

whih involvesÆp(1) and Æ as unknowns, and an be expressed as

(3.14) A

11

Æp(1)+A

12 Æ =

^

F

1 +

^

F

2 +

^

F

3 +F

4 =:r

1 :

Eliminating Æu(1) from the last equation in (3.11) by means of the third

equation implies

(3.15) A

21

Æp(1)= F

5 +F

3 (1)

T

(B T

p(1)+"u(1))+p(1) T

AF

4 =:r

2 :

Combining(3.14)and(3.15)weobtainthefollowinglinearsystemfor(Æp(1); Æ):

(3.16) A

Æp(1)

Æ !

= r

1

r

2 !

:

By (H1),(H2), and (H4)its unique solutionis given by

(3.17) Æ =(A

21 A

1

11 A

12 )

1

(A

21 A

1

11 r

1 r

2

); Æp(1)=A 1

11 (r

1 A

12 Æ):

MoreoverthereexistsaonstantC =C(; jxj

C(0;1) ; jpj

C(0;1) ; juj

L 2

(0;1)

; ju(1)j);

suh that

jÆp(1)j+jÆjCjFj

L 2

UR :

From (3.5) and (3.11), C an alsobe hosen suh that

j(Æx; Æp; Æu; Æ)j

X

CjFj

L 2

(21)

Proof of Theorem 3.2. We apply Theorem 3.1 with x =(x

" ;p

" ; u

" ;

" ).

Lemma 3.1 implies the required uniform bound of the generalized inverses

DF in the neighborhoodU D

F of (x

" ; p

" ; u

" ;

"

). Therefore itsuÆes to

argueNewton-dierentiabilityofF inD

F

. Thisisobviousforalloordinates

of F exept for F

3

, and speially for the mapping

F :p!

" (B

T

p)

from U

p"

W

1;2

(0;1; R n

) ! U. Utilizing the denitions of U

p"

and

" it

suÆes to onsider the restrition of F from W 1;2

(0;

t; R n

) to L 2

(0;

t; R 1

)

whih weagain denote by F. Note that F an be deomposed as

F =F

3 ÆF

2 ÆF

1 ;

where

F

1 :W

1;2

(0;

t ; R n

)!W 1;2

(0;

t; R); F

2 :W

1;2

(0;

t; R) !L 4

(0;

t; R);

F

3 :L

4

(0;

t; R) !L 2

(0;

t; R);

are given by

F

1

(u)=B T

u; F

2

(v)=max( 1; v

" ); F

3

(v)=min(1;v):

In [HIK , IK2℄ it was shown that v ! max(0;v) is Newton dierentiable

from L p

() to L q

() if 1 p > q 1, if is a bounded domain. Sine

min(1;v)=1+min(0;v 1) this implies that F

3

and similarlythat F

2 are

Newtondierentiable. FromthehainruleforNewtondierentiablemapping

in [HK ℄ it follows that F

3 ÆF

2

is Newton dierentiable. The hain rule for

a linear mapping, here F

1

, followed by the Newton dierentiable mappings

F

3 ÆF

2

; [IK1℄, implies that F is Newton dierentiable in D

F

:

4 A numerial example

The semi-smooth Newton method is used to solve a lassial time optimal

problem related to the harmoni osillator with three swithing points. We

(22)

(4.1)

>

<

>

: min

0 R

0 dt

subjet to

d

dt

x(t)=Ax(t)+Bu(t); ju(t)j1;x(0) =x

0

;x()=x

1 ;

where

A=

0 1

1 0

; B =

0

1

; x

0 =

5

; x

1 =

0

:

The optimal minimal time for the ontinuous problem is known tobe

=

10:5871. Tosolve(4.1)numeriallyatimedisretizationbasedonthe Crank

Niolson method with equidistant grid points was applied to (3.5). The

initialisation for the state was hosen as a semiirleonneting x

0

and x

1 .

Thenu(1)washosentobeative,andpwashosensothatthetransversality

ondition and the adjoint equation hold. With respet to the hoie of the

parameter = 1

"

weutilized a ontinuation proedure, startingwith a small

value and inreasing it, using the solution from the smaller value of as

initialization for the next larger -value. Certainly this proedure an be

automated as has been done elsewhere, but this was not the fous of this

paper. In Table 1 we show the number of iterates of the Netwon iteration

(outer loop) that was required for this ontinuation proedure with respet

to . The Newton iterationwasstopped when the residualof the optimality

systemintheL 2

-normwasbelow10 8

. AlsoinTable1wedepittheoptimal

minimal times

(). These results are obtained for meshsize h = 1

32

. Then

1 5 10 20

No. ofiterations 8 8 4 7

FinalTime 11:26515 10:84455 10:82977 10:81781

Table 1

the results for = 1 are interpolated to the ner grid h = 1

128

and the

ontinuationproedurewithrespettoisrepeated. Theresultsaredepited

inTable 2. Thegraphs fortheorrespondingontrols aregiveninFigure1.

The same proedure with h=1=512 and =100 gives the optimaltime

(23)

No. ofiterations 5 46 4 4 3

FinalTime 11:1088 10:6092 10:6034 10:6033 10:6031

Table 2

0

2

4

6

8

10

12 −1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1 11.1088

time

control

0

2

4

6

8

10

12 −1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1 10.6092

time

control

0

2

4

6

8

10

12 −1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1 10.6033

time

control

Figure1: N=128and=1(left),10(middle)and100(right)

the lowest values of the full Newtonstep was toolarge. Therefore we used

a one-dimensionallinesearh based onaquadratipolynomial interpolation

for the L 2

norm of the residual ombinedwith an Armijorule.

Table 3depits the quotients ju

k +1

u

()j

L 2

ju k

u

()j

L 2

, whereu

() is the solution to

the disretized version of (2.17)for =50. It shows that the algorithmisin

fat superlinearlyonvergent.

No. ofiterations 1 2 3 4

k

0:94138 0:00037 0:00001 0:00000

Table 3

In this paper we hose to regularize by the ramp funtions

" with

inreasing slops as " ! 0 +

. Certainly other alternatives are possible as for

instane ~

(s) =

2

atan(s): This family of C 1

funtions also has the

property that it onverges to as ! 0, but it appears to be less apt for

the purpose of approximating the disontinuous swithing struture of the

optimal ontrols sine has to be taken signiantly larger for ~

than for

1

toobtain omparable results.

Aknowledgement: We thank Mrs. J. Rubesa for providing us with the

(24)

[BPW℄ Ch. Buskens, H.J. Pesh, and S. Winderl: Real-time solutions for

bang-bang and singular optimalontrol problems, in: M. Grotshel,

et al. (Eds), Online Optimization of Large Sale Systems,

Springer-Verlag, Berlin,2001, 129-142.

[Fa℄ H.O. Fattorini: Innite Dimensional Linear Control Systems: The

Time Optimal and Norm Optimal Problems, Elsevier, Amsterdam,

2005.

[HL℄ H. Hermes and J. LaSalle: Funtional Analysis and Time Optimal

Control, Aademi Presse, 1969.

[HK℄ M.HintermullerandK.Kunish: PDE-onstrainedoptimization

sub-jet to pointwise ontrol and zero- and rst-order state onstraints,

preprint.

[HIK℄ M. Hintermuller, K. Ito and K. Kunish: The primal{dual ative

set strategy as a semi{smooth Newton method, SIAM Journal on

Optimization,13(2002), 865-888.

[IK1℄ K. Ito and K. Kunish: The primal-dual ative set method for

non-linear optimal ontrol problems with bilateral onstraints, SIAM J.

on Control and Optimization,43(2004), 357-376.

[IK2℄ K.ItoandK.Kunish: On theLagrangeMultiplierApproahto

Vari-ational Problems and Appliations, SIAM, Philadelphia,2008.

[KN℄ C.Y.KayaandJ.L.Noakes: Computationalmethodsfortime-optimal

swithing ontrols, JOTA, 117(2003),69-92.

[Ke℄ H.B. Keller: Numerial Mathods for Two-Point Boundary Value

Problems: Blaisdell,London, 1968.

[KLM℄ J.-H.R.Kim, G.L. Lippi, and H. Maurer: Minimizing the transition

time in lasers by optimal ontrol methods. Single-mode

semiondu-torlasers withhomogenoustransverse prole: PhysiaD, 191(2004),

238-260.

[LM℄ E.B. Lee and L. Markus: Foundations of Optimal Control Theory:

(25)

Springer Verlag, New York, 1982.

[MO℄ H. Maurer and N.P. Osmolovskii: Seond order suÆient optimality

onditions fortime-optimalbang-bang ontrol, SIAM J.Controland

Optimization,42(2004), 2239-2263.

[MZ℄ H. Maurer and J. Zowe: First and seond-order neessary and

suf-ient optimality onditions for innite-dimensional programming

problems, MathematialProgramming 16(1979), 98{110.

[MB℄ E.Meier and A.E.Bryson: EÆient algorithmsfortime-optimal rigid

spaeraft reorientation problem,J. Guidane, 13(1990), 859-866.

[XK℄ L. Xieand K. Kunish: Numerialmethodsfor time-optimalontrol