CiteSeerX — Smooth Task Switching through Behaviour Competition

(1)

Behaviour Competition

Philipp Althaus and Henrik I. Christensen

Centre for Autonomous Systems, Numeri al Analysis and Computer S ien e,

Royal Instituteof Te hnology (KTH), S-10044 Sto kholm, Sweden

Abstra t

Navigation in large-s ale environments is omposed of dierent lo al tasks. To

a hieve smooth swit hing between these tasks and thus a ontinuous ontrol sig-

nal, usually a pre ise map of the environment and an exa t pose estimate of the

robotareneeded.Botharehardto fulllforexperimentsinreal-worldsettings.We

presentasystemthatshows howone anrelaxtheneedfora uratemetri models

of the environment while at the same time a hieving smooth task swit hing. To

fa ilitate this s heme the dynami al systems approa h is used,whi h in orporates

behaviour oordination through ompetition ina dynami framework.Feature de-

te torsusesonar datatoprovidemeansforlo alnavigation.Thisability ombined

with a simpletopologi al map onstitutes a omplete navigationsystem for large-

s aleoÆ eenvironments.ExperimentsshowedthataS outrobotusingthiss heme

isableto su essfullynavigatethroughourwholeinstitute.Throughtheuseofthe

dynami behaviour oordination, swit hing between the navigational tasks o urs

ina smoothmanner leadingto ontinuous ontrolofthe platform.

Key words: behaviour oordination,mobilerobots, robotnavigation, dynami al

systems,hybriddeliberative systems

1 Introdu tion

A omplete navigation system of an autonomous robot is traditionally of a

hybridnature.It onsistsofa rea tivepart, whi h allowstodeal with unfore-

seenlo alevents.Then,thereisadeliberativepart, whi htakes are ofglobal

issues like path planning. See [3℄ for an overview of dierent ar hite tures.

Email addresses: philippnada.kth.se(PhilippAlthaus),hi nada.kth.se

(HenrikI. Christensen).

(2)

s alestru tures. Fortheimplementationofthis mapand itsusethere aretwo

on eptually dierent approa hes, using either an exa t geometri map or a

qualitativemap (e.g. topologi al map).

An exa t geometri map (forexample, a grid map[11℄ ora feature map [10℄)

allowstodetermineanappropriate ontrola tionatanyrobotposition,whi h

leadstosmooth ontrolandtraje tories.However,thispre isemaprsthasto

beobtained and the exa t robotposition has tobe known atalltimes. These

onditions are hard to fulll and might be omputationally very expensive.

In ontrast, the use of a topologi al map requires only an approximate guess

of the robot pose; also the small-s alegeometry does not have to be known

pre isely. Through sensory informationfrom the robotit is determined when

therobothasrea hedanodeinthemap.Thisinformationisenoughtoswit h

between the ne essary behaviours to deal with the navigation task at hand.

The Polly system [5℄, for example, shows how su essful navigation an be

a hieved using thissimpletype ofqualitativemap.However, through dis rete

swit hing between tasks the ontrolsignals are not ontinuous. Furthermore,

alsothe traje toryof the robot is not smooth anymore.

In this paper we propose a unied ontrols heme ombining the advantages

ofea hof thetwoapproa hes above.Wepresentanavigationsystem foroÆ e

environments using feature dete tors whi hextra t representations fromsen-

sory data. This geometri pla e re ognition fa ilitates safe lo al navigation.

To apture the large-s ale properties of the environment a simple topologi-

almap is used. Nevertheless, through smooth task swit hing ina dynami al

system framework, ontinuity in the ontrol signal an be a hieved. The dy-

nami alsystemsapproa htobehaviourbasedroboti s introdu edby S honer

and Dose [15℄ provides this ne essary framework. It utilizes a ompetition

s heme among individual behaviours. So far, this ompetition has only been

exploitedinsimulationwork[15,16,9℄usingsimpliedsmall-s alesettings.The

system proposed in this paper, however, a ts in a large-s ale real-world envi-

ronment usinga topologi almap ontainingthe global informationne essary

for navigation. This information about the whereabouts of the robot is still

of a dis rete nature; nevertheless, it is only used as parameters in the dy-

nami oordinationamongthebehaviours. Thisdynami system thenensures

smooth swit hing between tasks. In this way the dis rete nature of a quali-

tative map is ombined with the need for ontinuous navigation in a unied

ontrols heme. Inessen e, the systempresented shows thatone an relaxthe

needfor ontinuousposeestimationwhilemaintainingapossibilityforsmooth

behaviourswit hing.

Initially, the topologi al map fa ilitating task swit hing is introdu ed inSe -

tion 2. Then, in Se tion 3 the geometri representations used for lo al navi-

gation are presented. The dynami al systems approa h is outlinedin Se tion

(3)

periment is presented in Se tion 5, while a summary and avenues for future

resear h are outlinedin Se tion6.

2 The Topologi al Map and its Use

The topologi almap is used by the system to swit h between dierent tasks,

in essen e to a tivate an appropriate set of behaviours. Hen e, it ontains

qualitative information about the large-s ale stru ture of the environment.

This information is re e ted in nodes and edges that onne t these nodes.

The nodes stand for important pla es in the environment. There has to be

one in front of ea h door, at ea h orridor rossing and at other pla es of

interest(e.g.goallo ationsand hargingstation).Ea hnode hasalo ationin

axed oordinatesystem.The edgesthat onne t thesenodes anbeof three

dierenttypes: room, orridor,door.Duetothelimitedamountandsimpli ity

of information in the topologi almap, it is a matter of minutes to onstru t

a new one for a previously unknown oÆ e environment. Figure 1 shows the

topologi almapofour institute. Toplanapath between twoarbitrarynodes,

thesystem ondu ts abreathrst sear hthrough thegraph.This isafeasible

solutionfor environments of the size of anoÆ e area.

Fig. 1. The topologi al map of our institute.The ir lesin the map depi t nodes,

whi h have an exa t lo ation in a oordinate system. Edges are of three dierent

types: orridor (thi k line), room (dashed line), and door (thin line). Additional

nodes for goal points and starting positions an be added arbitrarily. The nodes

denoted with \Charging Station" and \Goal Point" orrespond to the initial and

nal lo ation of the experiment des ribed inSe tion 5. The nodes in gray are the

onesused to exe utethisplan.

At plan exe ution, it is assumed that the initial position and orientation of

the robot are known (e.g. harging station). From there odometry is used

to determine the robots lo ation.This introdu es errors in the estimation of

the exa t position of the robot, but is totally suÆ ient to determine if the

system is in the vi inity of a node. This information is enough to evoke the

(4)

s ale stru tures of the environment (see Se tion 4.2). Note that the error in

the position estimate would grow bigger than desired on long trials over a

great distan e. To avoidthis de ien y, the pose estimate is orre ted based

ondete ted features(see Se tion3).

3 Extra ting Geometri Representations from Raw Sensor Data

Inour experimentsweused theS outrobotfromNomadi Te hnologies(Fig-

ure 2). The platform has a ylindri al shape with a diameter of 38 m and

moves at a speed of up to 1 m/s. The robot is equipped with a ring of 16

evenly spa ed ultrasoni sensors. The per eption and geometri re onstru -

tion of obsta les, walls and doors isbased solely onthe informationprovided

by these sensors. Ea h sonar has a beam width of 25 Æ

and a dete tion range

of 6 to 255 in hes. The robot possesses a two wheel dierentialdrive lo ated

atthegeometri enter whi hallows omni-dire tionalsteeringatzeroturning

radius.The wheelsare equipped with en oders to obtain odometri data.

Fig.2.The S out robot usedintheexperiments.

The ontrol system onsistsof ve dierent behaviours: orridor follow-

ing,wall avoidan e,door passing, obsta le avoidan e,and go to.

Thesebehavioursensuresafenavigationgivenanindividualtask.Hen e,they

allneed some geometri representation of the lo alproperties of the environ-

ment.

To navigate along a orridor two behaviours have been designed: orridor

following and wall avoidan e. They are based on the orientation of

the orridor and the distan e to itswalls. To obtainthis informationthe 200

most re ent sonar readings are kept in a FIFO buer. A Hough transform

[4℄ is invoked on the sonar data every few se onds in order to extra t the

pair of parallel lines (one on either side of the robot) that oin ide with the

largest number of sonar e hos. No assumptions on the width or dire tion of

the orridor are made.

(5)

large enough opening ina wall.In order tond a door, when the robot nds

itself in a orridor, the dire tion to the dete ted orridor wall is used. The

25 most re ent sonar readings, that lie in the dire tion of this wall and not

more than 50 m behind it, are kept in a FIFO buer. The largest angular

segment (from the robots point of view) that does not ontain any sonar

reading is determined.If this segment is greaterthan 15 Æ

we onsider a door

to be dete ted and its dire tion is dened as the entre of the free segment.

Thispro essisinvoked atevery ontrol y leof therobot.Notethatthisdoor

dete tor is very rude, due to the simpli ity of the sensors used. Espe ially

half blo ked doors, with passages that are to small to pass, will sometimes

still be dete ted as doors. However, situations like this are resolved by the

oordination amonga doorpassing and anobsta le avoidan e behaviour(see

the experimental results in Se tion 5).Further, if the robot is in a roomthe

samestrategy todete t adoorisapplied.However, rstthe wallatwhi hthe

door islo ated has to be extra ted. In order to do this a Hough transform is

invoked onthe 100 most re ent sonar e hos.

Duetothelimitedangularresolutionofsonarsensors,thegeometri represen-

tation of obsta les (used by the behaviour obsta le avoidan e) is rather

simple and losely linked to the a tual per eption of the robot. Out of the

50 most re ent sonar readingsthat donot belong to dete ted walls, the ones

in the frontal half plane of the urrent robot heading are onsidered. Obsta-

les are re onstru ted from these dete ted e hos in as ending order of their

distan e to the robot.The e ho losest to the robotdenes the rst obsta le

whose orientation in the robot frame is given by the axis of the sensor that

re eivedthe e ho.A newobsta leisre orded foreverysubsequente howhose

orientationdiers by anangle of atleast 22.5 Æ

from any previously identied

obsta le. New obsta les are added in an in remental fashion until the sonar

buer ontains no further e hos. Obsta le re onstru tion is invoked at every

ontrol y le of the robot. Noti e, that our representation only onsiders the

distan etoanobsta lebut ignoresitsshapeorsize. Despiteitssimpli ity,the

hosen representation ispowerfulenoughto su essfullynavigate in luttered

areas [2,1℄.

Ea h of the above dete tors keeps a ertainnumberof the most re ent sonar

readingsinaFIFObuer.While olle tingthesereadingstherobotisdriving

ashortdistan e.Odometryisused,to al ulatethe lo ationofsonarreadings

taken at dierent robot positions, whi h introdu es further un ertainty in

the sonar data. These errors, however, are omparatively small and hardly

in uen e the performan e of the behaviours.

The behaviour go to aligns the robotsheading with the dire tion of a goal

point. We do not use any dete tor for this goal point yet; its lo ation is only

dened by anode inthe topologi al map (Se tion 2).

(6)

todeterminewhi hoftheabovedete torsshouldbeinvoked.Theinformation

about the exa t lo ation of its nodes and the robots position estimate (from

odometry)determineifthe robotndsitselfina orridororinaroomand/or

lose to a door. The informationfrom the dete ted features, in turn, is used

to update the position estimate. Otherwise, the error in this estimate may

growbiggerthandesiredusingonlyodometry.Hen e,ina orridortherobots

omputed orientation and position are adjusted relativeto the orridor walls

ea htimetheHoughtransformisinvoked(i.e.everyfewse onds).Inaddition,

every timeadoorispassedorientationandpositionrelativetothe doorposts

an be updated a ordingly.

4 Behaviour Coordination

The dynami alsystems approa h has been used to design the individual be-

haviours and their intera tion. The on eptualframework of this approa h is

based on the theory of nonlinear dynami al systems [13℄. In Se tion 4.1 we

only providea brief outline of this framework and referthe interested reader

to [16℄ for a more detailed des ription. Se tion 4.2 des ribes the design of

our system,espe iallythedynami oordinationof theindividualbehaviours.

Note that both, the behaviours and their oordination, are implemented in

a unied framework to provide a ontinuous ontrol signal at all times. The

parametersof thisframeworkare an hored inthetopologi almap(Se tion 2)

and in the features extra ted from the sensory data (Se tion 3).

4.1 Dynami alSystems Approa h

A behaviour b emerges from the time evolution of the behavioural variables

des ribed by the ve tor ~x . In a navigation task for example the robot head-

ingand velo ity onstitute the set of behavioural variables.In the dynami al

system des ribed by

_

~x=

~

f

b

(~x) (1)

the fun tion

~

f

b

an be interpreted as a for e a ting on the behavioural vari-

ables.Thisfor eisdesignedsu hthat thedesiredvalues of~x(e.g.dire tionof

atarget)formanattra torandundesiredvalues(e.g.dire tionofanobsta le)

forma repellor in the dynami s of the behavioural variables. The fun tion

~

f

b

dependsontherelativeposebetween therobotanditsenvironment.However,

the dynami s of ~x takes pla e on a mu h faster time s ale than the gradual

hanges that emerge in

~

f

b

as a result of the robot's motion. This property

assures that the dynami variables remain lose to the attra tor state at all

(7)

vidual ontributions

~

f

b :

_

~ x=

X

b jw

b j

~

f

b

(~x)+noise (2)

The weights w

b

2[ 1;1℄ dene the strength of ea h behaviour and are om-

puted based on the per eived ontext of operation. The noise has a small

amplitude and merely ensures that the dynami s es apes unstable x-points

(repellors).Coordinationamong behaviours ismodelledbymeans of anaddi-

tional ompetitivedynami sthat ontrols the weightsw

b

,whi hevolveinthe

following fashion:

b _ w

b

=

b (w

b w

3

b )

X

b 0

6=b

b 0

;b w

2

b 0

w

b

+noise (3)

The rst term onstitutes a pit hfork bifur ation,i.e. the dynami spossesses

stable x-pointsat

w

b

= 8

<

:

1 if

b

>0

0 if

b

<0

(4)

The fa tors

b

2 [ 1;1℄ are alled ompetitive advantages. They determine

the degree to whi h a behaviour is appropriate and desirable in the present

ontext. The se ond term in equation 3 aptures the ompetitive dynami s

in that an a tive behaviour b 0

of higher priority suppresses the a tivation of

another on i tingbehaviourb.Hen e, thefa tors

b 0

;b

2[0;1℄are alled om-

petitive intera tions. For jw

b 0

j 1 and

b 0

;b

>

b

, the point w

b

= 0 be omes

the new stable x-pointof behaviour b, despitea positive ompetitiveadvan-

tage

b

>0.A detailedanalysisof howthestabilityof x-pointsvaries a ross

dierentvaluesof ompetitiveadvantagesandintera tionsisgiven in[9℄.The

time onstant

b

determines the rate atwhi h the behaviours are swit hed on

and o.Similartothe behaviouraldynami s,the noise termhelpsthe system

toes ape unstable x-points interms of behaviour oordination.

4.2 System Design

We hose the robot heading as the behavioural variable of the dynami-

al system as it oers the advantage that the behaviours an be naturally

expressed in this variable. Furthermore, the ommanded turn rate _

an be

dire tly appliedas a ontrola tion tothe robot. The translationalvelo ity is

regulatedby anexternal ontrolloop,whi hredu es therobotspeedbasedon

two values: 1) the proximity of nearby obsta les, for safety reasons, 2) ahigh

turn rate _

, to assure that the robots heading remains lose to an attra tor

state atall times (see Se tion4.1).

(8)

ridor following, wall avoidan e, door passing, obsta le avoid-

an e, and go to. The design of these behaviours and their fun tional form

f

b

() (equation 1) are motivated and dis ussed in our previous work [2,1℄.

These behaviours or ombinations of them are able to deal su essfully with

theenvironmentonthesmalls ale.In[2℄,forexample,weshowed analyti ally

that the system an reliably distinguish between those passages that are too

narrow topass and gaps that are wide enough totraverse safely.

Theoverall dynami softhe system isobtainedfromthe weighted summation

of individualbehaviours based onequation 2:

_

= X

b jw

b jf

b

()+noise (5)

with b 2 fgoto;obst; orr;wall;doorg. Forthe oordination of the behaviours

the ompetitiveadvantages

b

,the ompetitiveintera tions

b 0

;b

,andthe time

onstants

b

in equation3 have tobe hosen appropriately.

The ompetitive advantages re e t the relevan e and appli ability of a be-

haviourin a parti ular ontext. Obviously, go toshould bea tivated when-

ever the agent nds itself in a room and is supposed to approa h a goal;

otherwise, itis turnedo. Ifthe robotisin a ertain room, is determinedus-

ing the estimate of the robot's positionand the topologi al map (Se tion 2).

For

goto

2(0;1℄the behaviour go to isswit hed on. To have the possibility

for any ompetitive intera tion

b;goto

2 [0;1℄ to be greater or smaller than

goto

, a value of 0.5is hosen for the ompetitiveadvantage. Hen e:

goto

= 8

<

:

0:5 if ina room

0:5 otherwise

(6)

Equivalently, orridor following and wall avoidan e are relevant if

the robotis in a orridor.

orr

=

wal l

= 8

<

:

0:5 if in orridor

0:5 otherwise

(7)

The ompetitive advantage of door passing is tightly oupled to the per-

eption. It is set to a positive value as soon as the door we want to pass is

dete ted (se tion 3).

door

= 8

<

:

0:5 if door dete ted

0:5 otherwise

(8)

Therelevan eofobsta leavoidan edependsonthenumberandproximity

of the obsta les urrently surrounding the robot.As ameasure re e ting this

(9)

= X

i e

d

i

(9)

The sum goes over all obsta les i and the d

i

stand for the distan e to them

as a multiple of the robot's radius. The ompetitive advantage of obsta le

avoidan e is,then, omputed a ording to

obst

=tanh(

0

) (10)

The onstant

0

determines the density above whi h obsta le avoidan e be-

omes relevant (i.e.

obst

>0).The tangent hyperboli ensures that the mag-

nitude of

obst

is limitedto the interval [ 1;1℄.

The ompetitive intera tion

b 0

;b

re e ts the degree to whi h an a tive be-

haviourb 0

suppresses another behaviourb. In fa t,there are situations where

behaviours would interfere with ea h otherin anundesirable, ounterprodu -

tivemanner.Adoorthatishalf-blo ked byanobsta lemightstillbedete ted

as a door, although the gap to pass is a tually too narrow. Hen e, we want

obsta le avoidan e to suppress door passing in the presen e of a high

obsta ledensity. Furthermore,if twoobsta les lie lose toea hother, the dy-

nami s of generates a weak repellor in the middle of them (this has been

shown in [2℄). This repellor, however, ould be dominated by an attra tor

of another behaviour, whi h would inevitably lead to ollision.Consequently,

obsta le avoidan e ought to suppress go to and orridor follow-

ing as well, if the obsta le density (equation 9) ex eeds a riti al threshold

. This prioritization is a hieved by appropriately hoosing the ompetitive

intera tions:

obst;goto

=

obst; orr

=

obst;door

= 1

2

(1+tanh(

)) (11)

The onstant

determinesthedensityatwhi hobsta leavoidan esuppresses

the otherbehaviours (

obst;b

>0:5).The fun tionalformof theterm is hosen

su h that

obst;b

2 [0;1℄. Sin e there exist no potential on i ts among any

other pair of behaviours, all other ompetitive intera tions

b 0

;b

are set to

zero.

The time onstants

b

determine the time s ale at whi h the behaviours are

swit hed onand orespe tively.

obst

is hosenverysmall,su hthattherobot

rea ts almost immediatelyif anew obsta leis per eived. The same holds for

wal l

. As soon as a door is dete ted, the robot should turn towards it before

driving out of dete tion range again. Consequently,

door

is also hosen to be

small. The dynami sof w

goto

and w

orr

evolveat aslowerrate

goto

=

orr

obst

. On e obsta le avoidan e be omes less relevant, e.g. after the robot

ir umnavigates anobsta le, the other behaviours swit hongradually rather

than ausingjitter amongthemselvesand obsta le avoidan e.

(10)

In the following we dis uss a path followed by the robot.The result is quite

representativeforresultsobtainedfromotherexperiments.Figure3showsthe

traje toryoftherobotduringatypi altask:drivingfromthe hargingstation

in the living room to a goal point in the manipulator lab. During this task,

the robot overed adistan e ofabout 50meters.Inthe middleof the orridor

the way was blo ked by people. Therefore, the robot was ir ling for about

a minute before the passage was leared and it was able to pro eed (see [2℄

fordetailsonbehaviourswit hinginblo ked orridors). Throughthis ir ling

theerror inthe positionestimate,obtained fromodometryonly,would betoo

large.However,by orre tingthisestimateonthebasisofthedete ted orridor

(Se tion 3) the error was kept small and the robot ontinued su essfully to

the goalpoint.

0 10m

living room manipulator lab

Fig.3.Thetraje toryfromatrialinourinstitute.Therobotstartedatthe harging

stationinthelivingroomand pro eededthroughthe orridorto agoalpointinthe

manipulatorlab.The boldre tangle an beseen enlargedinFigure 4.

Figure4visualizes moredetailson thetraje toryfromfollowinga orridorto

passing a door and rea hing a goal point. In addition, the absolute values of

the dierent weightsjw

b

j(equation5)are plottedover time.The labelledti s

on the time axis orrespond to the situations A-F that are des ribed below.

Noti e,thatthetimedieren ebetweentwosu essiveti sisnotproportional

to the path length between the orresponding events, as the robot does not

moveat a onstant speed.

A) The vi inityof the next node in thetopologi almap was rea hed and the

doorwasdete ted. Door passing wasgraduallyswit hed onandguidedthe

robot towards the door. Corridor following was turned o on a slower

times alethanwall avoidan e.B)Thedoorwasblo ked byapersonleav-

ingtheroom.However,obsta le avoidan e ompetedwithdoorpassing

andnallydea tivatedthelatterduetoahigh obsta ledensity (equation11).

Consequently, the robot turned away from the door. C) The door was de-

te ted again and obsta le density has de reased. Therefore door passing

wasgraduallya tivatedandthe robotturnedtowardsthe dooragain.D)The

robot passed the door. Due to the high obsta le density door passing was

swit hed o again and obsta le avoidan e guided the robot through the

opening. E) The vi inity of the next node was rea hed. Thus, go to was

(11)

B

C A F

D E

A B C D E F

0 1

time

b

A B C D E F

0 1

time

b

Fig. 4. Details on the task swit hing from following a orridor to rea hing a goal

point in a room. The left image shows an outline of the traje tory of the robot.

Thegrayellipsedenotesapersonthatwasleavingtheroom,whentherobotwasat

lo ationB.Ontherighthandside,theabsolutevaluesoftheweights(seeequation5)

are plotted over time: jw

orr

j(upperplot, solid urve), jw

wal l

j (upper plot,dotted

urve), jw

door

j (upper plot, dashed urve), jw

obst

j (lower plot, dotted urve), and

jw

goto

j (lower plot, solid urve). The situations A-F in the left image orrespond

to the time instan es labeled in the plots. These situations are explained in more

detailin thetext.

graduallyturnedonleadingtoasmoothswit hintasksagain.F)Eventually,

the goalpoint was rea hed and the trialwas ompleted.

Note that in Figure 4 not the a tual traje tory of the robot is plotted, but

thepositionestimateobtained fromtheodometry.Thesharp bendrightafter

passing the door (D) is therefore not a turn that the robot a tually made. It

is simply an artifa t of updating the position estimate after having passed a

door (see Se tion 3). Further, if the door was losed or the doorway had not

been learedaftera ertainamountoftime,theplanningmodulewoulddete t

this. It would sear h for an alternative path through the topologi al map. If

thereis nootherway torea hthe goal,the robotrepeats stepsC andDuntil

it eventuallywill beable topass.

In se tion4.2 the parameter values of the behaviour oordination s heme are

motivatedandexplained.However, animportantquestionisthe sensitivityof

thesystemtodierentparametervalues.Ourexperien eshowsthatthepre ise

values of the ompetitive advantages,

b

, (equations 6, 7, 8, 10) are not that

important as long as the signs are orre t. However, the performan e of the

system isquitesensitive tothe ompetitiveintera tions,

obst;b

,(equation11).

This means that the riti al obsta le density,

, has to be hosen arefully,

su hthatobsta le avoidan esuppressesthe otherbehaviours atthe right

moment.

(12)

Feature dete tors were used to extra t environmental representations from

the sensory data obtained by the sonars of the S out robot; namely orri-

dors, doors, walls, and obsta les. This geometri information fa ilitates the

single behaviours to perform individual tasks: navigating safely along orri-

dors, through doors and around obsta les. To determine whi h one of these

tasksisappropriateina given ontext, asimpletopologi almapand odomet-

ri data from the robot were used. To ombine the ontinuous ontrol of the

dierent behaviours with the dis rete information from the qualitative map

we deployed the dynami alsystems approa h tobehaviourbased roboti s. In

this way smooth swit hing between the navigational tasks ould be a hieved.

In essen e, this omprises a unied ontrol s heme that relaxes the need for

a urate metri models of large-s ale environments, through embedded pla e

re ognition inthe behaviours and behavioural ompetition.

This framework has been su essfully tested in the premises of our institute

(70 20 meters). An example of an experiment has been presented, whi h

shows that the system is apable of ondu ting long trials through a large-

s ale real-world oÆ e environment. It an also be seen that the ompetition

among the behaviours is able to deal with more omplex situations like half-

blo ked doors.

There are other approa hes to navigate in large-s ale environments. Many

of them (Xavier [7℄, for example) need more detailedmodels of the environ-

ment and sophisti ated algorithms (e.g. Markov de ision pro ess models) to

determine an appropriate ontrol a tion for the robot at all times. An other

approa h [14℄ uses, as we do, the superposition of dierent lo al behaviours

(motor s hemas). However, dis rete ontext hanges lead to dis rete hanges

in ontrol. The same holds for a variety of systems using topologi al maps

(Dervish[12℄,for example).Inour system thisinformationfromthe topologi-

almap anbemergedwiththe ontinuousnatureoftheindividualbehaviours

using thedynami al system approa h. Thisleads toa unied ontrols heme.

Atrstsight,itseemsthatthe oordinationframeworkwouldnots alewellto

alargenumberofbehaviours,sin ethereisaparameter,

b 0

;b

,tobedetermined

forea hpairofbehaviours.However,inourexperien e,thisshouldnotbe ome

aproblem.Asinoursystem,thereare usuallyfewbehaviours on i ting with

ea hother. This means that most ompetitiveintera tions are set tozero.

Theuseofsonarsasthetheonlysensorsrestri tsoursystemindierentways.

The representations of the environment are rather simple, whi h an lead to

problems (e.g. if two doors are right next to ea h other). Future resear h

in this proje t will be dire ted towards integration of more a urate sensors

(13)

more sophisti ated sensing apabilities, we willalso try to solve the problem

of globallo alization(negle ted in this paper) using justa simpletopologi al

map.In addition,advan ed sensing apabilitieswould alsoprovideabasis for

learningthetopologi almapbythesystemitself.This ouldbea hieved using

moresophisti atedmodulesfor pla ere ognitionasshown in[8℄.Further,the

generality of the behaviour oordination s heme has to be exploited. Up to

now, it has only been used for navigation behaviours in fet h-and- arry type

tasks. It isplanned toapply the framework toa dierent platform[6℄, where

the emphasis lies onhuman-robot intera tion.

A knowledgment

This resear h has been sponsored by the Swedish Foundation for Strategi

Resear h. The support is gratefully a knowledged.

Referen es

[1℄ P. Althaus and H. I. Christensen. Behaviour oordination for navigation in

oÆ eenvironments. InPro eedings of the IEEE/RSJInternational Conferen e

on IntelligentRobots and Systems, pages2298{2304, 2002.

[2℄ P. Althaus, H. I. Christensen, and F. Homann. Using the dynami al system

approa htonavigateinrealisti real-worldenvironments. InPro eedings ofthe

IEEE/RSJ International Conferen e on Intelligent Robots and Systems, pages

1023{1029, 2001.

[3℄ R.C. Arkin. Behavior-Based Roboti s. MIT Press,Cambridge, MA,1998.

[4℄ J. Forsberg, U. Larsson, and

A. Wernersson. Mobile robot navigation using

therange-weighted houghtransform. IEEE Roboti s &Automation Magazine,

2(1):18{26, 1995.

[5℄ I. Horswill. The Polly system. In D. Kortenkamp, R. P. Bonasso, and

R. Murphy, editors, Arti ial Intelligen e and Mobile Robots: Case studies of

su essfulrobotsystems, hapter5,pages125{139.MITPress,Cambridge,MA,

1998.

[6℄ T.Kanda,H.Ishiguro,M.Imai,T.Ono,andK.Mase. A onstru tiveapproa h

for developing intera tive humanoid robots. In Pro eedings of the IEEE/RSJ

International Conferen eon Intelligent Robots and Systems, pages1265{1270,

2002.

(14)

partiallyobservablemarkov de isionpro essmodels. InD. Kortenkamp, R.P.

Bonasso, and R. Murphy, editors, Arti ial Intelligen e and Mobile Robots:

Case studies of su essful robot systems, hapter 4, pages 91{122. MIT Press,

Cambridge, MA,1998.

[8℄ B. Kuipers, R. Froom, W.-Y. Lee, and D. Pier e. The semanti hierar hy in

robotlearning. InJ.ConnellandS.Mahadevan,editors,RobotLearning,pages

141{170. KluwerA ademi Publishers, Boston,MA, 1993.

[9℄ E.W. Large, H.I. Christensen,and R.Baj sy. S aling thedynami approa h

to pathplanningand ontrol: Competition among behavioral onstraints. The

International Journal of Roboti s Resear h, 18(1):37{58, 1999.

[10℄J.J.Leonardand H.F.Durrant-Whyte. Mobilerobotlo alizationbytra king

geometri bea ons. IEEETransa tions on Roboti sand Automation,7(3):376{

382,1991.

[11℄H.P.Morave . Sensorfusionin ertaintygridsformobilerobots. AIMagazine,

9(2):61{74, 1988.

[12℄I. Nourbakhsh. Dervish:An oÆ e-navigatingrobot. In D. Kortenkamp, R.P.

Bonasso, and R. Murphy, editors, Arti ial Intelligen e and Mobile Robots:

Case studies of su essful robot systems, hapter 3, pages 73{90. MIT Press,

Cambridge, MA,1998.

[13℄L.Perko. Dierential Equations and Dynami al Systems. Springer,New York,

1991.

[14℄A. SaÆotti, K. Konolige, and E. Ruspini. A multi-valued logi approa h to

integratingplanningand ontrol. Arti ial Intelligen e,76(1{2):481{526 ,1995.

[15℄G. S honer and M. Dose. A dynami al systems approa h to task-level system

integration usedto plan and ontrolautonomous vehi lemotion. Roboti s and

Autonomous Systems, 10:253{267, 1992.

[16℄G. S honer, M. Dose, and C. Engels. Dynami s of behavior: theory and

appli ations for autonomous robot ar hite tures. Roboti s and Autonomous

Systems, 16(2{4):213{245, 1995.

(15)

Philipp Althaus is a PhD student at the Centre for Autonomous Systems at

the Royal Institute of Te hnology (KTH) in Sto kholm, Sweden. He gradu-

ated1998inphysi sfromtheFederalInstituteofTe hnology(ETH)inZuri h,

Switzerland. Until 1999, he worked as a resear h assistant at the Institute of

Neuroinformati s at the University of Zuri h. His resear h interests in lude

behaviourbased roboti s, navigation, and mobilerobots ingeneral.

Henrik I. Christensen is Professor of Computer S ien e at the Royal Insti-

tute of Te hnology and dire tor of the Centre for Autonomous Systems. He

was awarded M.S . and Ph.D. degrees in 1987 and 1989, respe tively, from

Aalborg University, Denmark. His resear h is primarily in dynami vision,

robot navigation and systems integration. He has published/edited 7 books,

6spe ial issues and 140 papers. Dr.Christensen isthe oordinator of the Eu-

ropean Roboti s Network, EURON, and a resear h oordinator for the EU

network on Cognitive Vision. In addition he is on the editioral board of the

journals IEEE Transa tions on Pattern Analysis and Ma hine Intelligen e,

International Journal onRoboti s Resear h, International Journalof Pattern

Re ognitionand Arti ialIntelligen e, Ma hine Visionand Appli ations,and

Roboti s and Autonomous Systems. He maintains a tive ollaborations with

resear h groupson three ontinents.