• No results found

Data Aggregation and Cross-layer Design in WSNs

N/A
N/A
Protected

Academic year: 2019

Share "Data Aggregation and Cross-layer Design in WSNs"

Copied!
98
0
0

Loading.... (view fulltext now)

Full text

(1)

Rochester Institute of Technology

RIT Scholar Works

Theses

Thesis/Dissertation Collections

2005

Data Aggregation and Cross-layer Design in WSNs

Carter May

Follow this and additional works at:

http://scholarworks.rit.edu/theses

This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion

in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact

[email protected].

Recommended Citation

(2)

Data Aggregation and Cross-layer Design in WSNs

by

Carter May

A Thesis Submitted

m

Partial Fulfillment of the

Requirements for the Degree of

Advisor:

Fei

Hu

Master of Science

m

Computer Engineering

---Dr. Fei Bu, Assistant Professor

Committee Member:

---

Marcin Lukowiak

Dr. Marcin Lukowiak, Assistant Professor

Committee Member:

_S_h_a_n_c_h_ie_h_J_aL-y_Y_a

n~g",--____________________ __

Dr. Shanchieh Jay Yang, Assistant Professor

Department of Computer Engineering

Kate Gleason College of Engineering

Rochester Institute of Technology

Rochester, NY

(3)

Release Permission Form

Rochester Institute of Technology

Data Aggregation and Cross-layer Design in WSNs

I, Carter May, hereby grant pennission to the Wallace Library of the Rochester Institute of

Technology to reproduce this thesis, in whole or in part, for non-commercial and non-profit

purposes only.

Carter May

Carter May

(4)

Table

of

Contents

TableofContents 1

ListofFigures 3

Chapter 1 Introduction 5

1 Abstract 5

2 Data Aggregation 6

3 Cross-Layer Optimized Design 6

4 Research

Methodology

6

5 Thesis Organization 7

Chapter 2 BackgroundandRelated Research 8

1 Wireless Sensor Networks 8

1.1 Common Network Topologies 8

1.2

Clustering

8

1.3 Trees 10

1.4 Ad Hoc Peer Networkvs.WSN 10

2 Data Aggregation 12

2.1 Data Aggregation

Timing

14

3 Cross-Layer Design 17

3.1 Traditional Network Model

-OSI Stack 17

3.2 MAC-layer design in WSNs 19

4

Summary

20

Chapter 3 A Novel

Timing

Control for Optimal Data Aggregation 21

1 Problem Statement 21

2 Assumptions 23

-2.1 Cluster Size 24

3 Proposed Time Synchronization Algorithm 27

3.2 Control Parameter Refresh Rate 30

3.3 Finite State Machine ImplementationofAlgorithm 31

3.4 LevelofAggregationChosen

by

Sink 34

3.5 Height-based Differential Analysis 35

4

Energy

AnalysisofProposed

Timing

Synchronization Algorithm 38

5 ApplicationsandExamples 46

Chapter 4 Cross-layer Design 49

1 Introduction 49

1.1 Traditional Network Model 49

1.2

Existing

WorkonCross-Layer Design 50

1.3 WSNMAC layer design 51

2 Our Proposed

Routing

Optimization

Using

MAC Layer Information 53 3 Our Proposed MAC Layer Optimization

Using Routing

Layer Information 55
(5)

3.2 Optimized

Sleep

Schedule 57

3.3 Proposed Route-aware MAC Optimization: FSM Control 60

3.4 Proposed Route-aware MAC Optimization: Protocol 61

4 Mathematical Analysis 62

4.1 Formulae 62

4.2 Visualization 62

5

Summary

63

Chapter 5 OPNET SimulationofData Aggregation

Timing

Control 64

1 Simulation Design Requirements 64

2 Locationfor

Aggregating

Data 65

2.1 Between MACand

Routing

Layer 65

2.2 Inthe

Routing

Layer 65

2.3 Betweenthe

Routing

andApplication Layer 65

3 Simulation Implementation Overview 66

3.1 Intelligent Optimization BasedonLate Packets 69

4 SimulationDetails 69

4.1 Packet Format 69

4.2 Node FSM 70

4.3_ Sink FSM 71

5 OPNET Simulator Implementation 72

5.1 Sensor Node Simulation Model 72

5.2 Sink FSM 73

5.3

Routing

Layer 76

6 SimulationResults 78

6.1 ProofofConcept 79

6.2 PerformanceComparison 82

6.3 ComparisonResults 84

7

Summary

89

Chapter 6 ConclusionandFutureWork 91

1 DataAggregation

Timing

Control 91

2 Cross-Layer Optimized Design 91

3 Future Work 92

(6)
[image:6.546.82.466.83.589.2]

List

of

Figures

Figure 1: SimpleTree Network 9

Figure 2: TypicalWSN

Using

a

Clustering Topology

9

Figure3: Simple Network

Query

Response,

No Aggregation 13

Figure4: Unbalanced NetworkwithPotential

Timing

Problem 14

Figure5: Response Times for Several Aggregation Schemes 16

Figure 6: Simple NetworkwithPotential Aggregation

Timing

Problem 22

Figure 7: Simple TreeNetwork 24

Figure 8: TreeSizevs.Cluster

Complexity

25

Figure9:

Setup

vs.DataCollection Phases 28

Figure 10: FSM#1 33

Figure 11: FSM #2 33

Figure 12: FSM #3 34

Figure 13: Unbalanced

Tree,

Aggregation Example 35

Figure 14: NetworkwithUnbalanced Tree 36

Figure 15: Chain Plus Child Near Leaf 37

Figure 16: Chain Plus Child Near Sink 37

Figure 17:

Energy

Savingsvs.PercentageofAggregation 42

Figure 18: NumberofMessages Receivedvs.Aggregation Period 43

Figure 19: Simplified NumberofMessages Receivedvs.AggregationPeriod 43

Figure 20: Convergence Timefor FSM #1 44

Figure 21: Convergence Time for FSM #2 45

Figure22: Convergence Time for FSM #3 46

Figure 23: HotspotCreated

by Routing

Algorithm 54

Figure24:

Topology

with

Likely

Channel Contention Problem 55

Figure 25:

Energy

Savingswith

Variably Participating

Nodes 57

Figure 26: Example Route 58

Figure 27: Original

Sleep

Schedule 59

Figure 28: Optimized

Sleep

Schedule 59

Figure29: Finite State Machine 60

Figure 30: Idle

Listening Energy

vs.

Duty

CycleandActive Route 63

Figure 31: Idle

Listening Energy

vs.

Connectivity

andActive Route 63

Figure 32: Node FSM 70

Figure 33: Exampleof

Query

Propagationfrom SinktoLeaves 71

Figure34: OPNET ImplementationofNode FSM 73

Figure 35: Simulated NetworkLayout 74

Figure 36: Data Sink Node Model 74

Figure 37:OPNET ImplementationofData Sink 75

Figure38: OPNET Implementationof

Routing

Layer 78

Figure 39: NumberofResponses Reduced

Dynamically

79

Figure40: Timeout Period Reduced

Dynamically

80

Figure41: NumberofResponses Increased

Dynamically

81

Figure42: Timeout Period Increased

Dynamically

81
(7)

Figure 44: Timeout

Period,

n_opt=10 86

Figure 45: Timeout

Period,

Time

Average,

n_opt= 10 87

Figure 46: Timeout

Period,

nopt=8 88

Figure 47: Numberof

Responses,

Time

Average,

nopt=
(8)

Chapter 1 Introduction

1

Abstract

Overthepastfew years,advancesinelectrical engineering haveallowed electronicdevices toshrinkin bothsize and cost. It has becomepossibletoincorporateenvironmental sensors into asingledevicewith a microprocessor andmemorytointerpretthedataand wirelesstransceivers tocommunicatethedata. These "sensornodes"

have becomesmall andcheapenoughthatthey

can be distributed in verylarge numbers intothe areato bemonitored and can beconsidered

disposable. Once

deployed,

thesesensor nodes shouldbeabletoself-organizethemselvesintoa usable network. These "wirelesssensor

networks,"

or

WSNs,

differ fromother adhocnetworks

mainly inthewaythattheyare used. Forexample, inadhocnetworks of personal computers, messages are addressedfromonePCtoanother. Ifa message cannotberouted, thenetworkhas failed. In

WSNs,

dataabouttheenvironmentisrequested

by

the"datasink."

If anyor multiple sensor nodes can return an informativeresponseto this request, thenetworkhassucceeded. A

networkthatisviewedintermsofthedata itcandeliverasopposedtotheindividual devicesthat makeit up has beentermeda

"data-centric"

network[26]. The individualsensor nodesmay fail

to respond to a query, oreven

die,

as

long

as the final result is valid. The network is only considered useless when no usabledatacanbe delivered.

In thisthesis,wefocuson twoaspects. Thefirst is data aggregation with accuratetiming control. Inordertomaintaina certaindegreeof servicequalityand a reasonablesystem

lifetime,

energy needs to be optimized at every stage of system operation. Because wireless communication consumes a major amount ofthe limited

battery

powerforthese sensornodes,

we proposeto limittheamount ofdatatransmitted

by

combiningredundantandcomplimentary

data as much as possible in order to transmit smaller and fewer messages.

By

using mathematical models and computer simulations, we will show that our aggregation-focused protocol

does, indeed,

extend system lifetime. Our secondary focus isa study of cross-layer

design. Wearguethattheextremelyspecialized use ofWSNsshould convince us nottoadhere

to thetraditionalOSI networkingmodel. Throughourexperiments,we will showthatsignificant

(9)

2

Data Aggregation

The first issue we have investigated in this thesis work is related to the issue ofData Aggregation. Themostimportant issue in Wireless Sensor Networks is energyconsumption. To

this end,many networkingschemes attempttominimizetheamount ofdatatransmitted

by

using dataaggregation. Thistradesoffdatafreshnessand

delay

forsavingsinenergy,becausereports

fromsensor nodesthatarriveat anaggregatingnodemay havetobe heldthereforsome period

oftime before

being

reportedso thatadditional reportsmay reach theaggregatorfrom slower1 nodes. Weproposetouse an intelligenttimerand somehigh-level knowledgeofthenetworkto

implementan efficient aggregation timingcontrol protocol. Ourprotocol aims to

dynamically

changethedataaggregation periodaccordingto theaggregation quality. The datasink willissue

requeststhatincludebothadesirednumber of responses and a maximumtimeinwhichitwishes

to receive them. In some situations, the sink may only require responses from a few sensor

nodes out ofthe

field,

butthe timelinessoftheseresponsesisstillimportant.

Aggregating

nodes attempt to provide as many responses as possible within time constraints, which also allows

maximum aggregation and energy savings. Responses that cannot be provided in time are

ignored andthe sinkis responsible for specifyingmore relaxed timing constraints for itsnext

query.

3

Cross-Layer Optimized Design

Inthisthesis,inadditionto thetimingcontrol scheme addressed above, we aimtodefinea

customized cross-layer network model for WSNs that reduces overall power consumption

by

making anyseparatelayersaware of usefulinformation fromotherlayers. Weintendtodesigna

network model that integrates routing,

MAC,

and other operations with the goal of overall systemefficiencyandlow energyconsumption. InformationavailableintheMAC layerwillbe

used tomake more efficientrouting layer decisions. Lower-layer information more accurately describesthecondition ofthe network,so shouldbetightlyintegratedwithhigherlayers

4

Research

Methodology

1

Weusethe term"slower"heremerelytodenotethata report of a related eventtakeslongertoreachthe aggregator. This may becausedbythenetworktopology,theTDMA schedulingpolicy,orphysically lost

(10)

We intendtoverifyourideas both mathematicallyandusingcomputer simulations. Wewill

usetheOPNETsimulatorto implementa sensor network with ourdataaggregation algorithms

[31]. Our OPNET model will allow easy implementation of various routing and MAC

algorithms,as well as new aggregation algorithms astheyare proposed. Wewill also useMatlab toprogram andverifyour numerical analyses.

5

Thesis Organization

Therest ofthis thesisisorganizedasfollows. In Chapter 2we presentthestate oftheartin

regardto sensor networks andthe twoissues we wishto address: dataaggregation and cross-layerdesign. Chapter 3 is devotedto dataaggregation timing in sensor networks and howto

intelligently

and

dynamically

update the timing to satisfy

latency

and energy savings requirements. Chapter 4 analyzestheenergysavings

by

designing

a MACscheme thatmakes use ofrouting information. Chapter 5 showsdetailedsimulation results ofthedataaggregation
(11)

Chapter

2 Background

and

Related

Research

This chapter describes our background research and some related works in the field of

wireless sensor networks. We firstpresenta general introductiontosensor networktopologies,

followed

by

specific backgroundontheresearchissuesofthisthesis,i.e.timingcontrol in data

aggregation and cross-layerdesign. We describethe current research in theseareas

here,

and thenexplore ournewlyproposedideas in Chapters

3, 4,

and5.

1

Wireless Sensor Networks

1.1

Common Network Topologies

Wireless sensor networks typically contain thousands or tens ofthousands of nodes, but

spatiallyor

logically

related nodesmaypossess redundantdata [21]. Forthese reasons,manyof thenetworktopologiesusedinsmaller scale ad hocandaddress-centric wired networks are not

appropriatefor WSNs.

Atypicalnetworktopologyfor"wired"

networksisatree. Therootnodesareusually DNSs

(domain controllers), and messages are routed through intermediate controllers to end user

computers, orleaves. Aproposedtopologyfor WSNs is clustering,whereby 50or 100sensor nodesare grouped together and,forthemostpart, are used as a single entity. Inthis thesiswe proposeto make use ofboth methods(i.e. clusters and trees), so thatgroups of sensor nodes combinetheirreports atthelowest

level,

thenreportsmaycontinuetobeaggregated astheypass

back uptheaggregationtree[20].

1.2

Clustering

Many

WSN topologies make use ofclustering [17].

Usually,

a number of regular sensor

nodesare organized underthecontrol of a

"clusterhead."

Withintheclusterthetopologyvaries.

The nodes may be organizedintoa flat topology, a tree

hierarchy,

etc. Consider thecase in

whichthereare a numberofclusters,each with a clusterheadthatreportstoa commondatasink.

In this case, theclusterheads will

likely

be organized into a treewith the sink atthe root, as
(12)

be multiple levelswithin the tree with several clusterheads reporting to a singleintermediate

nodethat,intum,transmitsresultsto thesink.

Level2, Sink/Clusterhead

Level 1,

Clusterhead/Node

Level0,

Cluster/Leaf Node

[image:12.546.84.458.314.549.2]

Figure 1: SimpleTree Network

Figure 2 shows a networkthathas been divided up intoclusters[13]. Inthe

figure,

eachof

the three clusters has aclusterhead thatcommunicates

directly

with the command node (data

sink). Theregular sensor nodescommunicateonlywiththeclusterheads and perhaps one ortwo

intermediate sensorin thisdiagram. In addition, the clusterheadsmaycommunicate between

themselves. Theseclusterheads may be especiallypowerful nodes, but oftenthey are regular

sensor nodesthathave beengiven additional responsibilities on atemporarybasis.

(13)

1.3

Trees

Traditional networking schemes make

heavy

use of some sort ofMasters or Controllers.

Thiscentral server would beresponsiblefor

individually

queryingeach nodeandaggregatingall

ofthefinal data

[42]

andisanalogousto theend sinkin WSNs.

Often,

thecontrollerisassumed

tohaveinfinitecomputational resources. In

WSNs,

thesemaster nodes

(clusterheads)

are most

likely

regular sensor nodes that have been tasked with greater responsibility. Multiple

clusterheads are organized under a data sink, but these local leaders do not have infinite resources as in the traditional paradigm.

They

arejust as resource-constrained as the other members of their group, so most of the computation and communication should remain

distributed,

withonlythecore coordinationfunctions

being

givento themaster. In Figure

1,

the

sinkmayormaynotbeenergy-constrained,but intermediateclusterheads (level 1

)

should still

only begiven minimal extratasks.

Ifatreeistobeused asthenetworktopology,itcanbecreated

by

someexistingalgorithms [49]. For

instance,

abreadth-first-searchcan create anoptimally balancedtree. Thiswillgreatly aid network management and data aggregation, however the complexity ofthis algorithm is

Otime(diameter)

and0meSsage(nxdiameter+|E|). Thisis rarelyusedinsensor networksdueto its

lackofscalability [42]. Eventhecommon minimumspanningtree

(MST)

has complexity 0(n

)

[13]. Inaddition,MSTalgorithmsdonottakeintoaccountthebroadcastnatureofWSNs. For

example,atransmissiontoadistantnodemay beoverheard

by

closernodes,butcommunication

to eachindividual node is modeled

by

a weighted edge [32]. Thechoice ofthe tree-building

structureis

important,

butoutsidethescope ofthis thesis. Severalschemeshave beensuggested for

building

tree networks appropriate for sensor networks [49].

Specifically,

our data aggregation protocolsare compatible withanytypeoftree topology.

1.4 Ad

Hoc Peer Network

vs.

WSN

Wirelesssensor networksare,ineffect,specialized adhocnetworks.

However,

theydiffer in several ways. Whenonethinksof a

"traditional"

adhocnetwork,one envisions several PCsor perhaps wireless devices communicatingwithin alocal area. Communicationprotocols reflect

this typeof network

by

stressingperformance metrics such asreliability, bandwidthutilization,
(14)

energy-constrained. Whena node's

battery

powerisexhaustedit isnolongerusefulinthenetwork and thenetwork suffersexponentiallyas nodes continuetodie.

Individualnodesmay

frequently

becomeunreachableduetodynamicenvironmental

factors,

malfunctions, and energy depletion. The whole network must be able to recover from these failures as efficiently as possible and would preferably operate transparently to them. In addition,WSNsare subjecttothe

following

listofdifferencesfromtraditionaladhocnetworks [1]:

Thenumberof sensor nodesina sensor network canbeseveral orders of magnitude higherthanthenodesinan adhocnetwork.

Sensornodes are

densely

deployed. Sensornodesare pronetofailures.

Thetopologyof a sensor network changesvery frequently.

Sensornodesmainlyusebroadcastcommunication paradigm whereas mostadhoc networks arebasedon point-to-point communications.

Sensornodes arelimitedinpower,computationalcapacities,and memory.

Sensornodes may not haveglobal identification

(ID)

because ofthe large amount of overhead andlargenumber of sensors.

AllofthesefeaturescombinetomakeWSNsa unique area ofresearch. Previous networking

designs areserviceable inmost cases, butthey perform sub-optimally whenenergy-efficiency and systemlifetime becomethetopconcerns.

WSNs differfrom"wired"

andmanyother wireless networksinthattheyare adhocand self-configuring. There isno central controllerthat is aware of eachindividual sensornode;nodes must communicate with direct neighbors to ensure overall network connectivity. Though cellular networks communicate wirelessly, the distribution ofcontrolling entities

(towers)

is well-planned andtheyhavesufficientprocessingpowertocontrolthedeviceswithin theircell. Internet protocols can

dynamically

discover routes between any two

hosts,

but intermediate routersbetween a source and a destination are much more stable than in a wireless network. Messagesaregenerallyroutedupa

hierarchy

ofdedicated hardwaretoan establishedbackbone. WSNsmust transmitsenseddata from distributedsources through theirpeersback to a single sink. Sensornodes have high failureratesand may movearound a sensor

field,

breaking

old links and creatingnew ones. A route evaluated as efficient

during

one communication round
(15)

thenetworklayer. Abovethis,any numberof applicationsmayrun. Sensornetworks arevery

application specificinthatan entire networkisconfiguredanddeployed fora single purpose.

Wirelesssensor networks canbecomparedtocommonBluetoothnetworks. Like

WSNs,

a

Bluetooth network must be able to self-configure within a reasonable amount oftime after

deployment (ad

hoc),

and uses apotentially

lossy

communication medium. Bluetoothnetworks

formthemselvesintoa startopologymadeupof a single master andupto seven slaves. These

mini-networks are termed a

"piconet."

Within this piconet, the master is responsible for

assigning both a TDMA schedule and a

frequency-hopping

schedule.

Physically,

Bluetooth

devices usually transmit at around 20 dBm

[1],

and a piconet might cover 10 or 12 meters.

Nodes inthesenetworksmay be

battery

powered,butarelargeenough andfewenoughthatthey

canbe easilycollected, recharged,and maintained.

On the other

hand,

WSNs are madeup ofmany more nodes with shorter communication ranges. This has led to the idea ofclustering,where a clustermay be thoughtof as a large

piconet,butnumerous clusters mustinteracttoserve as a usable network

[2]

[10]. Thescale of thenetwork exacerbatestheproblemofMAC-Iayercontrol considerably. Thenetworktopology must be maintained under extremely dynamic conditions, due to mobility but also the high failureand error rate ofthesensor nodehardware.

The

following

section provides the state-of-the-art in the area ofData Aggregation

()

in

wireless sensornetworks,whichisone ofthefocusesofthisthesis.

2

Data Aggregation

Somecommunication schemes usethenaive approach of

having

everynode communicateits datatoeveryother node. As discussed in

[1],

thisis inappropriate forsensor networks.

Only

a powerful master or controller node could even addressthe potential hundreds ofthousands of sensor nodes. Insensor networksthisis not evennecessary,as onlythesinkcan make use of collecteddata.

Dataaggregationis a mechanism

by

whichonlya portion ofthedatareceived

by

a host is

retransmitted. Thisis done

intelligently

asthehost

(or,

ina wireless sensornetwork, thesensor

node) uses some criteria to decide specifically what data need not be retransmitted. Most

(16)

able to transmitfewermessage

headers,

thoughall original datamust stillbetransmitted. Also easilycalculatedbutcapable ofshowingmoredramatic energysavings arefunctionssuch as min and max. For example,a chain ofk nodes,each with an8-bitvaluetoreport can communicate

the minimum or maximum value from one end of the chain to the other with only -1 transmissionsandk-\ mathematical comparisons. Two 8-bitvalues are compared at each node andonlyoneisretransmitted

by

each. This isaformofcompressingrepresentativedata intoa singlefinalaggregated result.

Figure 3shows a simplified network with an establishedhierarchy. Weconsider a network

tobeorganizedinatree

topology

forreasonsdiscussed later inthis thesis. Itcanbeseenthat,if

noaggregationis performed,therewillbeatotalof18transmissionsassumingallleafnodes and onlytheleafnodes respondto thequery. Shouldtheintermediatenodesalsohave datato report, therewillbe 21 transmissions. Ifresponsesfromtheleafnodes couldbecombined intoa single response at eachintermediatenodetherewouldonly be 12transmissions torespondto thesame query.

Non-aggregated responses, 9 transmissions

si\A\A*

Individual responses, 9 transmissions

Figure 3: SimpleNetworkQuery Response,No Aggregation

Though ithas beenproventhatdataaggregation can saveenergy inwireless sensornetworks,

research issues such as where and when to perform the aggregation still exist [5]. Network designersrequire,

first,

an algorithmthatcan choose an optimallocation inthenetworktopology

toperformdataaggregation.

Some aggregation schemes that have been proposed are shortest path tree

(SPT),

center

nearestsource

(CNS),

andgreedy incrementaltree

(GIT) [26]

[43]. SPT isthesimplest ofthese.

Each messageis routed to the sink viatheshortest path. Data is aggregated opportunistically

(17)

aggregationrequires the most overhead tosetupout ofthese threeaggregation schemes [26]. SimilartoDjikstra'slink-state routingalgorithm,aggregation pointsare chosen

iteratively

before

aggregationbegins. Thisrequiresknowledgeoftheentire network andlinkcosts. Though it is notnecessarilya priori

knowledge,

itcouldbeconsideredsuch,sincetheaggregationtreemust becreated

independently

fromtheroutingtopology[22].

2.1 Data Aggregation

Timing

Most researchers agree that data aggregation is a useful technique for reducing energy

consumption in wireless sensor networks

[26]

[41].

However,

how to determine appropriate aggregation

timing

characteristics remains a

largely

unexploredfield.

Inalargescale

WSN,

theremay bea significant

delay

betweenwhen an eventissensed(ora

query is answered) and when thedata reachesthe sink. For maximum energysavings, each aggregating sensor should be able to transmit all data received from all ofits children at one time. Thiscouldleadtoan even larger

delay

betweenwhen thedataoriginates and when itis

finally

reported, as each aggregating node may have to pause before sending a report to its parent

Depending

onthepriorityofenergy savings vs. maximum

latency,

it may make more senseforan aggregatortowaitforall, some,oronlythefirstofitschildrentorespond.

Forexample, Figure 4 shows a network with a potential timingproblem. In the case in

which leaf nodes and only leaf nodes respond to a query, if Node B wishes to aggregate responses from itschildren, it will be facedwith adecision of whethertowaitfortheslower responsefrom Node Corto transmit theresponsefrom Node D firstand sendC'sresponselater. Thisassumesthateachtransmissionover a wirelesslinktakesexactlythesame amount oftime. Ifthisisnotthe case, thetimingproblembecomesmore complicated andless deterministic.

Level 3 B39 Sink

Level2

Level 1

Level 0

Figure4: Unbalanced NetworkwithPotentialTimingProblem

(18)

aggregation, but stopshort ofspecifyingdetails forany specific implementation. Others have

proposeddetailed aggregation schemes intendedtobeused withDirected

Diffusion,

including

"Greedy

Aggregation"

[41]

and CLUDDA [7].

However,

none ofthese address the issue of

timing

controlbetween differentlevelsof sensors. Mostaggregation schemesfocuson

finding

an optimal node at which to aggregate

data,

simply mentioningthat aggregationis performed

thereand oftenassuming intheircalculationsthatall responses received are aggregated before

being

propagated.

Based onthe

timing

modelsdescribed

by

SolisandObraczka in

[41],

existingperiodicdata

aggregation protocols canbeclassified into three categories,namely:periodicsimple,periodic

per-hop,and periodicper-hopadjusted. Periodicsimpleaggregation works

by having

each node wait a pre-defined period oftime,aggregate alldata itemsreceived,and send out a single packet

containing the result. Aggregation mechanisms in theperiodicper-hop category have nodes

sendtheaggregated packet as soon as

they

hearfromalltheirchildren.

Excessively

latereports are dropped as inperiodic simple.

Finally,

periodicper-hop adjusted schemes use the same

basicprinciple ofperiodicper-hop butschedule a node's timeoutbased on itsposition in the

distribution tree (rooted at the information sink and spanning all reporting and intermediate

nodes). The aggregation scheme proposed

by

this thesis fallswithin this category, and, when

comparedto otherexistingperiodicper-hopadjustedalgorithms,presentsbenefitssuch as not

requiringclock synchronization amongnodes andminimizingthe timeoutschedulingoverhead.

Thedetailswillbe discussed laterinthe thesis.

WereferagaintoFigure3. Nodesa,

b,

and c are child nodes.

They

transmittheirresponses

atthe end oftheirrespectivetimeouts, whichare notnecessarilysynchronized.

Theoretically,

sincetheyare allleafnodes,theycouldtransmit earlier,butthisexampleholdsfornodesinternal

to the treeas well. Node d isan aggregator and the parent oftheotherthree nodes. It will

ideally

receive responsesfromall ofitschildren,butwilltimeout after somelengthoftime.

Figure 5 shows the response schedule forrepresentative periodic simple, periodicper-hop,

and periodicper-hopadjustedschemes astheywouldapplyto the treeshownin Figure 3. Two

examplesare givenforeachscheme;child nodesrespond atthesametimeinallthreeschemesin

each example. Theshaded sectionsindicatetimebefore theaggregator'stimeout that itwould

nothavetowaitbecause itschildrenhaveall responded. Inthesecondexample,not all children

(19)

Nodea

II

II

Periodic b

II

II

Simple c

II

II

d

1

1

1

'

1

Nodea

~r

I

I- ~T~

Periodic b

l

r

I

Per-hop c T~

~i

I

T~

d

i

ii

i

i; .

Nodea

~Tl

~n

Periodic b

~i

n

Per-hop c T~

n

Adjusted $ 1i.^^i

r

U

Figure 5: Response Times for Several Aggregation Schemes

Periodicsimpleisthesimplestto

implement,

butexhibitsthemaximum

latency

inall cases.

Periodic per-hopreducesthe

latency

in thefirstexample, as expected. Inthesecondexample,

both oftheseschemes must

drop

nodeb's response(or wait until thenextreporting period).

Finally,

periodicper-hopadjusted performs

ideally

in bothexamples. Becausethechildren are

configured with a shortertimeout, this scheme can aggregate all responses before its timeout

eveninthesecond example.

Directed Diffusion's communication paradigm is basedon information sinks

broadcasting

requests, orinterests,forrelevantdata. Nodeswhohaverelevant informationrespond to these

requests anddatapathsareformed alongthereturn route. "Better"routes aredetermined

by

the

numberand qualityof responses that are sent along it. Data is aggregated opportunistically;

wheneveridenticalresponses orqueries meet at a nodeonlyoneisretransmitted. Though every

node canpotentiallyperformaggregation,nodesintheshortest pathfrom informationsourcesto

thesinks are responsibleformostoftheenergysavings[22].

In periodic simple aggregation protocols, all nodes wait a pre-defined amount of time,

aggregate all thedatareceived withinthat period, and send out a single packet[41]. Directed

Diffusionfalls intothis categoryonlywhen all nodeshaverelevantdatato send. In this case,

reinforcementqueriesfromthesinkspecifythedesiredresponse rate. Notethat nodesare not

necessarilysynchronized when

"clocking

out"

data. Thoughseveralcloselygrouped nodesmay

respond at an identical rateof once perminute, these individualresponses may not go out at

(20)

TAG

[28],

or

Tiny

AGreggation,

is a good example of a periodic per-hop adjusted

aggregation mechanism(see Figure 5). TAG uses aggregationas queries are processed within thenetwork. Somequeries in TAGrequest reportstobesentfromsensors periodically. Inthis

case,TAG

intelligently

subdividesthedatacollection

"epoch"

intosmallerslots. Eachslotisthe

epoch length divided

by

D,

the depth ofthe tree.

Following

per-hop adjusted aggregation

operation, slots are assigned to nodes in

decreasing

order,

D, D-l, D-2,

..., as the query propagatesthrough thenetwork. Thisscheme requiresknowledgeofthenetworktopologyand

time synchronizationbetween nodes,but allows nodes topowerdown when notscheduled to

transmit or receive. Our proposed aggregation scheme is not affected

by

the potential sleep

schedule and ourMAC layerscheme,discussed

later,

takesfulladvantage ofit.

In additiontoTAG

[28],

another well-received aggregation schemeis AIDA [15]. Though

both ofthese schemes suggest several potentially energy-saving

ideas,

they focus on disjoint

aspects of aggregation. TAG'smainfocus ison an efficientquerying languagethatisconducive

to aggregation,but is mainlyan application-level optimization.

AIDA,

ontheother

hand,

inserts

a newlayer intotheprotocol stackthatinterprets and repackagesdataneartheMAC

layer,

but doesnot considerdynamictimingparametersbased on application-level requirements. Bothof

theseframeworksare usefulforreducing energyconsumptionin WSNsand are compatible with

ourtimingprotocol,but donotadequatelyaddresstheissuesthatwehopetosolve. Theconcept of

"cascading

timeouts,"

where nodes would waitfora period oftime

directly

related to their depth in the aggregation tree, was recently proposed in

[41] by

Solis and Obraczka. Though this is a potentially useful optimization, its main shortcoming is that it

requires significantadditionaldatatobetransferred

during

thesetupperiod. Ourtimingcontrol

scheme requires minimal overhead, but is flexible enough to allow expansion for later optimizations.

Wenextdiscussthestate-of-the-artinthearea of cross-layer

design;

thesecondfocusofthis thesis.

3

Cross-Layer Design

3.

1

Traditional

Network Model

-OSI Stack

The

following

isabriefoverview of what we considerthe traditionalnetworkprogramming

model, as defined

by

the International Standards Organization. A moredetailed description is
(21)

Table 1

Layer Name Function

Layer 7 Application User interface

Layer 6 Presentation Formatconversionandinterfaceto theapplicationlayer Layer 5 Session Maintainsmultiplelow-levelconnections asa singleentity

for logicalorganizationintheapplicationlayer. Layer 4 Transport End-to-endreliability

Layer 3 Network

Routing

Layer 2 Data-link

(MAC)

Neighbor-to-neighborcommunication

Layer 1 Physical Communicationacrossthephysical medium

(wire,

fiber optic, radio,etc.)

Ofthese, layers 5 and6arerarelyconsideredseparately insensor networks. Theoperation oftheapplicationlayer is dictated

by

thepurposeofthesensornetwork. Acommon view ofthe

sensor networkisas adatabase. Inthis case, theapplicationbecomesthequery interpreterand little isrequired ofthepresentation or session

layers,

sotheymay be practicallyeliminated.

Thetransport layer is responsiblemainly for end-to-end reliability. This is a specialized requirementin sensor networksbecausetheirusagedeviates froman Internet-type host-to-host communication paradigm. Queriesmust reachmost,ifnotall, targetsensors and responses must be returned, perhapsanonymously, to the sink.

Assuring

that each query and each response

definitively

reachesits destination is infeasibleonthisscale. Becauseoftheredundancyofdata andlargenumber ofsensors, the

functionality

ofthe transportlayer is effectivelyaccomplished

by

therouting layer. Queriesand responses shouldberouted with somereliability, whichmay wellbe lessthan100%. Lostmessages and messages with errorsmay be ignored.

There have been a number of proposed optimizationsfor boththeroutingandMAC

layers,

howevermost all oftheresearchconsiders themseparately

[17] [21]

[48].

Only

recently have researchers begun to classify the problem ofofficially combining functions from these two layers. So

far,

research indicates that significant energy can besaved

by

eliminating at least

some oftheboundaries inthetraditionalmodel[36].

Thisisnotsimply a matterof computation and encapsulationthat must occurbetweentwo layers.

Obviously,

fewer interfaces betweenlayers will reduce code complexity. Inaddition, information traditionally not available to the MAC or

Routing

layer may allow them to
(22)

MAC and routing layers is of particular interest. We will analyze the benefits ofrelating

protocolsfromthese twolayersinChapter4.

3.2

Optimized MAC-layer

design based

on

routing information

Inregardto traditionalMAC layer protocols, TDMAcan make use ofinactivetimeslotsto

put some sensors to sleep and reduce energy use.

However,

were the sleep scheduling

mechanism apprised of whether or notthe sensor wastobeusedto route messages

during

the

nexttime slot, itcould opt notto turnon thesensorforthat slot and

thereby

save even more

energy. This isa majorissue for low datarate sensornetworks,where nodesmay berequiredto

awake from sleep mode farmore often than they are actuallyused to route data queries and

replies.

There exist several customizedMAC protocols, specifically designed forsensornetworks,

such asSensor MAC

(S-MAC)

[48]. Though itshows a reductionin energyconsumption,since it is stillroute-oblivious, itwastesenergy since sleepperiods are not coordinated withrouting

functions.

Even customized MAC protocols such as S-MAC cannot completely account for the

additionalenergyconsumeddueto thefactthatthesehighly-optimized routingprotocolsdonot

consider the operation ofthe MAC layer. As shown in

[36]

and

[50],

energy-efficiencyof

networks using these routing protocols can be improved

by

using a "route-aware" MAC

protocol. Thisreducestheseparationbetweentheoperation oftheMAC layerandrouting layer. We holdthat thisisa usefultechniquefor reducing energyconsumption and examinetheissue in

detail.

Ourprotocol will take thisconcept onestep further

by

combiningrelevantfunctionsofthree

majortraditionalnetworking

layers;

theMAC

layer,

therouting

layer,

andtheapplication layer. The authors of

[50]

analyze the benefits of different layers'

common optimizations as

implemented in wireless sensornetworks. Theirresults are discussed in detail in Chapter 4.

They

arguethat cross-layerdesign isa good

idea,

specifically becausetheoptimizations inone layer may counteract those in another. This is shown

by

the simple example of a routing

algorithm that discovers the shortest route from sensors that sense an event to the data sink. When an event occurs, perhaps after a period of complete

inactivity

on the wireless channel,

these multiple sensors all simultaneously report the event, causing a sudden escalation of

(23)

Zhang

and

Cheng

proposetheirownMACschemeaimedtoavoid contentioninthepresence

of

bursty

data transmissionsand routes pronetohotspotsthatare commonin WSNs. Few details

aregiven,butthemaindrawbackisthattheyspecify theuse of out-of-band signaling. S-MAC avoidsthisrequirementandthe twoideasshouldbe

functionally

compatible.

In

[33],

the authors analyze the idea ofusing MAC-layer information to create optimized

clusters, usually the responsibility ofthe routing layer. An interrogator in the MAC layer realizesitsrole and usesthiscriteriontobecomea clusterheadin therouting layer. Simulations showthat this isafeasibleapproachfor creating clusters. All packetprocessingoccurs inthe

MAC

layer,

which essentially combines the routing and MAC layers. This allows nodes to

dynamically

decide uponthenumber of neighborstocommunicate with(an importantcriterion in theirsystem) based on overheard MAC layer frames.

Logically,

one expectsthat creating

clustersbasedintheMAC layerismore efficientthancreatingacompletelyconnectednetwork, theneliminatingsome oftheselinks basedon aroutingprotocol. Thisassumesthattheclusters formed are comparablein efficiencyto thosethatwouldbe formed

by

usingadedicated routing layer clusteringalgorithm. Performanceoftheproposed protocolisshown,buttheauthorsstop

shortofany directcomparisons.

4

Summary

Wireless sensor networks are a relatively new area of research. As the technology has

developed,

many issues have arisen that are unique to these networks. This is prompting

research on efficient algorithms to organize the networks, how to spend the least amount of

energy to communicate

interesting

data,

and thedifferences ofWSNs that may requirea new networkprogrammingmodel.

After studyingthestate-of-the-artinthesetwo areas,we estimatedthatwecould supplement

and extendthework of others with our newideas on

Timing

Control in Data Aggregationand Cross-layer design. Weproposeto leveragethecurrentresearchand analyze several potential

optimizationsinthese areas. In the

following

chapters we will examine adetailedexample of
(24)

Chapter

3

A

Novel

Timing

Control for

Optimal

Data

Aggregation

We now present the first focus of this thesis. We will analyze the issue of timing

requirements when aggregating data in WSNs and propose a protocol thatprovides increased lifetime and performance. We then propose several new timing control algorithms that are

compatible with our protocol and attempt to

dynamically

update data aggregation timing parametersto extend system lifetime. These performance ofthe algorithms described inthis

chapter are evaluatedlaterinthis thesis.

Chapter organization: In this chapter, we first describe the problem to be addressed in

Section

1,

andthenin Section 2we enumerate some assumptionsnecessary forournetworking

topology setup. In Section 3 we provide detailed descriptions of multiple versions of our

aggregation timing control scheme and in Section 4 we perform a theoretical analysis ofits potentialenergysavings.

Finally,

in Section

5,

welistseveral example applications andhowour

aggregation scheme canbeappliedinnetworks with

differing

priorities.

1

Problem Statement

In orderto minimizeenergyconsumption, many networking schemes attemptto minimize

the amount ofdatatransmitted

by

using someform ofdata aggregation. This tradesoffdata

freshness forsavingsinenergy,becausereportsfromsensor nodesthatarrive at anaggregating

nodemay havetobe heldthereforsome period oftimebefore

being

reportedsothatadditional

reportsmayreachtheaggregatorfromslower nodes. This isa separateissue fromtheprocessing

timeneededtoaggregatedata frommultiplesources.

For

instance,

in Figure

6,

nodeBwillreceivedatafromnodeDbefore itreceivesdata from

nodeC becausenodeCmust waittoreceivedata from bothnodesEand

F,

assumingthatallleaf nodes reportdataand aperfectMAC layer. Inthis example,should nodeBwaittohear from nodeCorpromptly forward D'smessageassoonasit isreceived?

There are a numberofissues that affect this

decision,

such as whetherB is aware ofthe
(25)

willhavetowaitfor exactlythe

delay

incurred

by

a single

hop

transmission, it mayopttowait for C's response(based on knowledgeofthemaximum response latency).

Otherwise,

it may havetowait until atimerexpires. Alsonotethatifthenetwork wereany

larger,

forexampleif

nodeE hadchildrenGand

H,

we would encounter a similartimingproblem,butwith adifferent solution sinceE isfartheraway fromthesink.

Sink

Level 1

Level 0

Figure6: Simple NetworkwithPotentialAggregationTimingProblem

Inthisthesis,we proposetouse a novel intelligenttimerand somehigh-level knowledgeof

the networktoimplementan efficient aggregation timingcontrol scheme. Basedon a node's

position inthe network,itwillknow how

long

itcan waitforreportsfrom itschildren without

exceedingthemaximum

latency

for itsown reporttoitsparent node. Itmustbepossibleforthe sinktoindicatethroughthenetworkthehighestacceptable

latency

andthenodesthroughout the network musthavesomeideaofthenetworktopology.

We assume a reasonably linear relationship betweenthenumber of messages and the time

period. It has beenshown

by

Yuan, Krishnamurthy,

andTripathi in

[49]

that,toapoint, thisis true. This is discussed later in this chapter. With this assumption and the knowledge listed

above, the sink and sensors will be able to calculate maximum timeouts that satisfy the application-leveltimingrequirements.

As discussed in Chapter

2,

timing models can beclassified into three categories, namely:

periodicsimple,periodicper-hop,and periodic per-hopadjusted. Forthepurposes oftimingin

our aggregation model, we propose an efficient periodic per-hop adjusted scheme whereby a

node,

being

aware of its distance in hops from the sink, can reduce the timeout period proportionallybefore retransmittingtherequest.

It is not necessary for a node to know the number oflevels below it in the tree as the

calculations arehandled

by

thesink. Thisis incontrasttoschemes wherethesink mustdiscover the networktopology, thenpropagate this informationtoall nodesin thenetwork. Each node
(26)

specifies too short of anaggregationperiod aggregators somewhere above the leafnodes will

timeout and return alimitednumber of responses.

Only

localsynchronizationisrequired,asthe

wireless propagationtime isassumedto benegligible andis accountedfor

by

the

dynamically

updated globaltimeout.

Thescheme proposedinthischapter canbeachievedsolely intherouting layeranddoesnot

interfere with any potential sleepschedule enforced

by

theMAC

layer,

which is discussed in

Chapter 4. In thenext section we listanddiscuss some assumptions relevanttotourproposed

aggregation

timing

mechanisms.

2

Assumptions

2.1

Topology

assumption:

Cluster-Tree

architecture

IntermsofWSNtopologiesforthepurpose of optimaldataaggregation,we proposetomake

use ofbothmethods(i.e.atreeconsistingofcluster-heads)

[20]

sothatgroups of sensor nodes

will combinetheirreports atthelowest

level,

thenreportsmaycontinuetobeaggregated asthey

passupthe aggregationtree[17]. Wearguethat thisisapromising routing implementation in

termsofscalabilityand overall energyefficiency. Sinceeach cluster-headfirstperforms local

aggregationin itsclusterbefore

forwarding

datatothenextcluster-head, this thesiswillfocuson

the data aggregation issue in the entire tree(i.e. between cluster-heads instead ofinside each

cluster).

Consider Figure 7. In this

figure,

each circle can represent a sensor node. In the

figure,

groups oftwoorthreesensor nodes atlevel iare groupedtogetherunderthecontrol of another

node atleveli+\. Nodesatlevel/areleaves inthe treeandfunction onlyas sensors. Thevast

majorityof nodesinthenetworkfall intothiscategory. Nodesatlevelj+1 and

higher,

uptothe

root ofthetree,wouldbeconsidered clusterheads. Thoughtheyare still sensor nodes andmay

have

individually

senseddatatoreturnto the sink,

they

arefewenoughinnumberthatour main

concern with them is how much data they forward.

Alternatively,

thecircles at level 0could

easily beentireclusters. Membersoftheseclusters communicateto therest ofthenetwork via

(27)

the organization will be the same. In

fact,

a

hierarchy

of some sortis practically guaranteed

whenclustering[13].

In this thesis, we usethe relative

terminology

interchangeably;

each circle in thediagram

may bethoughtof as anode,acluster,or a clusterhead andthecircle atthehighest levelwillbe

referredtoas eithertheclusterhead orthesink,

depending

onthecontext.

Level2,

Sink/Ctusterhead

Level 1, Clusterhead/Node

Level0, Ouster/Leaf Node

Figure7: SimpleTreeNetwork

OnthedeterminationofCluster Size:

Givenafieldof size m

by

nand

disregarding

theeffect of clusteroverlap,onehasa choice as

tohow largetomaketheclusters. Weconsidertheradius of a clustertober,ineitherdistance

orthenumber ofhops2. For simplicity,rshouldexactlydividemandn.

Thetotalarea covered

by

theclustersisthesum oftheareas covered

by

each oftheclusters.

Thetotalnumber of clustersisthenumber of clustersthatfitintothefield

horizontally

times the

number of clusters that fit into the field vertically. These qualities are expressed

by

the

following

formula:

x r r cov erage=

which simplifiesto

cov erage=

(mx

n)n,

showingthatrdoesnotaffectthe totalradio coverage ofthenetwork3.

Though mathematicallythecluster radiusdoesnot affect coverage ofthe

field,

thisdoesnot

representtheenergy efficiencyofthenetwork. Withalargercluster

diameter,

clusterheads will have to transmit fartherto reach each other, but the treeof clusterheads will be simpler. A

2

Inanevenlydistributed fieldofsensors, thesewillbeproportionalunits ofmeasure. 3

Thisapproximationholds ifsensors'

(28)

simplertree leads to more energy-efficienttopology maintenance at this level. Also with an

increasing

r,maintenance and communication costsforeach cluster willbe increasing.

There isabalancetobestruckbetweencluster size andtreesize. Thelargertheclustersare, the smallerthe tree of clusterheads may be and vice versa. Theradius of a cluster, r, is the

independent variable in this case. The dependent variable is the efficiency ofthe network topology. Thiscanbevisualized

by

the

following

figure:

Cluster Sizevs.NetworkComplexity

| *

TreeComplexity

--ClusterComplexity|

Figure8: Tree Sizevs.ClusterComplexity

Theoptimumbalance may be calculated offline and isnotpartof our protocol.

However,

thisplotdoesshowthat thereexists some optimal point. Asuboptimal choiceforrwill resultin

some amount of

inefficiency,

soit isnot enoughtousesolelyatreetopologyor a single cluster.

Asmentioned, we consider anetworkconsistingofa treeofclusters. Witha given number

ofnodes, largerclusters allow asimplertreeandvice versa.

Assuming

clusters arecircular, the

numberof nodesina cluster andthe complexityof eachincreasesexponentiallywithr. Thetree

[image:28.546.104.471.181.411.2]
(29)

showthecomplexityofcommunicationwithinthe tree to change

linearly

with r[13]. This isa

simplifiedexample, butshowsthat thereissomebalanceof organizationalefficiency betweena

network with a single cluster and a network inwhich each sensor nodeis a nodeinatree. The

optimumbalancemay becalculated offline andisnot part of our protocol. Asuboptimal choice

forrwill resultinsome amount of

inefficiency,

soit isnot enoughtousesolelyatreetopology

or a single cluster.

2.2 Other Assumptions:

Animportantnote aboutWSNs ingeneralisthattheyaredata-centric [16]. Thismeansthat

dataisrequested and returnedto thesinkbasedonitsrelevanceto the query,ratherthanbecause

it was requested from a specific node. For our aggregation scheme we assume queries are

definitive fornameddata. This isa minorassumption,and our protocolreallyonlyrequires that

onequerycanbe differentiated fromanother,whichisa reasonable requirement.

Thismeans thatend-to-end addressingwill notbeused, though

hop-to-hop

addressingwill

be. Even in WSNsthatuseclustering, thereis usuallya concept oflocal addressing (withinthe

cluster)

[25]

[26]. Messagesarebroadcasttoall other nodes within radio communication range.

The receiving node mustbe able to recognizeat least the typeof message in orderto decide

whethertointerpretthedatawithin,forwardthemessage,orignorethemessage.

A minor distinction to be made is the communication model. Commensurate with the

wireless nature of

WSNs,

messagesare assumedtobe broadcasttoall nodes withinradio range.

Thisnecessitatestheuse of either

hop-to-hop

addressingor messageIDstoavoidloops. Despite

thephysical

implementations,

conceptually messages are unicast from node to node in that a

parent can send a messagetoitschild and viceversa,with other nodesthatoverhearthemessage

discarding

it. Some WSNschemesrelyon gossiping topropagate messages[16]. This is not

requiredforoperation ofanyoftheaggregationschemesdiscussed inthis thesis.

The final assumption, which is necessary to this

discussion,

is that meaningful data

aggregation mustbeabletobeperformed. That

is,

datagenerated

by

thesensornodes mustbe

able tobe combined in some waythat shortens the total message,while retainingthe original

information. The specific aggregation function is beyond the scope ofthis thesis, but this

(30)

A query that asks forthe maximum value sensed is an ideal candidate for aggregation

becauseanynumber of messages canbe easilycombinedintoa single value attheaggregator. A

query fortheaverage over afield is slightlymore complicated inthat thenumberof responses

mustbe communicatedtocalculatethe average, butthefinal message will stillbeshorterthan

multipleindividualresponses.

Finally,

aquery forall values sensed wouldnotbea candidatefor

aggregationbecauseeach value mustbeappendedtothemessage andthefinalmessage willbe

proportionalinlengthto thenumber of sensorsresponding.

3

Proposed

Time

Synchronization

Algorithm

An important research topic necessary to the idea ofdata aggregation is timing control.

Figure 7 represents a simple network organizedintoa tree topology. Ina realistic WSN there

would be many more nodes and potentially a much deepertree. With deepertrees, timingis

even moreimportant [26].

Aggregationtradesoff

latency

for energysavings. Because aggregatingsensorshavetowait

for data fromtheirchildrentoarrive, therewillnecessarily besomeincrease inthe timeittakes

them to respondto queries. As illustrated in Figure

7,

a node atlevel 2 would need towait

longerthannodes atlevel 1 inordertosend aggregateddatatoitsparent node. There isadesign

tradeoffin the maximum amount oftime to wait. Ifan aggregating node waits too

long,

the

resultsofthequery maynotbetime-relevant. This is especiallytrueinsituations inwhichthe

datasinkmaywantinitialresultsimmediately.

Latency

tradeoffsareapplication-specific,sowe

assumethat theapplication

(running

onthesink)can choosetheoptimaltargetlatency.

Inmost aggregationschemes, latereports aresimply discarded

[26]

[49]. Wefeel that this

decisionshould be upto theapplication.

Depending

ontherequirements oftheapplication, it

maymake more sensetoacceptlateresponses. Notethat thisisa separateissue from usingthe

number oflateresponses tocalculatetheaggregation period. The ability fortheapplicationto

dictatetheaggregation action canbeadded

by

a single

flag

accompanying dataqueries. This is

discussed in detail in Section 3.4.

Amajorconclusiondrawn

by

theauthorsof

[49]

isthat theaggregationtree(not necessarily

thecommunicationtree)plays thelargestrole in theefficiencyoftheaggregation. While it is

true that theaggregationtopologyis

important,

it isnot enoughtodependon anidealtree. Steps
(31)

to performance. Ourprotocol aims to do this

by informing

interested nodes ofthe

(limited)

network

topology

inthemostenergy-efficientmanner.

Using

this

information,

thedatasink can

makeintelligent decisionsaboutthe tradeoffsbetweendata

latency

andpotentialenergysavings.

Our proposed aggregation

timing

control protocol, as do many others, makes use of a

separatesetupphasetodistribute parametersnecessary foraggregation. This istypical of, and

compatiblewith,mostdynamic routingandMACprotocols. The setupphasetakessomefinite

amount oftime and is followed

by

a much longer data collection period. These phases are

scheduledtooccur periodically. The

frequency

with which theyoccuris discussed in Section

3.2. The

following

figureshowsthegeneral scaleinwhich our protocol will operate.

Control Parameter SetupPhase

3=

]

Repeat

DataCollection Phase

Figure 9: Setupvs.DataCollectionPhases

3.1.1

Setup

phase

1. Sink broadcasts "depth

request."

Therequestispropagateddownthrough thenetwork

similarlytoa simpledataquery.

2. Messagereachesbottomandisreturnedbackup. Eachnode returns withits

hop

count,

starting from leavesatlevel 0. Theparents ofthesenodes realizethattheyare atlevel 1and

passthisinformation back up thetree. Thiscontinues untilthesink retrievesinformation

abouttheentiretree. Eachnode propagatesthedepthrequesttoitschildren and each node

returns an answertoitsparent,sothisisaccomplished withcomplexity<9(depth).

Inordertosaveenergy

during

thissetupphase,somelevelof aggregationmay beperformed

atthisstep. Possibilitiesvaryfrom

transmitting

informationthatcompletely describesthe

networktocommunicating onlythemaximumdepthofthe tree. Theseoptions arediscussed

more

fully

in Section 3.4.

3.

Finally,

thesinkwillcalculate andtransmitan appropriate valuefor T basedonthedepthof

thetree,themaximum

latency,

andtheoptimal number ofresponses,whichisassumedtobe
(32)

After performing thisexchange,eachaggregatingnode,

including

thesink, should haveall

knowledgenecessary forittocalculate an appropriatetimeoutperiod.

Depending

onthelevelof

aggregation performedin step

2,

nodesmay beawareofhow balancedor unbalancedthetree

is,

or parent nodesmay beaware oftheentire aggregationtreebelowtheirpositions.

3.1.2

Data

collection phase

1. Thesinktransmitsa new

querywith an updatedtimeoutperiod. This doesnot requireany

timesynchronizationbecausethesensors and aggregatorsjustmaintaintheprevious value

until notified. Aggregatorsthatreceivethequeryreduceituniformlybefore propagatingthe

queryasdescribed inSection3.5.

2. Eachaggregator replies withthenumber of replies received

(average,

perquery)andits

depth inthe tree. Notethatan aggregatorthatistheparentof anotheraggregatingnode will

sumitstotalnumber received

(Nrec)

withitschild's report. Aggregators may optionally

reportthenumber and

timing

oflatereports. Weterm these

Niate

and

Tiate

anddiscussthem

inthe

following

section.

3. Thesink will send out an updated

Tn+i

toallaggregators,basedontheFSM. cisa parameter

chosen

by

theapplicationinthesinkthatrelates

Nop,

toTopt.

3.1.3 Optimization

In order to get the most accurate view ofthe network, aggregators may also report the

number and/ortimeoflatereports. Thiswouldallowthe sink,ifpowerfulenough,tochoose an

appropriate aggregation period even moreintelligently.

For example, the "maximum latency" ofthe application may be provided along with a

flexibility

factor,

flex and its associated weight,flexing*. If the sink does not receive a

satisfactory number of responses, it may calculate the benefit of

increasing

the aggregation

period.

Disregarding

network

fluctuations,

thiscanbe done

by

simply countingthenumber of

nodes whoseadditionallatenesseswere reported tobe lessthantheminimum incrementofthe

aggregationperiod,T.

This number caneasily beweighted and compared to our application-levelparameters,flex

(33)

3.2

Control

Parameter

Refresh Rate

There isa concept of adata-collectionround. Thisroundmay includemultiple requests for

similar ordissimilar

data,

but uses thesame parameters fortiming, etc.forthedurationofthe

round. At the end of a round, the performance experienced is evaluated and parameters are

chosenforthenext round.

As mentioned, the Masternodes, aggregators, or"clusterheads" are

likely

ordinary sensor

nodes, tasked with additional responsibilities. Forthis reason,theMaster(or clusterhead) must

berotatedperiodicallytoprovide areasonablyeven loadon each sensor node

[17]

[40]. This

may be done after each round, or after a specific numberof rounds. This leads to another

tradeoff in energy efficiency. If the network is reconfigured

frequently,

it will lead to

unnecessaryoverheadintheformof control messages and calculations. Attheotherextreme,if

the network

topology

is chosen once and remains static, then a single node will bear

responsibility foran unfair amount oftimeand will

likely

die before itscluster members. This

alsodecreasestheoverall usefulness ofthe network,since whenthisclusterhead

dies,

it maytake

irretrievable datawithit.

It iscommon practicetouse asetupphase where we communicate allthenecessarycontrol

packetsforthecreation ofthenetwork

[17]

[39]. Atthispoint we couldconceivably informthe

aggregating nodes oftheirdepth inthe tree. This is theonly parameternecessary forabasic

aggregationtimingscheme. Anotheroptionistoembed certain control informationwithin each

datapackettransferred. Thisprovides moretimelyfeedbackonthestate ofthe system,butadds

overheadintheformofadditionaldata foreachtransmission.

Wefeelthatbothofthesemethods shouldbecombined. Asmentioned,it is necessarytouse

some sort ofsetupphasetocommunicatethetopologyoftheaggregation network. Thisis

likely

part ofthenetworksetupphaseanyway,andfor simplicitythisoverhead will notbeconsidered

in detail inthisthesis.

The onlyadditional informationthatmayneedtobepassedback upthetree

during

thedata

collection phaseisoptionalinformationsuch asthenumber of messages missed

by

an aggregator

duetoa shortaggregationperiod,or perhapsjustwhether or notanymessages were receivedlate

fromthe lastquery. Thisrequiresonlyafewadditionalbitsas part ofthedatamessagepackets,

(34)

3.3

Finite

State

Machine

Implementation

of

Algorithm

Several finite state machines were developed forevaluation. Each aimed to maintain the

number of messages received as close as possible to the optimal number (determined

by

the

application). In addition, itwasdesiredtoreachthispointasquicklyas possible. The

following

are variables usedinthesestate machines.

N0[),

Optimalnumber ofresponses; determined

by

theapplication.

Nrec

Numberof responsesreceived; tallied

by

theaggregators and reportedto thesink

T Maximumaggregationperiod,distributed

by

thesink L+

Maximum

latency

(application

level)

Tj

Maximumaggregation period(fora node atlevel

i)

n Theround;T iscalculated onceforeachround,whichisassumedtobe

long

enoughto

create aheuristicwithout

becoming

stale.

Topt

Aggregationperiodthatsatisfies

Nrec

=

Nopt

c Aparameter chosen

by

theapplicationinthesinkthatrelatesNopttoTopt.

Nia,c

Numberoflatepackets received

by

aggregators

TiaK

Timeunits whilewaiting for latepackets

D Depthofthe tree

K Levelof nodeiinthe tree.

? Differencebetween Tateachlevelofthe tree

If D is

known,

the sink can derivea maximum

T;

for each level inthe tree, based onthe

maximum

latency (L+)

oftheapplication:

7)

=V-KA

Equation1

Since the nodes know their own level in the tree, they can

individually

calculate their

aggregation periods, using the above formulaand ? ifavailable. Our protocol is capable of

distributing

this deltaas partofthecontrol message. Itwasfound

during

simulationsthat this

parametercouldbeeliminated. Detailscanbe found in Chapter 5. Thoughtheremay besome

advantageusingavariable

?,

thereisatradeoffintheamount of controldatatransmitted. This

issue isanalyzedinmoredetail in Section 3.5. Theempirical performance advantages ofusinga

deltaareleft for futureexperimentation.

Thesimplestfinitestate machineshownbelowwas

initially

derived fromtheapproximation

that the number of responses received

by

the sink is

directly

proportional to the aggregation

period. Thisisa reasonableinitialassumptionsince,

logically,

alongerperiod will allow more
(35)

for choosing an aggregation period, and the amount of control data necessary.

Minimizing

complexityshouldbetheaim ofany algorithm, especiallywhenenergyconsumptionisa major

concern.

This initial assumption may prove to be naive,

depending

on MAC and physical layer

performance and even the traffic model, which is network- and application-specific. Ifthisis

found tobethecase

(experimentally),

our protocol can be easily modified to allow optimized

performance. Evenwith anexampletrafficmodel as shownin

[38],

our protocol should perform

well without modification as it still

dynamically

changes the aggregation period. The most

notableshortcoming isthattheperiod willbereduced

linearly

oncetheoptimal pointisreached.

Thiscanbeeasilymodifiedifthe traffic ismeasuredas shownin

[38],

insteadofthemodel we

basedour protocol on.

Thestate machinein Figure 10 isthesimplest. Ateach evaluation oftheaggregationperiod,

theoptimal number of messages received iscomparedto theactual number received. Ifthere

weretoofewmessagesreceived, theaggregation periodisincreased

by

one atomic unit. Ifmore

messages were received than were needed, the aggregation period is

linearly

decreased in a

similar manner. The amount of increase and decrease can be varied statically to suit the

application and network characteristics.

Thestate machinein Figure 1 1 requires amore complicatedformulatocalculateTn+i. Inthe

figure,

there are additional parameters used to calculate T. This figure also introduces the

concept of an acceptabledeviation. Thiscanlimitthenumberofunnecessarychangesin Tand

theattendantbroadcasts.

Thefinalstatemachine,shownin Figure

12,

doesnot make use ofthisacceptabledeviation

parameter, but abstracts the multiplicand responsible for

increasing

and

decreasing

the

aggregation periodinordertoreducethetimeneededtoprovideNopl.

Thetheoreticalperformance ofthesevarious state machi

Figure

Figure 1: Simple Tree Network
Figure 2thesink).themselves.sensorintermediate three shows a network that has been divided up into clusters [13]
Figure 8: Tree Size vs. Cluster Complexity
Figure 10: FSM #1
+7

References

Related documents

In addition, MT-RPL allows the mobile node to send packets opportunistically, so fixed nodes closer to the root may receive these packets, which will reduce the possibility of

Data acquired from each node is processed, displayed and sent to master processor (CPLD XC9572) that compile data received from different nodes and send this information

The purpose of the node is to acquire data from a connected sensor based on a wireless command packet received from the base station, and send a packet to the base station

At that time, using the ALERT preamble, the node broadcasts a new SYNC packet, with its own address, the address of the sink, the hop distance to the sink, its own