• No results found

Data Mining in Vehicular Sensor Networks: Technical and Marketing Challenges

N/A
N/A
Protected

Academic year: 2021

Share "Data Mining in Vehicular Sensor Networks: Technical and Marketing Challenges"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Data Mining in Vehicular Sensor

Data Mining in Vehicular Sensor

Networks: Technical and Marketing

Networks: Technical and Marketing

Challenges

Challenges

Hillol Kargupta Hillol Kargupta Agnik

Agnik & University of Maryland, Baltimore County& University of Maryland, Baltimore County http://

http://www.cs.umbc.edu/~hillolwww.cs.umbc.edu/~hillol http://

(2)

Roadmap

Roadmap

„

„ MotivationMotivation „

„ Mining vehicular sensor networkMining vehicular sensor network „

„ Building MineFleet Building MineFleet „

„ ChallengesChallenges „

„ Some Algorithmic SolutionsSome Algorithmic Solutions „

(3)

Vehicles: Source of High Volume Data Streams

Vehicles: Source of High Volume Data Streams

„

„ Vehicles generate tons of dataVehicles generate tons of data „

„ Hundreds of different Hundreds of different

parameters from different parameters from different

subsystems subsystems

„

„ High throughput data streamsHigh throughput data streams „

(4)

Breakdowns cost Breakdowns cost thousands of thousands of dollars dollars Bad driving Bad driving costs money costs money--- ---fuel, brake shoe, fuel, brake shoe,

Why Mine Vehicle Data?

Why Mine Vehicle Data?

„

„ Fuel consumption analysisFuel consumption analysis „

„ Fleet analyticsFleet analytics „

„ Vehicle benchmarkingVehicle benchmarking „

„ Predictive health-Predictive health-monitoringmonitoring „

„ Driver behavior analyticsDriver behavior analytics

High gas prices High gas prices

(5)

Fuel Subsystem: Sample Attributes

Fuel Subsystem: Sample Attributes

„

„ Air Fuel RatioAir Fuel Ratio

„

„ Fuel Level Sensor (%) Fuel Level Sensor (%)

„

„ Fuel System Status Bank 1 [Fuel System Status Bank 1 [CategCateg. Attrib.]. Attrib.]

„

„ Oxygen Sensor Bank 1 Sensor 1 [mV] Oxygen Sensor Bank 1 Sensor 1 [mV]

„

„ Oxygen Sensor Bank 1 Sensor 2 [mV] Oxygen Sensor Bank 1 Sensor 2 [mV]

„

„ Oxygen Sensor Bank 2 Sensor 1 [mV]Oxygen Sensor Bank 2 Sensor 1 [mV]

„

„ Oxygen Sensor Bank 2 Sensor 2 [mV]Oxygen Sensor Bank 2 Sensor 2 [mV]

„

„ Long Term Fuel Trim Bank 1 [%] Long Term Fuel Trim Bank 1 [%]

„

„ Short Term Fuel Trim Bank 1[%] Short Term Fuel Trim Bank 1[%]

„

„ Idle Air Control Motor Position Idle Air Control Motor Position

„

„ Injector Pulse Width #1 (Injector Pulse Width #1 (msecmsec) )

„

„ Manifold Absolute Pressure (Hg)Manifold Absolute Pressure (Hg)

„

„ Barometric PressureBarometric Pressure

„

„ Calculated Engine Load(%) Calculated Engine Load(%)

„

„ Engine Coolant Temperature (°F) Engine Coolant Temperature (°F)

„

„ Engine Speed (RPM) Engine Speed (RPM)

„

„ Engine Torque Engine Torque

„

„ Intake Air Temperature (IAT) (°F) Intake Air Temperature (IAT) (°F)

„

„ Mass Air Flow Sensor 1(MAF) (lbs/min) Mass Air Flow Sensor 1(MAF) (lbs/min)

„

„ Start Up Engine Coolant Temp. (°F)Start Up Engine Coolant Temp. (°F)

„

„ Start Up Intake Air Temperature (°F)Start Up Intake Air Temperature (°F)

„

„ Throttle Position Sensor (%) Throttle Position Sensor (%)

„

„ Throttle Position Sensor (degree)Throttle Position Sensor (degree)

„

„ Vehicle Speed (Miles/Hour) Vehicle Speed (Miles/Hour)

„

„ Odometer (Miles)Odometer (Miles)

Fuel Subsystem

(6)

Product Concept: MineFleet

Product Concept: MineFleet

Minimize Wireless Communication

Minimize Wireless Communication

-- Onboard data stream miningOnboard data stream mining

-- Send alerts and analytics only Send alerts and analytics only when problems occur

when problems occur

-Optimize Fuel Economy by

Optimize Fuel Economy by

-- Modeling fuel consumption Modeling fuel consumption behavior

behavior

-- Identifying factors that are Identifying factors that are causing poor fuel economy causing poor fuel economy

-- Benchmarking fuel sub-Benchmarking fuel sub-systemsystem

Predictive Health Monitoring

Predictive Health Monitoring

-- --Automatically execute built-Automatically execute built-in in library of tests for checking the library of tests for checking the health

health-status of the vehicle-status of the vehicle

-- Predictive modeling of the vehicle Predictive modeling of the vehicle sub

sub--systemssystems

Driver Behavior Monitoring

Driver Behavior Monitoring

-- Policy-Policy-based driver behavior based driver behavior monitoring

monitoring

-- Identify the effect of driver Identify the effect of driver behavior on fuel economy behavior on fuel economy

(7)

-MineFleet Architecture

MineFleet Architecture

„

„ Main ComponentsMain Components

‰

‰ Onboard ModuleOnboard Module

‰

‰ MineFleet Control Center ServerMineFleet Control Center Server

‰

‰ MineFleet Client ModulesMineFleet Client Modules

‰

(8)

MineFleet System

MineFleet System

MineFleet Onboard MineFleet Onboard MineFleet Control Center

(9)

Challenges: Accessing Data

Challenges: Accessing Data

„

„

Vehicles generate data for hundreds of attributes

Vehicles generate data for hundreds of attributes

„

„

But manufacturers provide open access to only about

But manufacturers provide open access to only about

20 of those that are needed for emission checks

20 of those that are needed for emission checks

„

„

Off

Off

-

-

the

the

-

-

shelf devices were designed for off

shelf devices were designed for off

-

-

line

line

monitoring by mechanics

(10)

Onboard Computing Platform

Onboard Computing Platform

„

„ First prototype First prototype ---- PDA-PDA-based platformbased platform „

„ Other choices:Other choices:

‰

‰ Cell phones and Cell phones and

‰

‰ LowLow--cost, less powerful embedded devicescost, less powerful embedded devices

„

„ Market Entry PointMarket Entry Point

‰

‰ Location management companiesLocation management companies

‰

‰ M2M companiesM2M companies

„

„ Low Cost Embedded GPS DevicesLow Cost Embedded GPS Devices „

„ Resource constrainedResource constrained „

„ 33--4K run time memory4K run time memory „

„ 250K footprint250K footprint „

„ Resource sharing with GPS programResource sharing with GPS program

Circa 2001 Circa 2001 Circa 2005 Circa 2005 Circa 2007 Circa 2007

(11)

Fuel Economy: Impact of Vehicle Condition and

Fuel Economy: Impact of Vehicle Condition and

Driver Behavior

Driver Behavior

„

„ Quantify the effect of vehicle condition on fuel consumption. Example:Quantify the effect of vehicle condition on fuel consumption. Example:

‰

‰ Effect of air-Effect of air-intake subsystem behavior on fuel economyintake subsystem behavior on fuel economy

‰

‰ Effect of fuel subsystem on fuel economy.Effect of fuel subsystem on fuel economy.

„

„ Quantify the effect of driver behavior Quantify the effect of driver behavior on fuel consumption.

on fuel consumption.

‰

‰ Effect of speeding on fuel economyEffect of speeding on fuel economy

‰

‰ Effect of acceleration on fuel economyEffect of acceleration on fuel economy

‰

‰ Effect of braking on fuel economyEffect of braking on fuel economy

‰

‰ Effect of idling on fuel economyEffect of idling on fuel economy

„

„ Build predictive models of the fuel economy as a function of vehBuild predictive models of the fuel economy as a function of vehicle and icle and driving parameters for optimizing the performance

driving parameters for optimizing the performance

Poor vehicle Poor vehicle components and components and bad driving bad driving reduces gas reduces gas mileage mileage

(12)

Fuel Heat Map & Predictive Modeling

Fuel Heat Map & Predictive Modeling

Fuel heat maps show the vehicle operating points that offer high

Fuel heat maps show the vehicle operating points that offer highfuel economy. Red color represents high fuel economy. Red color represents high fuel economy and blue represents poor.

(13)

Fuel Consumption Summary Panel & Savings Calculator

(14)

Predictive Vehicle Health Management

Predictive Vehicle Health Management

Detailed description of a specific test that the vehicle passed Detailed description of a specific test that the vehicle passed

Identifying all the vehicles in a fleet with a Identifying all the vehicles in a fleet with a

specific problem specific problem

Detect problems using physics

Detect problems using physics--based based model and inductive techniques. model and inductive techniques.

(15)

MineFleet VANET Project

MineFleet VANET Project

„

„ Developing a mobile data stream management system for quick indexing Developing a mobile data stream management system for quick indexing and retrieval of information from the device onboard the vehicle

and retrieval of information from the device onboard the vehicle. . „

„ Distributed indexing and clustering techniquesDistributed indexing and clustering techniques Mobile information sources

Mobile information sources

A VANET

(16)

Algorithmic Challenges

Algorithmic Challenges

„

„ Ensemble-Ensemble-based Approachbased Approach „

„ Exact vs. Approximate techniquesExact vs. Approximate techniques „

„ Approximate monitoring of statistical propertiesApproximate monitoring of statistical properties

‰

‰ For example, Correlation MatrixFor example, Correlation Matrix

„

„ Approximate sequence comparisonApproximate sequence comparison „

„ Approximate modelingApproximate modeling „

„ Similarity preservation, approximation and Similarity preservation, approximation and orthogonalityorthogonality

‰

(17)

Correlation Matrix Computation & Monitoring

Correlation Matrix Computation & Monitoring

„

„ Given data matrix XGiven data matrix X „

„ Naïve computation: Compute XNaïve computation: Compute XTTXX „

„ Compute in the frequency domain (take Fourier transformation)Compute in the frequency domain (take Fourier transformation) „

„ StatStreamStatStream ((ZueZue and and ShashaShasha, 2002) , 2002) „

„ Our Approach Exploits Our Approach Exploits

‰

‰ Divide and ConquerDivide and Conquer

‰

‰ Approximate Approximate orthogonalityorthogonalityof random of random vectorsvectors

„

„ Identify the region of the matrix that contain significantly chaIdentify the region of the matrix that contain significantly changed nged

coefficients coefficients

(18)

Testing A Group of Correlation Coefficients Together

Testing A Group of Correlation Coefficients Together

Given a subset of attributes:

Given a subset of attributes:

A random vector

A random vector

Compute

Compute

}

,...

,

{

x

1

x

2

x

k

x

=

}

,...

,

{

σ

1

σ

2

σ

k

σ

=

T

x

s

=

σ

= l q q l r p

x

x

s

r

, 2 1 2

)

,

(

n

Correlatio

)

(

Variance

1

(19)

Divide

Divide

-

-

and

and

-

-

Conquer Search for Significant Correlation

Conquer Search for Significant Correlation

Coefficients

Coefficients

„

„ Impose a tree-Impose a tree-structure: structure:

‰

‰ Leaf node: a unique correlation coefficientLeaf node: a unique correlation coefficient

‰

(20)

Variational

Variational

Approximation

Approximation

„

„

Formulate as an optimization problem

Formulate as an optimization problem

„

„

Introduce approximations

Introduce approximations

„

„

Example: Finite Element Technique

Example: Finite Element Technique

Solve

Solve

Equivalent to minimizing

Equivalent to minimizing

0 ) ( ) ( ), , ( ), ( ) ( " = ∈ = = −u x f x x a b u a u b

− = b a dx x u x u u J( ) ( *'( ) '( ))2

(21)

Continued

Continued

„

„

Introduce approximation using locally

Introduce approximation using locally

decomposable representation

decomposable representation

„

„

Example:

Example:

„

„

Plug

Plug

-

-

in the approximation in the objective

in the approximation in the objective

function

function

)

(

)

(

x

x

u

i i

γ

α

=

(22)

Regression:

Regression:

Variational

Variational

Formulation

Formulation

= ≠ =

=

=

+

=

=

=

n k i j i k j j j j k k k j j j T T n i k i j i k j T

x

x

C

C

w

C

b

w

Cw

w

b

w

w

J

x

x

C

w

w

C

w

w

w

J

, , , , , 1 , , , * *

2

1

)

(

minimizing

to

reduced

be

Can

where

)

(

)

(

2

1

)

(

Minimize

(23)

Approximate Inner Product Computation

Approximate Inner Product Computation

„

„ Deterministic TechniquesDeterministic Techniques

‰

‰ Orthogonal TransformationsOrthogonal Transformations

„

„ Probabilistic TechniquesProbabilistic Techniques

‰

(24)

Approximate Inner Product Computation

Approximate Inner Product Computation

„

„ b1 and b2 can be found by minimizing the error b1 and b2 can be found by minimizing the error

2

,

1

for

)

(

))

(

)

(

)

(

)

(

(

.

1 2 / 1 2 2 2 1 1 1

=

=

Ψ

Ψ

Ψ

+

Ψ

Ψ

=

p

u

u

v

u

b

v

u

b

v

u

n i p p

((

u

.

v

)

2

(

b

1

Ψ

1

(

u

)

Ψ

1

(

v

)

+

b

2

Ψ

2

(

u

)

Ψ

2

(

v

))

1/2

)

dudv

„

(25)

Approximate Inner Product Computation

Approximate Inner Product Computation

Vector 1

Vector 1 Vector 2Vector 2

„

„

Node 1 computes Z

Node 1 computes Z

1,k1,k

‰

‰

Z

Z

1k1k

=A1.J

=A1.J

11

+..+An.J

+..+An.J

nn

‰

‰

J

J

ii

{+1,

{+1,

-

-

1} with uniform

1} with uniform

probability

probability

„

„

Node 2 calculates Z

Node 2 calculates Z

2,k2,k

‰

‰

Z

Z

2k2k

=B1.J

=B1.J

11

+..+Bn.J

+..+Bn.J

nn

„

„

Compute

Compute

z

z

1,k1,k

.z

.z

2,k 2,k

for a few

for a few

times and take the average

times and take the average

A1 A2 . . An B1 B2 . . Bn Random Seed generator Z1,k Z2,k

(26)

Discussion

Discussion

„

„ Need for lightNeed for light-weight algorithms for real-weight algorithms for real--time embedded applicationstime embedded applications „

„ Data intensive sensor networks may have different needs Data intensive sensor networks may have different needs „

(27)

Announcement

Announcement

„

„ National Science Foundation Symposium on Next Generation Data National Science Foundation Symposium on Next Generation Data

Mining Mining

„

References

Related documents

Epidemiologic data and practices related to venous thromboprophylaxis as considered by the global ENDORSE study, (Epidemiologic International Day for the Evaluation of Patients

ENGINE muSt BE Shut-OFF, COOL, aND SPark PLuG WIrE rEmOVED BEFOrE aNY rEPaIr Or maINtENaNCE CaN BE DONE....

Transient faults are often caused due to external errors (e.g., noise), and their adverse effects disappear rapidly. Therefore, nodes affected by such faults are usually

This work has proposed novel mechanisms for location- aware SON functionalities at indoor femtocell scenarios based on the power level received by the UEs.. Here, the restriction

Rehabilitation Committee (SBRCC) will take responsibility for the project, as it did for the Middle School, the Avery Elementary School and the High School Athletic Complex, if

Abbreviations: BP, blood pressure; CKD, chronic kidney disease; CV, cardiovascular; CVD, cardiovascular disease; DKD, diabetic kidney disease; eGFR, estimated glomerular

This indeed seems to be acknowledged, since in (Beck 1796b, § 215), Beck makes the Formula of Universal Law the expression of freedom, hence, presumably, epistemically derived from

Ghaemi, (2006) in a research named "investigation of short term return of shares from new arrived firms to Stock Exchange "he investigated the