• No results found

The STC for Event Analysis: Scalability Issues

N/A
N/A
Protected

Academic year: 2022

Share "The STC for Event Analysis: Scalability Issues"

Copied!
46
0
0

Loading.... (view fulltext now)

Full text

(1)

The STC for Event Analysis:

Scalability Issues

Georg Fuchs

Gennady Andrienko

(2)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Events

“Something [significant] happened s om ew here, s om etim e”

 Analysis goal and domain dependent, e.g.

 “Object starts/stops moving”,

“Object property changes”,

“Earthquake with magnitude > 2 on Richter scale”

 Visualization methods

 Animated and dynamic query maps

 Space-Time Cube (STC)

(3)

The Scalability Challenge

53.000 events

(4)

© Fraunhofer-Institut für Intelligente

Analyse- und Informationssysteme IAIS 4

Analysis of Spatially Distributed Events:

Major Questions

 How are the events distributed in space?

 at one particular time moment, or

 all events that occurred over a time period

 How are the event occurrences distributed over time?

 E.g., how does the overall event frequency vary?

 How does the pattern of spatial distribution of the events change over time?

 How are the events distributed in space + time? Are there any

spatio-temporal clusters?

(5)

Data structure:

Example: Earthquakes in Marmara region (western Turkey and around)

… … … …

(6)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Adressing the Scalability Challenge:

Optimized Rendering?

Full Opacity

(7)

Adressing the Scalability Challenge:

Optimized Rendering?

50%

(8)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Adressing the Scalability Challenge:

Optimized Rendering?

70%

Transparency

(9)

Events

“Something [significant] happened s om ew here, s om etim e”

 Analysis goal and domain dependent, e.g.

 “Object starts/stops moving”,

“Object property changes”,

“Earthquake with magnitude > 2 on Richter scale”

 Visualization methods

 Animated and dynamic query maps

 Space-Time Cube (STC)

 Analysis methods

 Spatio-Temporal Aggregation

Addressing the Scalability Challenge

(10)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Spatio-temporal aggregation

 Spatial aggregation:

by units of any territory division

 E.g., cells of a regular grid

 Temporal aggregation:

by time intervals

 Occlusion is still a problem since ST-aggregates typically use larger glyphs (e.g., spheres) to convey the aggregated region + time interval!

Reduction of object/rendering primitive count

(11)

Events

“Something [significant] happened s om ew here, s om etim e”

 Analysis goal and domain dependent, e.g.

 “Object starts/stops moving”,

“Object property changes”,

“Earthquake with magnitude > 2 on Richter scale”

 Visualization methods

 Animated and dynamic query maps

 Space-Time Cube (STC)

 Analysis methods

 Spatio-Temporal Aggregation

 Event Density Calculation

Addressing the Scalability Challenge

(12)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Event Density Calculations

In case of 2D maps: compute density surfaces 1976

1977 1978

Dis claim er:

There are far more

polished tools than

the one used for

these illustrations...

(13)

Event Density Calculations

In case of 3D STC: worthwhile looking at volume visualization?

? ?

© MathWorks

(14)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Events

“Something [significant] happened s om ew here, s om etim e”

 Analysis goal and domain dependent, e.g.

 “Object starts/stops moving”,

“Object property changes”,

“Earthquake with magnitude > 2 on Richter scale”

 Visualization methods

 Animated and dynamic query maps

 Space-Time Cube (STC)

 Analysis methods

 Spatio-Temporal Aggregation

 Event Density Calculation

 Spatio-Temporal Clustering

Adressing the Scalability Challenge

(15)

Event Distribution in Space-Time

Finding clusters in Space-Time

This is what we are interested in!

(16)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Event Distribution in Space-Time

Finding clusters in Space-Time

We see that all but one events really

occurred very close to each other. We can conclude that this is indeed a spatio-

temporal cluster and, hence, there may be

a relationship between these events

(17)

Event Distribution in Space-Time

Finding clusters in Space-Time

We see that the events seem to split into

two sequences with a certain time lapse

between them

(18)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Event Distribution in Space-Time

• The number of clusters must be known in advance

• Returns convex shaped clusters

• Connection between events with a certain distance threshold.

• Difficult to parametrize.

• Extract arbitrarly shaped clusters.

• Doesn‘t require a priori specification of the

amount of clusters.

Automated Detection of ST Event Clusters

(19)

Density based Clustering Algorithm

(20)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Event Distribution in Space-Time

 Clusters detection using density-based clustering

 Parameters:

spatial distance threshold = 10 km Temporal distance threshold = 30 days

20

Automated Detection of ST Event Clusters

(21)

Event Distribution in Space-Time

 Clusters detection using density-based clustering

 Observations and caveats:

 The space-time cube reveals an interesting pattern: a west-east shift of cluster locations over the studied time period

 Number of detected clusters (108) exceeds number of discernible colors 

different clusters are often colored very similarly

Automated Detection of ST Event Clusters

(22)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Automated Detection of ST Event Clusters

 Density-based algorithms typically assume entire data fits into RAM at once

 Might not hold during initial explorative analysis

 e.g., Flickr photo-taking ~100,000,000 events

 Proposed scalability extension to D B S CAN (EuroVA‘12)

 Scalable to large datasets not fitting in RAM

 Accounts for spatiotemporal nature of the data

 Improved execution time compared to D B S CAN

Scaling to extremely large event data – Extended DBScan

(23)

Extended DBSCAN

Spatio-temporal neighborhood parameters

(24)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Data is successively loaded into RAM in partially overlapping frames

Database

Principal algorithm steps

(25)

Extended DBSCAN

DBSCAN is applied

to each frame independently using ST-neighborhood criterion

Database Main Memory: RAM

Principal algorithm steps

(26)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Database Main Memory: RAM

DBSCAN is applied

to each frame independently

using ST-neighborhood criterion

Principal algorithm steps

(27)

Extended DBSCAN

Database Main Memory: RAM

DBSCAN is applied

to each frame independently

using ST-neighborhood criterion

Principal algorithm steps

(28)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Database Main Memory: RAM

DBSCAN is applied

to each frame independently

using ST-neighborhood criterion

Principal algorithm steps

(29)

Extended DBSCAN

Database Main Memory: RAM

DBSCAN is applied

to each frame independently

using ST-neighborhood criterion

Principal algorithm steps

(30)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

When clustering is completed, the clusters of consecutive frames are merged.

Database Main Memory: RAM

Principal algorithm steps

(31)

Extended DBSCAN

Database Main Memory: RAM

When clustering is completed, the clusters of consecutive frames are merged.

Principal algorithm steps

(32)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Database Main Memory: RAM Database

When clustering is completed, the clusters of consecutive frames are merged.

Principal algorithm steps

(33)

Extended DBSCAN

After merging, RAM occupied by old frames is released.

Database Main Memory: RAM Database

Principal algorithm steps

(34)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Database Main Memory: RAM Database

Principal algorithm steps

(35)

Extended DBSCAN

Database Main Memory: RAM Database

Principal algorithm steps

(36)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Database Main Memory: RAM Database

Principal algorithm steps

(37)

Extended DBSCAN

Database Main Memory: RAM Database

Principal algorithm steps

(38)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Merging process

(39)

Extended DBSCAN

Merging process

(40)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

 The proposed algorithm can be used for visual analysis

large datasets.

 2 mil. points. / 17.200 GPS- tracks

 Collected in one week.

 Objective:

 Detect traffic jams in the city.

 Investigate the properties of the clusters.

Use for visual exploration

(41)

Extended DBSCAN

 Detection:

 Spatio-temporal clusters of slow movement events

 Remove noise (i.e., spurious slow movements)

 Investigation:

 Temporal distribution of these traffic jams

 Convex hulls/prism representation

 Less objects/glyphs to visualize

 Spatial and/or temporal zooming can be applied

Use for visual exploration

(42)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Use for visual exploration – convex hull cluster representation

(43)

Extended DBSCAN

Use for visual exploration – temporal zooming

(44)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Extended DBSCAN

Use for visual exploration

(45)

Extended DBSCAN

 Combine temporal with spatial framing

 Dynamic frame sizes according to local density distribution

 Exploit inherent parallelism of independent frame clustering

Future Work

(46)

© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

 STC useful tool for event analysis

 One focus of interest:

scalability of STC visualization and backing analysis methods

 Improved rendering, data reduction (clustering), volume rendering(?)

 Strong interest in software engineering & rendering:

would also like to exchange experiences on architectures, data structures, shader-based graphics pipelines + rendering engines!

Executive Summary

Or: Why is that guy at this workshop?

References

Related documents

Europe: North Europe and mountains of Central and South Europe, North America, North Africa (Anon., 2001), Asia: Turkey (Demirel & al., 2003)..

Checklist Item #5. On all work sites where employees are engaged in Class I or II asbestos work, the competent or qualified person is to: 1) supervise or perform set up of the

For additional information about this publication click this link.. Previous studies have indicated that Q-fever patients may suffer from long-lasting health effects, such as

If your Data Management Studio installation includes an Enrichment license, you need to install the proper USPS, Canada Post, and Geocode databases to do address verification. If

657. Dit ongeacht het feit dat in de Tweede Kamer en daar buiten in de beschreven periode veel weerstand is geweest tegen de belastingheffing naar draagkracht en de ingevoerde

On the other hand, an increase in both cortisol and glucose levels was observed in juvenile cobia fed on the 0% (control) and the 45% microbial floc and

Despite his homophobic behaviour, Francis does signalise pleasure in the company (and harassment) of Tom. Eventually, this leads to the question of whether the bonding between the

Myself being an African American male who dropped out of high school years ago, I am interested in comparing my experience to the experiences of other African American males, in