• No results found

Elettra DAta analysis Tool: a data webhousing tool for heterogeneous log analysis

N/A
N/A
Protected

Academic year: 2021

Share "Elettra DAta analysis Tool: a data webhousing tool for heterogeneous log analysis"

Copied!
25
0
0

Loading.... (view fulltext now)

Full text

(1)

Elettra DAta aNalysis

Elettra DAta aNalysis

Tool:

Tool:

a data webhousing

a data webhousing

tool

tool

for

for

heterogeneous log analysis

heterogeneous log analysis

Roberto

Roberto PugliesePugliese Stefano

Stefano MaraspinMaraspin Alessio

Alessio CurriCurri

Software for Measurements Software for Measurements

Experiment Division Experiment Division

Sincrotrone

(2)

Outline

Outline

Elettra

Elettra

Virtual

Virtual

Collaboratory

Collaboratory

(EVC)

(EVC)

„

„

The portal application

The portal application

„

„

The Collaboration Tools

The Collaboration Tools

„

„

Usage scenarios

Usage scenarios

DANT an advanced Log

DANT an advanced Log

Analyzer

Analyzer

„

„

Software Architecture

Software Architecture

„

„

DANT at work

DANT at work

„
(3)

What is the

What is the

Elettra

Elettra

Virtual

Virtual

Collaboratory

Collaboratory

(EVC)?

(EVC)?

EVC is an example of virtual laboratory, a system which allows a EVC is an example of virtual laboratory, a system which allows a team of researchers distributed anywhere in the world to perform team of researchers distributed anywhere in the world to perform a a

complete experiment on the equipped

complete experiment on the equipped beamlinesbeamlines and experimental and experimental stations of

stations of ElettraElettra. .

User at

ELETTRA Team Memberat Home Lab

Remote Collaborator

Equipment

Control Data CPU

(4)

EVC in action: a web portal

EVC in action: a web portal

EVC is based on

EVC is based on

the “web portal”

the “web portal”

metaphor

metaphor

EVC supports

EVC supports

four different user

four different user

categories:

categories:

„

„ VisitorsVisitors „

„ Normal usersNormal users „

„ Project leadersProject leaders „

(5)

Collaborating to an EVC project

Collaborating to an EVC project

Scientists working

Scientists working

to an EVC project

to an EVC project

can use many

can use many

project related project related collaboration collaboration tools tools EVC presents an EVC presents an adaptive interface adaptive interface changing to suite changing to suite the category the category and expertise and expertise

level of the user

(6)

Collaboration tools:

Collaboration tools:

scientific visualisation

scientific visualisation

Scientists can

Scientists can

browse, visualise

browse, visualise

and process

and process

remotely

remotely

scientific data

scientific data

in real

in real

-

-

time

time

as soon as

as soon as

the data is

the data is

collected

collected

(7)

Collaboration Tools: Remote

Collaboration Tools: Remote

Beamline

Beamline

Control and Supervision

Control and Supervision

Beamwatch

Beamwatch

presents a

presents a

synoptic view

synoptic view

of the

of the

beamlines

beamlines

Autorised

Autorised

people

people

can thus operate

can thus operate

remotely on

remotely on

the

the

beamline

beamline

intrumentation

intrumentation

(8)

Collaboration Tools:

Collaboration Tools:

Electronic Notebook

Electronic Notebook

Web application

Web application whichwhich substitutes the Beamline

substitutes the Beamline

LogBook

LogBook registeringregistering meaningful

meaningful beamlinebeamline events

events using a wikiusing a wiki- -weblog

weblog methaphormethaphor Events can be

Events can be enteredentered manually or automatical

manually or automatical- -ly

ly by a programby a program

Texts and images are

Texts and images are

automatically indexed

automatically indexed

and hence easily

and hence easily

searchable and browsable

(9)

2 mm

Reconstructed stack of 279 slices Volume rendering

EVC usage scenarios: real time

EVC usage scenarios: real time

micro

micro

-

-

tomographic

tomographic

reconstructions

reconstructions

EVC will be used to allow real

EVC will be used to allow real--time microtime micro--tomographictomographic reconstructions on SYRMEP

reconstructions on SYRMEP beamlinebeamline, as soon as , as soon as data is collected

(10)

A new experimental

A new experimental

station was recently

station was recently

added to EVC: added to EVC: ELETTRA Control ELETTRA Control Room Room

Available tools for

Available tools for

this experimental this experimental station are: station are: Chat, Scientific Chat, Scientific File Browser, File Browser, OperatorConsole

OperatorConsole, , ENotebookENotebook, , LogAnalizer LogAnalizer

EVC usage scenarios:

EVC usage scenarios:

remoting

(11)

Collaboration tools:

Collaboration tools:

Advanced

Advanced

LogAnalizer

LogAnalizer

Advanced

Advanced LogAnalizerLogAnalizer isis a web application

a web application whichwhich allows

allows toto selectselect variablesvariables from the logfiles

from the logfiles producedproduced by

by different control anddifferent control and supervision systems,

supervision systems,

and to plot them in a

and to plot them in a

user

user specifiedspecified temporaltemporal interval

interval

Advanced LogAnalizer is

Advanced LogAnalizer is

technically a data warehouse, modular both

technically a data warehouse, modular both considering the considering the data loading and the data visualisation (Visual Data

(12)

About DANT...

About DANT...

Modular, web

Modular, web

-

-

based Log Analysis Tool

based Log Analysis Tool

Capable of storing a large amount of

Capable of storing a large amount of

heterogeneous data into a unique,

heterogeneous data into a unique,

homogeneous data warehouse (thus

homogeneous data warehouse (thus

facilitating information retrieval).

facilitating information retrieval).

Allows (remote) data analysis

Allows (remote) data analysis

Doesn't require specific hardware or

Doesn't require specific hardware or

software to run.

software to run.

It can run properly on any

It can run properly on any

PC with a moderate amount of RAM and

PC with a moderate amount of RAM and

CPU speed (IE 256MB 350Mhz).

CPU speed (IE 256MB 350Mhz).

(13)

DANT Goals

DANT Goals

Problem Context

Problem Context

„

„ ElettraElettra Scientific Instruments generate Log files with different Scientific Instruments generate Log files with different

structures and store them in different locations. This obviously structures and store them in different locations. This obviously doesn't make any analysis easy.

doesn't make any analysis easy.

„

„ Current log files often includeCurrent log files often include significativesignificative redundancies and thus redundancies and thus

are easily prone to inconsistencies or

are easily prone to inconsistencies or unnecesaryunnecesary large storage large storage requirements.

requirements.

Goals

Goals

„

„ Collect all the instruments data logs into a unique, reliable and Collect all the instruments data logs into a unique, reliable and

efficient data base system. efficient data base system.

„

„ Permit therefore easy data analysis or comparisons, possibly with Permit therefore easy data analysis or comparisons, possibly with

other already available databases and tools (i.e.

(14)

Software Architecture

Software Architecture

(15)

Software Architecture

Software Architecture

(16)

Software Features

Software Features

-

-

1

1

Abstract and Layered development Approach

Abstract and Layered development Approach

(Portability and

(Portability and CustomizationCustomization always kept in mind)always kept in mind)

„

„ Abstract DB Connections:Abstract DB Connections: DANT can run upon most DANT can run upon most

common DB systems, including for example: Oracle,

common DB systems, including for example: Oracle, MysqlMysql and

and PostgreSQLPostgreSQL..

„

„ Only software requirementOnly software requirement is to have ais to have a phpphp engineengine installed installed

on the (server) machine where the tool is intended to be run. on the (server) machine where the tool is intended to be run. Nowadays almost every web server supports

Nowadays almost every web server supports php. php.

„

„ Cross Platform Application: being developed as a web Cross Platform Application: being developed as a web

tool, DANT offers full cross

tool, DANT offers full cross plattfromplattfrom support. All of the support. All of the processing and data handling occur on the server side,

processing and data handling occur on the server side,

leaving to the connected clients just the task of displaying

leaving to the connected clients just the task of displaying

some HTML code in a web browser.

(17)

Software Features

Software Features

-

-

2

2

Low Cost

Low Cost

„

„ Entirely based upon Open Source Software.Entirely based upon Open Source Software.

Easy Deployment and Configuration

Easy Deployment and Configuration

„

„ There's a single configuration file keeping track of all system There's a single configuration file keeping track of all system

variables (IE Database Type, Maximum allocable memory..). variables (IE Database Type, Maximum allocable memory..).

„

„ This Configuration file can be additionally generated during This Configuration file can be additionally generated during

installation, so that deployment and usage are straight forward. installation, so that deployment and usage are straight forward.

Modular (thus easily Scalable)

Modular (thus easily Scalable)

„

„ Both Input (indexing) and output (presentation, visualization) Both Input (indexing) and output (presentation, visualization)

extensions to the DB system are built as modules, thus allowing extensions to the DB system are built as modules, thus allowing users to choose (or develop) the most adapt modules to the

users to choose (or develop) the most adapt modules to the situation.

(18)

Input Module Example: BCS Logs

Input Module Example: BCS Logs

Implemented as a Python script

Implemented as a Python script

Receives data either from STDIN, named pipes

Receives data either from STDIN, named pipes

(remote Unix

(remote Unix

syslog

syslog

) or Archived Log Files

) or Archived Log Files

Capable of reading and Indexing compressed

Capable of reading and Indexing compressed

data (

data (

gzipped

gzipped

archives)

archives)

Has SMART Hierarchical Indexing capability

Has SMART Hierarchical Indexing capability

(no need to set up when new instruments are

(no need to set up when new instruments are

added).

added).

(19)

Sample Use Case

Sample Use Case

Step 1

Step 1

(20)

Sample Use Case

Sample Use Case

Step 2

Step 2

(21)

Sample Use Case

Sample Use Case

Step 3

Step 3

(22)

Advanced Analysis

Advanced Analysis

Combined Plots Drawing Capability

Combined Plots Drawing Capability

„

„ Used not to plot variable values in a timeline but rather Used not to plot variable values in a timeline but rather relate different variables together in a single plot,

relate different variables together in a single plot,

displaying the result of a certain expression.

displaying the result of a certain expression.

„

„ Sample Use Case: Measure Machine Efficiency and find Sample Use Case: Measure Machine Efficiency and find out actual

(23)

DANT Status ...

DANT Status ...

Currently Supported Logs:

Currently Supported Logs: BeamlineBeamline Control System, Control System, Beamline

Beamline Access Control System Storage RingAccess Control System Storage Ring Average archived records Indexing (hourly):

Average archived records Indexing (hourly): 35.00035.000

Maximum Acceptable Fetched Values for a Single Plot

Maximum Acceptable Fetched Values for a Single Plot

[upon current hardware]:

[upon current hardware]: 35.00035.000 Average Monthly Stored Records:

Average Monthly Stored Records: around 25.000.000around 25.000.000

„

„

Probably swap

Probably swap

-

-

in/swap

in/swap

-

-

out of older data will be

out of older data will be

required

required

Record Size:

Record Size: 94bytes94bytes

Average Compression Factor

(24)

and future developments

and future developments

Integration with Other Analysis

Integration with Other Analysis

Tools/Instruments

Tools/Instruments

„

„ Add Java Support (applet analysis)Add Java Support (applet analysis) „

„ PermitPermit RealtimeRealtime AnalysisAnalysis „

„ New Presentation Modules development (IDL)New Presentation Modules development (IDL)

Integration with Other Data Sources

Integration with Other Data Sources

„

„ VUO (Virtual User Office)VUO (Virtual User Office) „

„ BPM (BPM (BeamlineBeamline Position Monitor)Position Monitor) „

(25)

Acknowledgements

Acknowledgements

Fulvio

Fulvio

Bille

Bille

’, Michele

’, Michele

Turcinovich

Turcinovich

,

,

Roberto

Roberto

Borghes

Borghes

, Claudio

, Claudio

Scafuri

Scafuri

,

,

Massimiliano

Massimiliano

Chiardone

Chiardone

, Kristina

, Kristina

Djinovic

Djinovic

from ELETTRA

from ELETTRA

Doriano

Doriano

Lamba

Lamba

, Alberto

, Alberto

Cassetta

Cassetta

from

from

CNR

References

Related documents

The WIFC has hosted the 2012 FINA Junior Open Water Swimming World Championships, the 2013 International Canoe Federation Junior and U23 Canoe Spring World Championships, the

Mobile Network Operator (MNO) OEMs, Google Wallet Banks Mobile Phone Baseband IC NFC Modem Chip NFC SWP miniSIM Mobile Phone Baseband IC NFC Modem Chip miniSIM NFC

Advances in medical treatment, management, diagnosis and surgical palliation have improved the quality and longevity of children born with Congenital Heart Disease. As

Potential explanations for the large and seemingly random price variation are: (i) different cost pricing methods used by hospitals, (ii) uncertainty due to frequent changes in

Initially, I had difficulty understanding how it was that students were integrating the various disciplinary perspectives in their pursuit of the question, “What does it mean to

[87] demonstrated the use of time-resolved fluorescence measurements to study the enhanced FRET efficiency and increased fluorescent lifetime of immobi- lized quantum dots on a

T h e second approximation is the narrowest; this is because for the present data the sample variance is substantially smaller than would be expected, given the mean

The critical defect length leading to thermal runaway is determined as a function of the current decay time constant s dump , RRR of the SC cable copper matrix, RRR of the bus