• No results found

System Behavior Analysis by Machine Learning

N/A
N/A
Protected

Academic year: 2021

Share "System Behavior Analysis by Machine Learning"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Motivation and Background Analysis Examples Conclusion References

System Behavior Analysis by Machine Learning

CSC456 OS Survey

Yuncheng Li

[email protected]

(2)

Motivation and Background Analysis Examples Conclusion References

Table of contents

1

Motivation and Background

Motivation

Background

2

Analysis Examples

Windows Error Reporting System[XHF

+

09]

Web server performance metric[BSP

+

08]

Console logs with source code[DB08]

3

Conclusion

4

References

(3)

Motivation and Background Analysis Examples Conclusion References Motivation Background

Table of Contents

1

Motivation and Background

Motivation

Background

2

Analysis Examples

Windows Error Reporting System[XHF

+

09]

Web server performance metric[BSP

+

08]

Console logs with source code[DB08]

3

Conclusion

(4)

Motivation and Background Analysis Examples Conclusion References Motivation Background

Scenarios

Debugging and improving Operating Systems

Dynamic resource allocation

(5)

Motivation and Background Analysis Examples Conclusion References Motivation Background

Challenges

Badly organized data

Big Data

(6)

Motivation and Background Analysis Examples Conclusion References Motivation Background

Methodology

Data driven, rather than problem specific

Pattern Recognition and Data Mining

(7)

Motivation and Background Analysis Examples Conclusion References Motivation Background

Machine Learning Components

Raw Data

, which are usually not well structured, such as

console logs from arbitrary program

Feature Engineering

, refining raw data by removing

unnecessary information and transforming data representation

into a form that could be used by general data model

General Data Model

, such as Tree Based Model (CART),

Principal Components Analysis (PCA), Logistic Model . . .

(8)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09] Web server performance metric[BSP+08] Console logs with source code[DB08]

Table of Contents

1

Motivation and Background

Motivation

Background

2

Analysis Examples

Windows Error Reporting System[XHF

+

09]

Web server performance metric[BSP

+

08]

Console logs with source code[DB08]

3

Conclusion

4

References

(9)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Data Source

(10)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Raw Data

Failure Type, such as segmentation fault and deadlocks

Process Name

Exception code, such as 0x c0 00 00 00 05

(11)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Callstack Example

(12)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Features

Indicator Function for

Event Type, Process Name and

Exception Code

Extended Minimum Editing Distance for pairwise Callstack

comparison (O(min(m, n)) space complexity)

Editing: insertions, deletions or substitution

Editing Target: letters, frames and callstack groups

Each

pair

of raw data (

failure type, process name, exception code

. . .

) will generate a

similarity vector.

(13)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Model

Logistic Model

h(x) =

1

+

e

1

−β·x

to predict whether two failures are

equal based on the similarity vector.

Full Model: No constraints on the

β

Reduced Model: Enforce some constraints on the penalty

coefficients

β

(14)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Results – Parameters Learning

(15)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08] Console logs with source code[DB08]

Results – Similarity Prediction Performance

(16)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08]

Console logs with source code[DB08]

Data Source

(17)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08]

Console logs with source code[DB08]

Features

Time series of the following metrics

CPU utilization

number of disk IO

network traffic

. . .

(18)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09]

Web server performance metric[BSP+08]

Console logs with source code[DB08]

Analysis Techniques

Clustering

for identifying distinct system states. (Kmeans or

Tree Based Model)

Correlation Analysis

to find which set of attributes always

comes with failures, which is an indication of failure causes

(19)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09] Web server performance metric[BSP+08]

Console logs with source code[DB08]

Data Source

(20)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09] Web server performance metric[BSP+08]

Console logs with source code[DB08]

Features Extraction

Message types and message variable, such as

sent packet 196.168.3.1

Identify

significant

message types (Have a lot of

variations

)

Grouping message values, by clustering in the feature space

Generate

bag of words

feature, which is a histogram of

message groups

(21)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09] Web server performance metric[BSP+08]

Console logs with source code[DB08]

PCA-Based Anomaly Detection

PCA is short for Principal Components Analysis

Certain feature instance is identified as

anomaly

if the

reconstruction error

by principal components is higher than a

threshold

(22)

Motivation and Background

Analysis Examples

Conclusion References

Windows Error Reporting System[XHF+09] Web server performance metric[BSP+08]

Console logs with source code[DB08]

(23)

Motivation and Background Analysis Examples

Conclusion

References

Table of Contents

1

Motivation and Background

Motivation

Background

2

Analysis Examples

Windows Error Reporting System[XHF

+

09]

Web server performance metric[BSP

+

08]

Console logs with source code[DB08]

3

Conclusion

(24)

Motivation and Background Analysis Examples

Conclusion

References

Conclusion

Unstructured data calls for good feature engineering

Feature engineering is really the key

(25)

Motivation and Background Analysis Examples Conclusion

References

Table of Contents

1

Motivation and Background

Motivation

Background

2

Analysis Examples

Windows Error Reporting System[XHF

+

09]

Web server performance metric[BSP

+

08]

Console logs with source code[DB08]

3

Conclusion

(26)

Motivation and Background Analysis Examples Conclusion

References

Bibliography I

Kevin Bartz, Jack W. Stokes, John C. Platt, Ryan Kivett,

David Grant, Silviu Calinoiu, and Gretchen Loihle.

Finding similar failures using callstack similarity.

In

Proceedings of the Third conference on Tackling computer

systems problems with machine learning techniques, SysML’08,

pages 1–1, Berkeley, CA, USA, 2008. USENIX Association.

Songyun Duan and Shivnath Babu.

Empirical comparison of techniques for automated failure

diagnosis.

In

Proceedings of the Third conference on Tackling computer

systems problems with machine learning techniques, SysML’08,

pages 2–2, Berkeley, CA, USA, 2008. USENIX Association.

(27)

Motivation and Background Analysis Examples Conclusion

References

Bibliography II

Wei Xu, Ling Huang, Armando Fox, David Patterson, and

Michael I. Jordan.

Detecting large-scale system problems by mining console logs.

In

Proceedings of the ACM SIGOPS 22nd symposium on

Operating systems principles, SOSP ’09, pages 117–132, New

York, NY, USA, 2009. ACM.

References

Related documents

In addition, this example, of a university President celebrating the building of ‘a very strong network of alumni in China and Hong Kong', highlighted the significance for

The aim of this study is to analysis the web server user access logs of Firat University to help system administrator and Web designer to improve their system by determining

Custom data page jqgrid form to another great feature is where grid data set session timeout in place for the header layer is the end user comes my house.. Layer and searching

experience and the positive outcomes of learning from failure (Baum & Dahlin, 2007; Homsma et al., 2009; Madsen & Desai, 2010), it has remained equivocal an

The purpose of this project was to identify the major concerns of the T2DM clinic patient population and develop a staff diabetic education program to address

Future of World Order, 1860– 1900 (Princeton, NJ: Princeton University Press, 2007); Duncan Bell, ed., Victorian Visions of Global Order: Empire and International Relations

1996 - 2007 Massachusetts Coalition of School-Based Health Centers President (2002-2006), Board Member (1996-2007) 1995 - Present National Association of Pediatric

It is based on three steps: a splitting step which splits events in order to allow the incremental and local resolution of non-determinism, a mapping step which introduces