A Multilevel Approach Towards Challenge Detection in Cloud Computing

(1)

1 A Multilevel Approach Towards Challenge Detection in Cloud Computing

A Multilevel Approach Towards Challenge Detection in

Cloud Computing

Noorulhassan Shirazi, Michael R. Watson, Angelos K. Marnerides, Andreas Mauthe and David Hutchison

School of Computing and Communications, Lancaster University, Lancaster, UK, LA1 4WA {n.shirazi,m.watson,a.marnerides2,a.mauthe,d.hutchison}@lancs.ac

.uk

1 Introduction and Motivation

Cloud computing is considered as an ideal solution for unified mission-critical ICT systems and infrastructures due to its elasticity and computing flexibility derived by its distributed nature. However, the critical infrastructure imposes much stronger re-quirements for security and resilience on a cloud computing environment. In this pa-per, we illustrate a methodology that exploits the hypervisor level of the cloud nodes. In particular, we consider how anomaly detection can be distributed to each node and coordinated with system level detection for overall cloud resilience based on previous work [1].

2 Methodology

Anomaly detection in cloud computing has been a ground of comprehensive re-search over the last few years as it presents several challenging problems. In general, anomaly detection techniques are based on specific assumptions about the data, for example the statistical distributions of events. If these assumptions do not match reali-ty, the outcomes can be unacceptable rates of false negatives and false positives [2]. In this work we aim to devise a methodology for anomaly detection which is suffi-ciently robust to the elasticity of cloud service provisioning environments.

We intend to incorporate statistical anomaly detection techniques in our framework due to their strength of detecting unknown attacks. We derive a model of the normal behavior of a network or system and detect divergence from the normal profile. This enables them to detect both known and unknown challenges. Normal pattern will be derived based on payload byte patterns in traffic or volume and entropy information over the traffic in a whole sample size. Both parametric as well as non-parametric techniques could be applied, however due to the fact that non parametric techniques do not generally assume the knowledge of underlying distributions [3] this could be our potential candidate. For simplicity non-parametric techniques for probability

(2)

den-sity estimation could be employed which involves using kernel functions to estimate the probability distribution functions (

3 Framework

3.1 Architecture

The overall architecture can be seen in Figure 1 with ware node in the cloud. Detection

scenario has to happen at various levels throughout the cloud infrastructure and hence the observation instances need to coordinate to any observ

Each node has a hypervisor, a host VM and a number of guests VMs. Within the host VM of each node there are software components which perform network level analysis and system level analysis. We also incorporate resilience component which provide protection and remediation actions based on the output from network and system level analysis. Such actions include de

information destined to a vulne resilience component.

3.2 Network level analysis

The purpose of the network level analysis is to categorize

to and from a cloud node and to subsequently identify anomalous traffic, suc produced by malware. This is achieved by capturing packets from the network faces of each VM and applying feature extraction and offline analysis. The work towards online feature

anomaly detection. A correlation of layer anomalies at this stage will increase the effectiveness of later correlation with system level data.

ity estimation could be employed which involves using kernel functions to estimate

the probability distribution functions (pdf) for the normal instances.

itecture can be seen in Figure 1 with A representing a single har

. Detection under various challenges such as malicious attack scenario has to happen at various levels throughout the cloud infrastructure and hence

need to coordinate to any observed anomalies.

Fig. 1. Architecture overview

Each node has a hypervisor, a host VM and a number of guests VMs. Within the host VM of each node there are software components which perform network level evel analysis. We also incorporate resilience component which provide protection and remediation actions based on the output from network and system level analysis. Such actions include destroying an infected VM or blocking information destined to a vulnerable VM, However, this paper is not concern with

analysis

The purpose of the network level analysis is to categorize normal traffic travelling to and from a cloud node and to subsequently identify anomalous traffic, such as that produced by malware. This is achieved by capturing packets from the network inte

of each VM and applying feature extraction and offline analysis. The goal is online feature extraction and analysis which would enable real-tim

A correlation of layer anomalies at this stage will increase the effectiveness of later correlation with system level data.

ity estimation could be employed which involves using kernel functions to estimate

representing a single hard-such as malicious attack scenario has to happen at various levels throughout the cloud infrastructure and hence

Each node has a hypervisor, a host VM and a number of guests VMs. Within the host VM of each node there are software components which perform network level evel analysis. We also incorporate resilience component which provide protection and remediation actions based on the output from network and ng an infected VM or blocking with traffic travelling that inter-goal is to time A correlation of layer anomalies at this stage will increase the

(3)

The network level analysis components comprises of various software scripts that

allow an automate post processing by calling tcpdump filtering routines alongside

several precompiled tools that are integrated within the CAIDA’s CoralReef toolbox1.

These scripts perform feature extraction from the outputs of the CoralReef tools and spans into several post processing outputs that convert STDIN input into various normalized statistical properties on a per-packet (e.g. inter-arrival times, etc) and per flow basis (e.g. counts of packets /bytes per flow, average packet size etc.). These scripts also have an interface with MATLAB for further analysis. The output from these scripts is shown in fig. 2 below. The malware sample (Kelihos) is injected in test bed for sixty minutes and custom scripts were used to analyse network features (packets and flow rate) over sixty minutes of recording.

Fig. 2. Packets (left) and Flow (right) per second for sixty minutes of kelihos injection

3.3 System level analysis

The system level detection of anomalies is achieved through the observation of VM memory from the hypervisor using virtual machine introspection, such as that

provided by the libVMI introspection library. The approach is similar to that applied

in the network level, but applied to system level information where we first try to understand the normal behaviour with respect to the operation of processes, after which anomalous behaviour can be identified.

The system level component also comprises of various scripts which invoke cus-tom plug-in for feature extraction and provide interface with Matlab for visual 1_{CoralReef: http://www.caida.org/tools/measurement/coralreef/more_information.xml} 0 500 1000 1500 2000 2500 3000 3500 4000 3.85 3.9 3.95 4 4.05 4.1 4.15 4.2 4.25 Packet Rate Time in seconds Packets / Sec 0 500 1000 1500 2000 2500 3000 3500 4000 0 0.5 1 1.5 2 2.5 3 3.5 4 Flow Rate Time in seconds Flows / Sec

(4)

sis, as Fig.3 shows, virtual size of memory per bytes (Kelihos) in test bed for sixty minutes.

Fig. 3. Virtual size of memory

4 Summary

Critical infrastructure imposes much stronger requirements for security and resil ence on cloud computing environment. Therefore, these

dressed through layered approach i.e., coordinated work-level properties towards

Acknowledgements

This work is sponsored by for Critical Infrastructure IT), g IU-ATC project, grant agreeme

sis, as Fig.3 shows, virtual size of memory per bytes during injection of test sam (Kelihos) in test bed for sixty minutes.

of memory per bytes over sixty minutes of Kelihos injection

infrastructure imposes much stronger requirements for security and resil ence on cloud computing environment. Therefore, these challenges need to be

layered approach i.e., coordinated analysis of both system and ne owards overall resilience of the cloud infrastructure.

This work is sponsored by EU FP7 Project ‘SECCRIT’ (Secure Cloud Computing for Critical Infrastructure IT), grant agreement no. 312758 and UK-EPSRC funded

project, grant agreement no. EP/J016675/1.

test sample

infrastructure imposes much stronger requirements for security and ad-analysis of both system and

net-FP7 Project ‘SECCRIT’ (Secure Cloud Computing EPSRC funded

(5)

References

1. A. K. Marnerides, D. P. Pezaros, and D. Hutchison. Detection and mitigation of abnormal traffic behaviour in autonomic networked environments. In Proceedings of ACM SIGCOMM CoNEXT Conference 2008.

2. P. Angelov, P. Sadeghi-Tehran, R. Ramezani, A real time approach to autonomous novelty detection and object tracking in video stream. International Journal of Intelligent Systems, vol.26 (3) 189-205, 2011.

3. V.Cahndola, A. Banerjee, Anomaly detection: A Survey. In Proceedings of ACM Compu-ting Surveys, September 2009.