• No results found

A Case Study on Constructing a Security Event Management (SEM) System

N/A
N/A
Protected

Academic year: 2021

Share "A Case Study on Constructing a Security Event Management (SEM) System"

Copied!
23
0
0

Loading.... (view fulltext now)

Full text

(1)

A Case Study on Constructing a Security Event

Management (SEM) System

Vijay K. Gurbani

Joint work with D.L. Cook, L.E. Menten, and T.B. Reddington

Security Technology Research Bell Laboratories, Alcatel-Lucent

vkg@{alcatel-lucent.com,bell-labs.com}

3

rd

International Conference on

IT-Incident Management and IT-Forensics

(IMF 2007), Universität Stuttgart

Stuttgart, Germany

(2)

Agenda

ƒ Introduction ƒ Related work

ƒ System design and architecture

ƒ Using the SEM framework

ƒ Lessons learned

ƒ Research agenda in SEM

(3)

Introduction

Only needs to find one weak element

Relies on fact that protection is not perfect May be as knowledgeable (and more so) as the defender

Needs to protect all elements Needs to be perfect in

ƒ Design

ƒ Implementation

ƒ Configuration

ƒ Operations

It will be a long time before designs, implementations,

configurations, and operations become perfect.

It is far easier to be a cacker than a defender!

(4)

Introduction

ƒ Why is network security difficult?

™Delegated to individual hosts and applications;

™Limited communications with others in the ecosystem;

™Applied in reactive mode, not pro-active mode;

™Network security tools do not provide an integrated network view regarding the state of the network: minimal, if any, situational awareness.

ƒ What should the model be?

Prevention Detection Response Security Configuration Event Management Event Messages Prevention Detection Response Host Port-scan in progress Block IP = X

(5)

Introduction

ƒ Security Event Management: ability

of the network to detect, analyze, and interpret discrete events AND take remedial action when events manifest themselves – AGILITY in SECURITY.

User Interface

Control Center

Set/Revoke

Policies Real-time Alerts

Event Correlation Engine

B 1 2 3 4 A C D 5 6 7 8 EF HG Domain A Domain Z LU BB AL BB OL OMS OL OMS OMS OMS

Event correlation Topology mapping Enforcing directives

Event Collectors

Software agents Hosts and applications

• Monitoring log files • Monitoring critical

processes

• Monitoring disk, CPU, or other network activity Prevention Detection Response Security Configuration Event Management Event Messages

(6)

ƒ Introduction

ƒ Related work

ƒ System design and architecture

ƒ Using the SEM framework

ƒ Lessons learned

ƒ Research agenda in SEM

(7)

Related work

ƒ SEM frameworks ƒ Commercial

ƒ Academic: SEM using data-mining techniques:

ƒ Liu et al. – SEM system constructed using CASE-based reasoning.

ƒ Ertoz et al. – MINDS – Minnesota Intrusion Detection System.

ƒ Determining root-cause analysis

ƒ Duan et al., Sekar et al.: enhance IDS to minimize false alarm rate. ƒ Julisch: few dozen root causes account for >= 90% of alarms an IDS

generates.

ƒ Devit et al.: topological proximity approach to wean out implausible alarms.

ƒ Reporting

ƒ Debar et al. [RFC4765]: IDMEF - describes a data model to represent

information exported by a IDS for consummation by a response systems and management systems.

ƒ Feinstein et al. [RFC4767]: IDXP – an application level protocol for exchanging data between intrusion detection entities.

ƒ Mitre’s Common Event Expression (CEE): establishes consistent log formats and terminology.

(8)

ƒ Introduction

ƒ Related work

ƒ System design and architecture ƒ Using the SEM framework

ƒ Lessons learned

ƒ Research agenda in SEM

(9)

System design and architecture Tools and technologies used:

ƒ Event correlation engine: Bell Labs/Alcatel-Lucent correlation engine used in fault management systems.

ƒ Intrusion detecting systems used: ƒ Open source: snort

ƒ Bell Labs: HTTP CLF and SSH LF analyzer

ƒ HTTP: >= 2000 attacks from CVE dictionary;

ƒ Statistical inference module: triggered on inter-arrival time, errors generated, or

links accessed;

ƒ Exponential weighting module: if (flow utilizing >= 75% link bandwidth) drop, diffserv, …

ƒ SSH login failed attempts

ƒ Bell Labs: Filesystem integrity checker

ƒ Load generators:

ƒ Open source: Nikto – web load generator, nmap, and snort

ƒ Firewall: Bell Labs/Alcatel-Lucent firewall providing session establishment rate limiting, traffic rate limiting, IP address/header inspection, etc.

(10)

System design and architecture

Events from HTTP and SSH log analyzer

Events from snort Perl logger

Events from firewall logger

Datasource Datasource Datasource Data collector System logs Data distributor Correlation engine Display Alarm Human intervention/ analysis Execute a rule on a firewall or a policy decision point

(11)

ƒ Introduction

ƒ Related work

ƒ System design and architecture

ƒ Using the SEM framework ƒ Lessons learned

ƒ Research agenda in SEM

(12)

Using the SEM framework

Firewall Machine A: Runs a web server

and hosts user accounts. It has the SSH and HTTP log analyzer running on it.

Machine B: Runs snort and hosts the snort Perl logger

Machine C: Runs the management console to control the firewall

Machine D: SEM system

(13)

Using the SEM framework Find vulnerabilities Break in and expand access Probe

Typical attack cycle

Stage 1: Network reconnaissance

P

F B

Network scan by nmap.

Events generated: snort, firewall (incoming packet rate-limit violation)

Correlation done to associate these events to be part of the same attack.

Remedial action: Issue denunciation to block offending IP.

(14)

Using the SEM framework Find vulnerabilities Break in and expand access Probe

Typical attack cycle

Stage 2: Find and exploit vulnerabilities

1. Machine A targeted.

HTTP exploitation using nikto.

Events generated: Bell Labs HTTP CLF generator.

Remedial action: Issue denunciation to block offending IP. 2. Continue port scans on A.

Discover a server on a high port.

Attack the server (buffer overflow, deus ex machina) Get password file; run password cracking program.

P

F B

(15)

Using the SEM framework Find vulnerabilities Break in and expand access Probe

Typical attack cycle

Stage 3: Break-in and expand access

1. Machine A compromised.

Attacker logs in using ssh; no event generated.

Attacker uses wget to fetch a trojan program; no event generated. Executes trojan; makes outbound connection.

Events generated: snort outbound connection.

Rule fired: No non-HTTP outbound connections allowed. Mitigation action: File integrity check on user.

AT THIS POINT, HUMAN INTERVENTION REQUIRED. ATTACK SUCCEEDED! P F B P F B

(16)

ƒ Introduction

ƒ Related work

ƒ System design and architecture

ƒ Using the SEM framework

ƒ Lessons learned

ƒ Research agenda in SEM

(17)

Lessons learned

1. Network fault management complements, but is different than SEM. Can a NFM be transformed to a SEM? No, not quite.

Root-cause analysis is different. NFM:

y Appropriate solution to address problem is valid once root cause is found.

y Root cause is normally a single component in a fixed geographical location.

y Damage caused typically does not escalate with time.

SEM:

• Attack proceeds in phases; characterized by [1...N] sources accessing [1...M]

destinations, cardinality of N,M is disjoint.

• Root cause proceeds from P->F->B and the cycle repeats with another element in

[1...M].

(18)

Lessons learned

Can a NFM be transformed to a SEM? No, not quite. Network attacks are dynamic.

NFM:

• Many categories of root causes, but small and predictable set of events for identifying root causes.

• Threshold settings relatively stable, and correlation rules change infrequently and are a function of network topology, which is generally static.

SEM:

• Few categories of root causes, but a great many and unpredictable set of events.

(19)

Lessons learned

2. Event thresholding, correlation and mitigation must be pushed to the edges. ƒ Event flooding is a problem for central correlation.

ƒ Use P2P techniques, not to search, but to efficiently divide the space. 3. Security event records must be designed for SEM consumption.

ƒ Firewalls produce many records for a session, but these records did not have a flow label.

ƒ Establish CLF for protocols – HTTP has one, why doesn’t SSH? SIP? SMTP? 4. Adaptive remediation strategies.

ƒ Network operators are reluctant of automatic policy control.

(20)

ƒ Introduction

ƒ Related work

ƒ System design and architecture

ƒ Using the SEM framework

ƒ Lessons learned

ƒ Research agenda in SEM ƒ Open floor

(21)

Research agenda in SEM

Today: Building a SEM is a task in integration and “glue programming.” ƒ No formal language from SEM to control or query edge devices. ƒ No formal language from edge devices to SEM for reporting. Research plan:

ƒ Better network reconnaissance techniques.

y Today’s focus is on DDoS attacks => lot’s of events generated.

y Can we detect a cracker that has created zombies on your network and logs into

the master zombie server to issue a 1-character command?

ƒ Develop resilient protocols.

y Ironically, it is precisely when a network is under attack that it is least able to

devote bandwidth resource for informing a SEM system.

y “Parsimonious Protocols”: idempotent, self-contained, minimal retransmissions and

ACKs – 20% packet loss, 5 copies of a message sent Î 99.6% probability that at least one copy will get through.

y From SEM to edge devices, the protocol must be more than a “TCP connection”.

(22)

Research agenda in SEM

ƒ Policy language and rule-based systems.

y What information should be collected by edge devices? How?

y Can anomaly detection be better done through rule-based systems (AI)?

ƒ Device modeling.

y How to provide SEM with characteristics of each controlled device? Location of

each controlled device? Can a device “learn” from the events so it only reports events of interest to the SEM?

ƒ The effect of network topology on correlation rules.

y Specifics about network topology is embedded in rules and actions encoded in a

SEM system. Will changing the network topology break these rules? Can the ruleset be automatically changed to allow for a topology reconfigaration?

ƒ Integration with OAM&P.

y Many SEM rules end up modifying an ACL at a traffic control point because many

suspect events occur in a short timeframe.

(23)

Open floor Thank you!

Vijay K. Gurbani

Bell Laboratories, Alcatel-Lucent tel: +1 630 224-0216

mailto: [email protected]

References

Related documents

Interventions in Primary Health Care  EIBI are not yet a regular part of GPs’ everyday work..  Anyway, not so small percentage of patients

This paper postulates a “Big Five” marketing strategic thought of Sichuan Operation Management District of the United China, namely, “strategy integration, marketing

Interestingly, neither compound 5 nor isosteviol caused an increase in the percentage of depolarized dead cells after 5 h of treatment; however, cells treated with the uncoupling

The soft TQM factors such as the Workforce commitment, Shared vision and Cooperative supplier relations are each significantly related to three out of four hard TQM elements (Use

The aim of this study is to propose the relationship between organizational commitment, organizational culture, employer branding and organizational performance with the mediating

This development of exports and prices are consistent both with the use of market power by using up water in the summer season by exporting in order to benefit from a higher price

The  rates  of  the  Electricity  Charges  payable  to  RSO  by  the  Applicant  for  the  Electricity  Services  will  be  be  based  on  the  reasonable  costs 

They don’t want us to know that the things we suspect are extraordinary about ourselves are real.. I believe that if everyone sees what just a few people become when they wholly