Incident Management Get Your Basics Right

(1)

(2)

Introduction

•  Neil Thomas

–  Industry experience in IT & IT support –  ITIL Vendor Product Management –  ITIL Consulting

(3)

•  Fully Accredited ITIL Training •  Fully Accredited SDI Training •  ITIL Consultancy

•  eLearning

•  Social Media Training & Consultancy •  Industry Webinars (ITSM & SM)

•  Industry/Organizational Podcasts

•  SDI Partner for Social Media Courses

(4)

The Webinar Series

•  Service Catalog •  Developing a CMDB •  Incident Management •  Problem Management •  Change Management

(5)

Topics today

•  Incident Management & ITIL •  Service Desk

•  Incident versus Service requests •  Other Incident Workflows

•  Knowledge

•  Service Level Agreements •  Incident & Problem

(6)

(7)

If Something Goes Wrong (Incident Management)

Service

If Something Keeps Goes Wrong (Problem Management) What Delivers it (Configuration Management) Need to Improve or Resolve Problems (Change Management) Managing it (Service Portfolio Management & Financial

Management)

Ensuring it’s there in the Future

(Availability Management & Capacity Management &

Service Continuity Management)

Delivering Agreed Changes to Business

(Release Management)

User Needs Something

(Service Requests & Service Catalogue)

How Quickly do we Support

(8)

Incident Management

•  Restore normal services AS QUICKLY AS POSSIBLE while minimizing the impact

•  Incident definition:

(9)

Key Elements

•  Incidents – ANYTHING hardware and software errors •  Reported by email, phone self-service, Twitter etc

(10)

Key Elements

•  Incident detection & recording •  Classification & initial support •  Investigation & diagnosis

•  Resolution & recovery •  Incident closure

•  Ownership, monitoring, tracking, & communication

(11)

No

End No No

From Event Mgmt From Web Interface User Phone Call Email Technical Staff

Incident Identification

Incident Logging

Incident Categorization

Service Request? Yes

Incident Prioritization

To Request Fulfilment

Major Incident Procedure Yes Major Incident?

Initial Diagnosis

Yes Functional Escalation 2/3 Level

Yes _{Functional Escalation} Needed? Investigation & Diagnosis Resolution and Recovery Hierarchic Escalation Needed? Yes Management Escalation Incident Closure

The Incident Management Process

(12)

Record

•  Normally recorded by a Service Desk •  Record all incidents

•  Ensures compliance with SLAs •  Records all relevant data

(13)

Categorize

Effective categorization of incidents has two aspects:

•  Classification to determine incident type (for example IT Service = degraded)

(14)

Priority/Severity Level 4 No Business Impact

No loss of service or resources

Priority/Severity Level 3 Minor Business Impact

Minor loss of service or resources

Priority/Severity Level 2 Serious Business Impact

Severe loss of service or resources acceptable workaround

Priority/Severity Level 1 Critical Business Impact

Complete loss of service or resources and work cannot reasonably continue - the work is considered “mission critical”

(15)

Escalate

•  Rapidly escalate incidents according to agreed service level •  allocate more support resources if necessary

•  Escalation can follow two paths:

–  Horizontal escalation is required when the incident needs to be

escalated to different SME groups that are better able to perform the Incident Management function.

–  Vertical escalation is where the incident needs to gain higher levels of priority.

•  Rules to ensure timely escalation

•  For every resolution attempt, accurate data must be

(16)

Resolve, Recover & Restore

•  Check for known errors and use any “workarounds” •  Resolving the Incident with solutions or workarounds

•  For some solutions, a Request for Change (RFC) will need to be submitted

•  Service Desk confirms with the user the error has been rectified and that the incident can be closed

(17)

Key Functions

•  Take ownership for an incident

•  Provide a prompt recovery of the business within SLA •  Keep the focus on the incident (no blindsiding)

•  Escalating incidents: functional (higher technical skill) •  Escalating incidents: hierarchical (manager decision) •  Keep the customer informed

(18)

(19)

Service Level Agreements

•  Negotiated and AGREED level of response WITH organization

•  Different SLA’s for different:

–  Priorities

–  Configuration Items (assets) –  Service

–  User

•  Appropriate to organizations needs •  Aim to RESTORE service asap given

(20)

Major Incident

(21)

Problem Management

•  A Problem is the cause (typically unknown) of one or more incidents. Activities include:

–  Analyze and identify the root cause of one or more incidents

–  Validate and publish the workaround for incidents whose cause is known (known error)

(22)

Known Errors

(23)

(24)

Knowledge & Incidents

Use of in Self Service

•  Self help (knowledgebases, FAQs etc) •  Script based help

•  Record that it self help has been used

Use of to Construct Knowledge

(25)

Service Desk & Incidents

•  Incident logging •  Customer

satisfaction •  Prioritization

•  First line support •  Request fulfillment •  Escalations

•  Communication

(26)

Know when to stop !

•  Beware over analyzing

•  Appropriate Management Information

–  Closed 4,000 calls

–  Received 45,000 SNMP Traps

•  SIGNIFICANCE •  Why Measure?

•  What is IMPORTANT to the Organization •  Key Performance Indicators

–  Customer satisfaction –  Time to resolution

(27)

(28)

Is it a bird or…?

(29)

Is it a bird or…?

•  Define the Process •  Manage by Priority •  Set realistic SLA’s OR

(30)

New Hire Process

HR Tasks

•  Recruitment request signed, attached and filed •  Recruitment offer signed, attached and filed •  Offer letter and T&C’s sent to candidate •  Signed letter back from candidate

•  Starter letter sent out to candidate

•  Created new employee in external systems •  Personal details completed

•  Informed payroll / reception

•  Collected acknowledgement forms: •  Employee handbook

•  H&S policy •  IT Policy

•  Induction arranged

•  References – requested & received •  Healthcare cover arranged

•  Pension arranged •  Parking permit issued •  Business cards arranged •  End of probation letter sent

IT Tasks

•  PC/Laptop •  Network ID •  Email

•  Telephony – Internal, Cell •  Security card

•  Application access

FM Tasks

(31)

Incident & Change

•  Accurate analysis

•  Identification of Configuration Items

(32)

Configuration Management

•  Defines WHAT delivers a SERVICE

(33)

Why Incident Management?

•  Knowing which Service is most important Incidents to be prioritized •  Defines who a user/customer contact, what is the expected fix time etc •  If not then we fight the same fires over and over again

•  Building better and more repeatable process around this firefighting will drive efficiency and effectiveness and overall greater quality

•  Builds on the body of knowledge of a call

•  DOCUMENTS what has happened, who did what and when •  Stops duplication of work

(34)

Incident Management Get Your Basics Right

Introduction

The Webinar Series

Topics today

Service

Incident Management

Key Elements

Key Elements

The Incident Management Process

Record

Categorize

Escalate

Resolve, Recover & Restore

Key Functions

Service Level Agreements

Major Incident

Problem Management

Known Errors

Knowledge & Incidents

Service Desk & Incidents

Know when to stop !

Is it a bird or…?

Is it a bird or…?

New Hire Process

Incident & Change

Configuration Management

Why Incident Management?

Q & A Time…….