Table of contents
Network Application Performance
Alignment to IT Best Practices
This white paper briefly describes best practices, highlights IT best practices and discusses in detail; IT business service management, the importance of application performance management (APM) automation and the capabilities and benefits of Fluke Networks’ APM product solutions
Best Practices . . . . 2
Business Service Management . . . . 2
Business Service Management Transition 3 Application Availability and Performance 4 APM Alignment to IT Best Practices . . . . . 5
Fluke Networks Application Performance Management . . . . 6
Brief Overview of Network based APM . . 6 Deploy . . . . 6
Manage . . . . 7
Solve . . . . 7
Optimize . . . . 8
Best Practices
Over time, competition, innovation and maturation of certain sectors and business functions drive the creation and implementation of best practices. Industry sector examples such as construction and manufacturing have been well-documented and benchmarked as signifi-cant best practice examples. Business performance improvements driven by best practice adoption in these industries are quantified and empirical. In contrast, technology (IT) can be characterized as emerging in this regard, just beginning to fully focus on best practices disciplines.
IT best practices can be summarized as a set of methods, techniques, processes, behaviors and automation that when consistently applied produce increasingly effective and efficient results. The portfolio of available IT process frameworks (ITIL, CMMI, etc), standards, methodologies and automation solutions is extensive.
The primary goal of IT best practices is to optimize the alignment and utilization of technology resources towards the execution of business objectives.
Business Service Management
Business Service Management (BSM) is a key area of focus within IT best practices. BSM is the practice of quantifying, monitoring, measuring, reporting and managing IT services from a business perspective. Examples of business services include application performance and availability, fulfillment of basic services (such as a new account opening), desktop support and many others. Prior to best practice adoption, a majority of IT organizations delivered services from the technology “silo” or domain perspective. Technology domains include areas such as mainframe and network. Domain based management and reporting resulted in misalignment of expectations and results between IT and business stakeholders. The misalignment drove the need for a perspective change to business service management.
Network Application Performance
Alignment to IT Best Practices
Desktop Network System Database
Business Services
Email Application
Email Availability = 94% (User Experience) Business Service Versus Technology Domain Availability
Business service management best practices generally include the creation of service level agreements between IT and the business stakeholder or customer. Significant transformative activities involving people, process and technology are required by IT to transition from technology domain management to the management of discrete business functions and services.
Technology monitoring and management automation play a critical role in enabling and achieving BSM objectives. “End to end” or holistic visibility of application availability and performance is a critical capability for enabling IT to transition to business service management. Existing technology domain specific automation capabilities (monitoring, management and diagnostics) remain critical to BSM. Ultimately, an integrated eco-system comprised of existing domain automation tools and holistic application availability and performance management automation solutions is needed to achieve best practice BSM objectives.
Business Service Management Transition
IT organizations transitioning to best practice BSM are tasked with leveraging, re-configuring and supplementing existing automation investments to achieve BSM goals. Managing applications holistically with traditional technology domain and overarching “manager of manager” tools is difficult, if not impossible without supplemental automation capability and improved full lifecycle process discipline. Orderly transition of automation to BSM objectives requires IT to document its current automation architecture, define the future-state architecture, perform a gap analysis, select supplemental solutions, and implement the projects that achieve the new automation architecture.
One or more of the automation transition projects will need to generate a “quick win” so that the IT team can demonstrate application and service management capability progress to justify continued investment. Network resident application performance management appliances deliver proven “quick win” or rapid time to value for holistic application and business service management capabilities. These network centric solutions address the immediate challenge of quick starting BSM automation and they easily integrate into the longer term strategic eco-system architecture.
Figure 2: Identifies the monitoring functions, processes and technology domains that are addressed, supported and enhanced by Fluke Networks solutions. Automation Architecture Without Fluke Networks Solutions
Enhanced Visualization Web Portal
Web
Integration Service Views
Policy Engine
App Views Web Portal
Web
Integration Service Views App Views Tech Support Tech Ops Workflow Automation Domain Views Domain Diagnostics Information
Sources ProtocolsEnabling TechniquesCollection Integration Techniques Integration Services Network Systems Application Middleware Database Performance Capacity Security Storage Management Domains Configuration Info Process Automation
Consolidated Event Store
Automation Architecture With Fluke Networks Solutions
Enhanced Visualization Web Portal
Web
Integration Service Views
Policy Engine
App Views Web Portal Web
Integration Service Views
App Views Tech Support Tech Ops Workflow Views Domain Views Domain Diagnostics Information Sources
Fluke Networks Supported Functions and Processes
Enabling
Protocols TechniquesCollection Integration Techniques Integration Services
Network Systems Application Middleware Database Performance Capacity Security Storage Management Domains Configuration Info Process Automation
Consolidated Event Store
Application Availability and Performance
Applications are the lifeblood of business. Underlying network infrastructure is the circulatory system that delivers that lifeblood. Business service management success is contingent upon well designed, implemented and managed applications and supporting infra-structures. Customer relationship management (CRM), sales force automation (SFA), enterprise resource planning (ERP) and Electronic Health Record (EHR), Patient Data Record (PDR) and other applications are fundamental to daily business operations. Application disruptions have significant and tangible costs. Service availability and management of these critical applications has become a key business metric garnering executive attention and requiring cross-functional alignment and management.
Figure 3: Today’s applications often run on networks and a host of other inter-related infrastructure components that are complex and constantly changing. Ensuring the availability of a single infrastructure component such as a server is not sufficient to satisfy business service management requirements.
Threats to application availability and performance come from many potential underlying causes. Examples of these underlying causes include: changes to the infrastructure or applications, security breaches, the implementation of new applications; and component error within the deployed application or infrastructure. Responsibility for ensuring application availability is shared across multiple disciplines within IT. These cross functional teams need an automation tool that provides proactive identification and automated probable cause analysis of service deviations or disruptions.
Holistic application availability and performance management is a difficult challenge as IT personnel lack enterprise-wide visibility of applications and their underlying infrastructure components. Ensuring application availability depends on the ability to dynamically
LB
Web URL (Agg)
TCP 8080
Web Portal NAT 192.168.16.71 ACME WEB-Portal VIP=192.168.16.11 192.168.16.11 192.168.16.12 ACME CL1 VIP=192.168.15.147 SQL=192.168.15.148 192.168.15.139 192.168.15.140 ACME CL2 VIP=192.168.15.149 SQL=192.168.15.150 192.168.15.142 192.168.15.143 ACME App TCP 5000-5100 Windows Svcs TCP 135 ACME App TCP 5000-5100 Mainframe TCP 1414.1515.1815 Mainframe TCP 3040 Windows Svcs TCP 135 Windows Svcs TCP 445 192.168.15.64 /26 192.168.160.102 192.168.3.51 192.168.160.102 Authentication TCP 88 IBM Host *GW connection to
IBM Host using SCON 192.168.15.87
192.168.15.88 local VLAN traffic
to APP servers ACME APP2
ACME Main Frame ACME App1
192.168.15.84 192.168.15.85
ACME App3 172.27.15.90
VIP = outside VLAN query SQL = server to server query 192.168.15.128 /26 ACME CL3 VIP=192.168.15.151 SQL=192.168.15.152 192.166.15.145 192.168.15.146 DC TCP 443 IntAuth 192.168.16.0 /26 192.168.14.11 ACME WEB-VIP-ACE 192.168.14.192 /26 ACME WEB/Outside 192.168.15.0 /26 ACME WEB/Inside Public Web TCP 443 W
eb Outside Server Aggregation W
eb Inside Client Aggregation 192.168.14.20 192.168.14.21 192.168.14.22 192.168.14.23 192.168.14.24 192.168.14.25 192.168.14.26 192.168.14.27 ACME App Server aggregation 192.168.15.75 192.168.15.76 192.168.15.77 192.168.15.78 192.168.15.79 192.168.15.80 192.168.15.81 192.168.15.82 192.168.14.11 192.168.14.12 192.168.14.13 192.168.14.14 192.168.14.15 192.168.14.16 192.168.14.17 192.168.14.18 Database TCP 2255 Windows Svcs TCP 445 Database TCP 1876 Database TCP 1732
APM Alignment to IT Best Practices
Deployment of datacenter resident Application Performance Management (APM) appliances provides holistic transactional visibility to business critical applications. APM information combined with network flow information provides “end to end” transaction visibility including the network components traversed. APM appliance solutions passively monitor application transactions as they occur. These appliances differentiate between – and report response time for – user, network and application tiers. Transaction details and summary information are stored in the appliance database. Detail and summary information is leveraged for troubleshooting, base lining, error detection, performance and availability analysis, trend reporting and many other functions.
APM aligns with, supports and optimizes many BSM best practice disciplines. Examples of the BSM disciplines and a description of the supporting APM functionality, include:
BSM Discipline APM Functionality
Event Management • • Availability and Performance alerts and events are issuedPerformance deviation based events are issued Alerting classification by application, group and location •
Incident Management
“Engineer in a box” that monitors application transactions •
Detects deviations from baseline, provides alerting •
Provides probable cause reducing mean time to resolve •
Ability to verify resolution or fix results •
Availability Management • • Application availability service level reportingTrend reporting by application, user group and location Analysis to support process and application improvement •
Performance Management • • Application performance service level reportingTrend reporting by application, user group and location Analysis to support process and application improvement •
Problem Management • • Provides automated probable cause analysisStores application transaction information for forensic root cause tasks and historical reporting
Capacity Management • • Provide application specific utilization trendingPre and post application release utilization analysis Supports network capacity planning (add network flow) •
Change Management
Pre and post application release utilization analysis •
Identify users and applications potentially impacted by planned changes •
Detects and alerts to “NEW” traffic patterns within the infrastructure •
Service Level Management • Availability and performance information support design improvements
Fluke Networks Application Performance Management
Network based APM systems have emerged as an important advancement in IT business service management. They provide complete, enterprise visibility of network, application and service usage. APM solutions identify deviations, errors and disruptions that impact ser-vices. APM solutions accelerate achievement of IT business service management goals.
Brief Overview of Network based APM
Network based APM solutions improve application availability and performance through functionality that includes: Real-time application discovery
•
Application profiling and base lining •
Behavior and policy monitoring •
Isolation of applications/systems/users that are affecting application availability •
Probable cause analysis to accelerate availability issue resolution •
APM solutions support best practices that enable IT organizations to continually improve availability and performance of business critical applications. Fluke Networks APM solution demonstrates value in a multitude of business technology scenarios. The following paragraphs are used to highlight several of those scenarios or use cases.
Deploy
Deployment of new applications and WAN services can be challenging in today’s complex network environments. Some of the most challenging aspects of change lie in datacenter consolidation, MPLS WAN migration, and implementation of Voice over IP.
Inability to baseline application and underlying infrastructure perfor-mance across the distributed enterprise severely limits pre and post change analysis capabilities. Pre and post change measurement and comparison capability is needed to identify, quantify and resolve change related performance issues. Fluke Networks solution meets the following business needs:
Validate network readiness •
Reduce impact by quantifying actual bandwidth requirements •
Compare current application performance to historical •
Validate and maximize CoS configuration •
Monitor bandwidth allocation and protocol makeup for ALL traffic •
Measure “new” application utilization and more accurately estimate •
production impact during development and testing
Ensure delivery of VoIP services with quality metrics for every link and call-by-call details •
Visual Performance Managers application performance dashboard allows you to quickly see the worst performing applications, servers and sites
Manage
There are two types of situations that organizations must be prepared to manage. They are planned changes and unexpected events. Each has its own unique challenges and IT organizations should have a comprehensive management strategy in place.
Effective strategy for managing planned change includes the ability to establish a performance baseline. The pre-change baseline allows IT to measure the impact to performance and end-user experience after a change.
Effective strategy for managing unexpected events should include the ability to see throughout the distributed enterprise. The supporting management solution should provide rapid in-depth analysis that isolates
the error condition within an application, system or network. The solution should provide a single point of reference that all stakeholders within IT can reference. The single reference allows disparate teams to collaborate and come to faster incident resolution. Fluke Networks solution addresses the following business needs:
Validate the impact of change (network and application) •
Leverage multiple sources of information under a single pane of glass •
Immediately identify the affected resources (application, server or network) •
Understand current performance in context with historical baselines •
Visualize trending for performance degradation •
Report on bandwidth allocation and application growth •
Solve
One important element of an effective management strategy is being able to quickly identify error conditions down to the source technology domain component. Accomplishment of this task requires a solution that is not limited in visibility to one technology domain. The solution must “see” the application path across the enterprise, understand the component inter-relationships and identify the error causing source.
Fluke Networks realizes this fact and has designed a robust unified solution that utilizes different types of data sources depending on the issue at hand. Most importantly, the reporting, regardless of data source, is rolled up into a centralized view with easy to understand charts and graphs. These views are tailored to individual user reporting require-ments. Technology domain error isolation is simplified and the cause of a performance or availability issue is quickly pinpointed. Mean Time to Resolution (MTTR) durations decrease and end user Quality of Experience (QoE) improves. The following business needs are addressed:
Display real-time network, application, and server performance •
Understand the severity and impact of incidents and error conditions •
View and report on every conversation, host and protocol traversing the network •
Identify users, sites and applications affected by a slowdown or outage •
Fast problem resolution with visibility into business critical multi-tier applications with time-correlated backend transactions Cisco WAAS “Aware” reports quickly show response time measurements
Optimize
The proliferation of converged voice, video and data networks creates some unique challenges for IT organizations as they align with corporate business objectives to meet end-user quality of experience requirements. The proliferation of converged voice, video and data networks creates some unique challenges for IT organizations as they align with corporate business objectives to meet end-user quality of experience requirements. An effective QoE management strategy begins with the ability to monitor critical applications across distributed enterprises with end-to-end visibility.
In order to effectively take control of these converged environments, IT organizations must have broad visibility and in-depth analysis capabilities
that show how one service is affecting another service. Fluke Networks capabilities provide the following business benefits: Merge and leverage data sets from different analysis sources (NetFlow, SNMP, probe-based stats, application performance •
management appliances, etc.) Customized application growth reports •
Centralized web access •
Ability to consolidate and leverage troubleshooting information •
Conclusion
Fluke Networks provides a portfolio of tightly integrated solutions that result in network based comprehensive application availability and performance management. The integrated Fluke solutions provide a single holistic view with analysis capability through the Virtual Performance Manager product.
Fluke Networks pre-integrated appliance based solutions deploy in the fraction of the time it takes to reconfigure and supplement exist-ing manufacturer monitorexist-ing frameworks or best of breed eco-systems. Fluke Networks solutions provide rapid business service manage-ment value, are compatible with major framework products and quick-start business service managemanage-ment transformation.
Fore more information on Fluke Networks Application Performance Solutions, visit www.flukenetworks.com/vpm.
Contact Fluke Networks: Phone 800-283-5853 (US/Canada) or 425-446-4519 (other locations). Email: [email protected].