Alert Tuning Solution Accelerator

Published: November 22, 2004
On This Page
Executive Summary
Introduction
Alert Tuning Overview
Processes and Activities
Appendices

Executive Summary

In the increasingly important and complex world of information technology (IT) operations, it is essential to implement a robust and reliable systems management infrastructure based on proven methods. Service Monitoring enables data center managers to increase operational efficiency and to achieve higher availability for mission-critical applications and Microsoft® Windows® services. Increased levels of performance can be achieved on Windows platforms through the implementation of Service Monitoring, which incorporates best-practice guidance in planning, designing, building, and deploying Microsoft Operations Manager (MOM) 2005 to monitor Windows applications and services.

An overwhelming volume of alerts has been ranked as one of the most crippling issues faced by IT operations. False alerts can create “noise,” which adds costs in headcount, causes inefficiencies for operators having to navigate through an overload of alerts, and perhaps most importantly, creates operational ineffectiveness—resulting from delays in response to legitimate alerts. In such an environment, it can be desirable to adjust Management Packs for a lower level of alert noise.

Under most circumstances, Management Packs will be applicable for the majority of organizations without any adjustments (such as alert tuning). This document is intended to assist large organizations, with complex deployments, in understanding how to utilize alert tuning to achieve the maximum benefit from MOM 2005 and its Management Packs. This process might require a significant up-front investment but, for enterprise IT organizations, it can yield significant benefits over time.

Alert tuning offers increased operational efficiency through:

A reduction or prevention of service incidents through the use of proactive remedial action.

Faster and more effective responses to service incidents.

Improved overall availability of services.

An increase in user satisfaction.

Introduction

Document Purpose

This guide provides detailed information about alert tuning for organizations that have deployed, or are considering deploying, Microsoft Operations Manager (MOM) in a data center or other type of enterprise computing environment. This information consists of prescriptive guidance and custom Microsoft SQL Server™ reports.

Background

Alert tuning is a practice that is part of the Service Monitoring and Control (SMC) service management function (SMF). The SMC SMF is one of 21 SMFs (shown in Figure 1) defined and described in the Microsoft Operations Framework (MOF) Process Model. Every SMF within MOF benefits from some aspect of SMC because these functions are inherent to ongoing process improvement. This is especially true in the Operating Quadrant of the MOF Process Model, where the SMFs are closely interrelated.

Figure 1. MOF Process Model and related SMFs

Appendix B: Key Performance Indicators contains statistics that should be reviewed to understand the performance of SMC as well as to identify opportunities for improvement.

Intended Audience

This guide is intended for the IT professional in data centers and in large and enterprise organizations. MOM administrators, help-desk personnel, and others involved in service monitoring and control should find this guide helpful. It assumes that the reader is familiar with the intent, background, and fundamental concepts of MOF, MOM, and other Microsoft technologies discussed. (Links to further information are contained in Appendix A: Resources.) An overview of MOF and its companion, Microsoft Solutions Framework (MSF), is available at http://www.microsoft.com/technet/itsolutions/cits/mo/mof/default.mspx.

Terminology

This guide uses terminology that is current for MOM 2005. Table 1 lists the changes in terminology between MOM 2000 Service Pack 1 (SP1) and MOM 2005.

Table 1. Terminology Changes Between MOM 2000 SP1 and MOM 2005

MOM 2000 SP1                               MOM 2005

zone configuration group (middle tier)     source management group

master configuration group (top tier)      destination management group

DCAM                                       (MOM) Management Server

processing rule group                      rule group

processing rule                            rule

Feedback

Please direct questions and feedback about this guide to msmfeed@microsoft.com.

Alert Tuning Overview

Goals and Objectives

This chapter specifies the reasons for implementing alert tuning; in addition, it lists key phases and high-level requirements as they pertain to alert tuning.

Alert tuning is the process of reviewing a Management Pack to determine its applicability to a specific environment; it also involves developing an IT operations plan that is built around Microsoft Operations Manager (MOM). Management Packs are designed to produce alerts only for conditions that require action on the part of an administrator. However, variations in specific operating environments can lead to “noise,” a barrage of false alerts that can become overwhelming for operators to process and that overshadows alerts with real value. Noise is often a symptom of a systems-monitoring capability that has not been optimized. Sources of alert noise can include false positives, false negatives, nonactionable alerts, and multiple alerts with the same root cause.

By adjusting Management Packs for a lower level of alert noise, the highest possible level of operational efficiency can be achieved. The result is that IT operations is able to introduce a more qualified, internally developed Management Pack into the MOM environment. This makes the MOM tool an even more relevant and trusted source for SMC alerts.

The primary goal of alert tuning, therefore, is to increase operational efficiency through the enhanced effectiveness of MOM and the overall effectiveness of service monitoring and control.

The successful implementation of alert tuning achieves the following objectives:

Reduction in the number of false alerts

Rapid resolution of actual and potential service breaches through the identification of actionable alerts

Reduction in investigation of service breaches through the identification of valid alerts

Availability of up-to-date infrastructure performance data from an efficiently running MOM infrastructure

Key Phases

Alert tuning is accomplished in seven steps:

1. Alert Tuning Preparation. Gain consensus on the tuning activities, ensure the entrance criteria are met, and create the lab environments for the specific MOM Management Pack being reviewed.

2. Health Specification/Health Model Creation and Review. Manually trace the Health Specification, Health Model, and Management Pack, as applicable. These are reviewed for validity, event-to-alert mapping, rules mapping, and the ability to act upon the findings (that is, actionability).

3. Isolated Lab Validation. Test the net effects of introducing the Management Pack on a management group that carries no production load. This allows the introduction to be assessed in isolation on a scaled-down MOM infrastructure and client base.

4. Pre-Production Lab Review. Analyze the Management Pack behavior in a multihomed lab environment. This environment should include actual production conditions.

5. Preparation and Deployment Review. Perform a final analysis of all results and prepare for deployment of the tuned Management Pack into production.

6. Deployment of the Tuned Management Pack. Transfer the Management Pack from the pre-production environment to the production environment.

7. Run-Time Alert Tuning. Provide ongoing tuning of the Management Pack once it has been introduced to a production environment through ongoing assessment, tuning optimization, and feedback to development.

These phases are discussed in Chapter 4, “Processes and Activities.”

High-Level Requirements

The successful implementation of alert tuning includes the following requirements:

A good understanding of the Service Monitoring and Control SMF and fulfillment of its requirements. This SMF is available at http://www.microsoft.com/technet/itsolutions/cits/mo/smf/smfsmc.mspx.

Completion of entrance criteria as outlined in the “Alert Tuning Preparation” section, later in this document.

An intermediate to advanced understanding of MOM. Further information is contained in the MOM 2005 documentation.

Processes and Activities

This chapter shows what processes and activities must be completed to implement alert tuning—from initial activities, such as gaining consensus on the tuning activities, to final activities, such as using feedback to improve subsequent implementation cycles.

When implementing alert tuning, organizations should adhere to the Microsoft Solutions Framework (MSF) life cycle and project-focused guidance. MSF provides a flexible and scalable framework for planning, building, and deploying business-driven solutions. More specifically, the MSF Process Model—with its Envisioning, Planning, Developing, Stabilizing, and Deploying Phases—should be applied to the implementation process.

Alert tuning also requires close coordination with other Service Monitoring and Control (SMC) activities. These activities include the six core processes that an IT organization follows to fully adopt SMC. Figure 2 illustrates the relationship between these SMC activities and the steps involved when implementing alert tuning.

Figure 2. Steps in alert tuning as they relate to the six core processes of SMC

Those involved in alert tuning should be familiar with the MOF Service Monitoring and Control SMF (which describes these six core processes) and with the guidance related to it. Further information is available at http://www.microsoft.com/technet/itsolutions/cits/mo/smf/smfsmc.mspx.

The following sections describe the steps required for implementing alert tuning.

Alert Tuning Preparation

Overview

The first step in implementing alert tuning is alert tuning preparation. The objective is to ensure the entrance criteria are met, obtain customer and IT consensus regarding tuning activities, and create the lab environments for the specific MOM Management Pack being reviewed.

Entrance Criteria

This preparation step includes the following entrance criteria:

1. Select a Management Pack to tune, which can be either:

   a. A Management Pack that has been acquired from a vendor for a commercial off-the-shelf (COTS) software product.

   b. A Management Pack that has been created by IT operations for an internally developed application.

2. Formalize the Health Model or Health Specification, which can be either:

   a. A complete Health Model (for internally developed software), which was the basis for the Management Pack created by IT operations.

   b. A Health Specification, which is derived from the vendor Management Pack for a COTS application. This can also be a manual walkthrough of the Management Pack itself.

A Health Specification (also called a Health Model for internally developed software) documents significant information used for monitoring a specific component. This may include all actionable events, event exposure and behavior, and instrumentation protocols and behavior. Ideally, this information is directly codified into a language or configuration dataset that MOM can use. Further information about the Health Model and Health Specification is contained in the SMC SMF, available at http://www.microsoft.com/technet/itsolutions/cits/mo/smf/smfsmc.mspx.

Preparation Activities

The following figure illustrates the four key activities needed to prepare for alert tuning, which are detailed in this section. However, in addition to performing these activities, you should consult both the SMC SMF and MSF when implementing any service-management capability.

Figure 3. Activities in the preparation process for alert tuning

Form Agreement on Tuning Activities

The first preparation step is for all the involved teams to reach consensus on the following activities:

Scope of the review process

Appropriate and required participants

General schedule for the review process

Other resources required

Table 2 provides a sample timeline (in business days). Actual schedules might vary; for first-run organizations, additional time for tuning execution might be required, since these Management Packs often present more opportunities for improvement.

Table 2. Sample Alert Tuning Timeline

Identify Participants
1. Identify alert tuning team and service owners. (5 Days)
2. Formalize criteria for alert tuning deliverables. (5 Days)

Review Health Model/Health Specification
1. Deliver model/specification documents.
2. Review material, provide feedback, and integrate feedback into candidate Management Pack. (15 Days)

Test and Deploy Management Pack
1. Deliver build of Management Pack to IT operations.
2. Deploy Management Pack to isolated lab.
   a. Install Management Pack into lab management group and run Management Pack against lab agents, capturing key performance indicators. (See Appendix B: Key Performance Indicators, for a description of each.) (2 Days)
   b. Gather and report results of lab run. (1 Day)
   c. Buffer for resolution of possible performance issues discovered in Management Pack lab pass. (2 Days)
3. Deploy Management Pack to pre-production management group.
   a. Import the Management Pack into the pre-production management group.
   b. Review Management Pack in pre-production management group. (20 Days)

Review Alert Tuning
1. Gather results of pre-production run of Management Pack. (5 Days)
2. Sign off. (2 Days)

Total time: 57 Days

Define Roles and Responsibilities

The alert tuning process requires the cooperation and participation of a number of people from groups throughout IT. These teams work together in a virtual-team capacity to complete all activities. Some of the activities, for example, making change requests that are associated with the Management Pack, are only possible or relevant if the organization has access to the Management Pack development team. In short, these roles include:

IT operations. The IT operations team is responsible for the engineering, implementation, and support of monitoring and manageability infrastructure.

Development team. The development team is responsible for producing enterprise applications that are typically internally developed or that are highly customized or extended. This is an optional role that is only applicable when the Management Pack is developed by the same organization that is reviewing the Management Pack.

Service owner and subject matter experts (SMEs). The service owner and SME virtual team consists of one or more individuals from within the company who are responsible for engineering and upper-tier support of the underlying technology or product. This team is generally responsible for evaluating the technical accuracy of alerts and Management Packs from a qualitative perspective.

All the roles within the alert tuning process have particular tasks for which they are responsible. The following is a detailed list of these responsibilities according to role.

Responsibilities Shared by All Alert Tuning Roles

Establish the scope and exit criteria for the given Management Pack being reviewed. Establishing a common understanding of what work is going to be done during the review process and what criteria the review process will be based upon is vital to the success of the alert tuning process. The teams involved should agree on at least the following criteria prior to beginning the alert tuning process:

Resource commitment. An approximation of how much time per week each team can devote to the alert tuning process.

The scope of tuning. Based on the complexity of the Management Pack and the availability of each team, the alert tuning process can range from very limited to very involved. For Management Packs that contain no scripts and for which IT operations cannot provide technical expertise, the review process might not include isolated lab testing or qualitative review by a service owner or by SMEs. For a Management Pack that contains many scripts, and for which IT operations has many teams that will depend on the service’s monitoring, the alert tuning process might require multiple stages of isolated lab testing; it will also require more time for a complete evaluation of the Management Pack. The complexity of the Management Pack needs to be fully understood up front by all teams; additionally, the scope of the alert tuning process needs to be agreed upon by all teams involved.

The exit criteria. This needs to be established so that all teams know what they are accountable for delivering at the end of the alert tuning process. This can include, but not be limited to, an overall evaluation of the alert-to-ticket ratio (ATR) that the Management Pack provides, the commitment that the development team will have toward resolving outstanding bugs and improvements based on severity and priority, the number of agents to which the Management Pack will be deployed, and the decision of whether IT operations will implement all or portions of the Management Pack into production after the alert tuning process.

Establish the timeline for the alert tuning process. All teams should agree to the start and end dates of the alert tuning process. While this initial schedule can be altered as needed, and as agreed upon by all teams, it is necessary to agree on a general timeline.

Follow standard communications. Representatives from each team must subscribe to the Management Pack alert e-mail distribution list, which will be configured as a notification group in the pre-production MOM infrastructure. Representatives will receive all alerts in the form of e-mails.

Responsibilities for IT Operations

Coordinate the overall alert tuning process. IT operations is generally responsible for leading the alert tuning process. This includes facilitating recurring meetings and discussions over the course of alert tuning. IT operations is responsible for sending out regular status reports during the alert tuning process, which outline outstanding issues and progress to date. There will be instances where project-manager representatives from the development teams fulfill all or part of these responsibilities.

Perform isolated lab analysis. Provide analysis of the Management Pack being reviewed in an isolated lab prior to it being deployed into the pre-production environment to detect possible performance issues. This step is primarily performed on Management Packs that contain scripts. In instances where this step is deemed necessary, IT operations will import the Management Pack into an isolated lab environment and run the Management Pack against a limited set of agents to ensure that the Management Pack does not introduce any adverse performance impacts.

Configure and support the pre-production MOM infrastructure. IT operations is responsible for ensuring the following are in place with respect to the pre-production MOM infrastructure:

The pre-production MOM infrastructure is in working order.

All necessary MOM agents are members of the pre-production management group.

The necessary version of the Management Pack being reviewed is imported in a timely manner and is properly configured according to any release notes or other documentation. This can come from a vendor or can be developed internally.

All necessary notification groups have been configured within MOM to allow alert forwarding via e-mail to the relevant consumers for review of the alerts. IT operations will need to set up a distribution list for people to join to receive the alert e-mails that are forwarded to the notification group.

There will be some instances in which the service owners maintain their own pre-production MOM infrastructure. If they host the Management Pack being reviewed within their infrastructure, the responsibilities listed herein would pertain to the service owners.

Provide support for the service owners and subject matter experts to make customizations to the Management Pack when required. IT operations is responsible for implementing changes in a timely manner to a given Management Pack whenever a change to a component is deemed necessary. IT operations is responsible for documenting what change was made and providing justification for the change. Additionally, IT operations must, wherever relevant, submit a bug or change request associated with the Management Pack. This allows the development team to review and possibly incorporate the change into the default configuration of the Management Pack.

Document and advocate bugs and change requests associated with the Management Pack. As the service owners and SMEs are reviewing the Management Pack, they might raise concerns with various aspects of it. The IT operations team will triage the validity of the issue being raised and present the issue to the alert tuning project team for consideration. Where applicable, IT operations will generate a bug or a change request to get resolution on the issue addressed in the Management Pack. There might be instances where the service owners or SMEs are comfortable in leading this entire process.

Provide ongoing quantitative data of the Management Pack. During the course of the alert tuning process, IT operations is responsible for archiving the necessary information from the pre-production management group to provide ongoing and historic quantitative data for the day-to-day review of Management Pack performance. IT operations is also responsible for providing a means for quick analysis of this information.

Provide the final quantitative analysis of the Management Pack. At the end of the alert tuning process, IT operations is responsible for providing a final quantitative analysis of both the impact that the Management Pack has had on the MOM infrastructure and the overall quality and acceptability of the Management Pack.

Responsibilities for the Development Team

Provide technical representation from the development or application team responsible for developing or managing (in COTS) the underlying technology that the Management Pack is monitoring. In order to ensure that proposed bugs and changes for a given Management Pack can be addressed in a timely manner, a representative from the development or application teams responsible for the underlying technology is required. It is the responsibility of the development team’s representative to ensure that the internal team is involved or that an alternative arrangement is made.

Supply up-to-date copies of specification and model documents at the beginning of the alert tuning process. The development team will provide the alert tuning team with an initial copy of the most recent specification and model documents, and will need to give a timely response to feedback provided to these documents.

Provide continuous up-to-date copies of the Management Pack under review. The development team is accountable for providing the alert tuning team with an initial copy of the Management Pack being reviewed, as well as copies of the updated Management Pack in a timely manner at least once during the review process. A typical alert tuning process, which reviews a Management Pack of low-to-moderate complexity, will require at least two builds of an internally developed Management Pack: the initial build, and one build halfway through the process that incorporates changes based on IT operations’ feedback to that date. If the Management Pack is received from a vendor, the most recently updated version should be acquired.

Communicate feedback criteria and expectations to IT operations, service owners, and SMEs to ensure that the development team is receiving sufficient feedback. To ensure that the alert tuning process is as beneficial as possible, the development team is accountable for providing initial direction concerning the feedback that they are looking for. Likewise, the development team should continuously provide guidance on the feedback needed over the course of the alert tuning process. Wherever possible, all feedback criteria and expectations should be communicated and agreed upon in the beginning and documented in the Alert Tuning Exit Criteria. A sample template for status reports can be found in Appendix C: Template for Project Status Reports.

Responsibilities for the Service Owner and Subject Matter Experts

Provide technical representation from the relevant support teams to ensure the best possible technical review is given to the Management Pack. The service owners are responsible for finding at least one SME who will be able to dedicate the necessary effort to be involved in the alert tuning process. The time commitment of a SME is approximated as follows:

Two to five hours for initial orientation meetings and reading of the alert tuning process documentation

Eight hours for the complete review of the Management Pack specification documents

One hour per day for the ongoing review of the Management Pack as it is running in the pre-production management group

Four to eight hours for the final qualitative analysis of the Management Pack

In some instances, the service owner will also be a SME and will therefore satisfy all necessary roles.

Provide ongoing qualitative review of the Management Pack. The primary role of the service owner and the SME is to provide qualitative review of the Management Pack and to raise issues they find based on their review. For the purposes of the alert tuning process, although reducing alert volume is the primary goal, all aspects of the Management Pack are open to review and discussion. SMEs are encouraged to review and propose changes to all attributes provided within the Management Pack.

Provide the final qualitative assessment of the Management Pack. The service owners and SMEs are responsible for providing final qualitative assessment of the Management Pack. This includes a detailed analysis of each rule that has generated an alert during the course of the alert tuning process. The analysis will also include the evaluation of the rule’s actionability, validity, quality of the knowledge-base content, suppression, and other relevant feedback.

Develop Isolated and Pre-Production Lab Environment

The lab environment needs to be in place prior to the alert tuning review processes. This environment is illustrated in Figure 4.

Figure 4. Alert tuning infrastructure for isolated and pre-production lab environment

Isolated Lab Environment

The isolated lab environment is used to exercise the agent components of a new Management Pack in order to analyze the impact on the managed node. Ideally, the isolated lab environment would be a scaled-down version of the production MOM client and server infrastructure. This typically has the following configuration:

MOM server with MOM Management Server and database

Two or three managed nodes representing the operating systems and applications (applicable to the Management Pack) used in production. For example, if there are many servers running Windows Server™ 2003 and Windows 2000 in the production environment, there should be at least one of each type in the isolated lab.

The lab network should be isolated.

All agents in the isolated lab should be running the application service or technology that the Management Pack monitors.

The agents should be undergoing as close to no load as possible. The purpose of this test is not to see how the Management Pack performs when load is introduced but rather to understand how it affects a nearly idle system. This allows for better isolation.

Pre-Production Lab Environment

The pre-production lab environment is used to evaluate the quality of the Management Pack. This typically has the following configuration:

Install and Deploy Alert Tuning Reports

Overview

There are three reports connected with alert tuning. One displays the number of alert counts between two dates based on various conditions, such as for a Management Pack or a combination of Management Pack and computer group. Another report provides information about the alerts that are frequently generated, their names, and descriptions. A third report displays the total number of alerts for a Management Pack and MOM for ranges within different weeks. Further information about each of these reports is contained in Appendix D: Alert Tuning Reports.
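To illustrate the kind of query that sits behind the first report, the following is a minimal sketch, not the shipped report logic. It assumes a hypothetical Alert table in the OnePoint database with Name and TimeRaised columns; verify those names against your schema, or rely on the installed stored procedures (listed later in this section), whose behavior is defined by the shipped SQL scripts.

# Illustrative sketch only: counts alerts raised between two dates in the
# OnePoint database. The table and column names (Alert, Name, TimeRaised)
# are assumptions; the shipped reports use their own stored procedures.
import pyodbc

def alert_counts_between(server, start_date, end_date, database="OnePoint"):
    # Windows authentication is assumed; adjust the driver and credentials as needed.
    conn = pyodbc.connect(
        f"DRIVER={{SQL Server}};SERVER={server};DATABASE={database};"
        "Trusted_Connection=yes"
    )
    sql = """
        SELECT Name, COUNT(*) AS AlertCount
        FROM Alert
        WHERE TimeRaised BETWEEN ? AND ?
        GROUP BY Name
        ORDER BY AlertCount DESC
    """
    rows = conn.cursor().execute(sql, start_date, end_date).fetchall()
    return [(row.Name, row.AlertCount) for row in rows]

if __name__ == "__main__":
    # Server name reused from the installation example later in this section.
    for name, count in alert_counts_between("ffl-na-mom-01", "2004-11-01", "2004-11-08"):
        print(f"{count:6d}  {name}")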

This section provides step-by-step procedures to install the Alert Tuning Reports application onto a local machine. The application can be installed onto a server by providing, at the time of installation, the name of the server and the databases from which the application procures data. Once deployed, the reports can be viewed on the Web.

Installing Alert Tuning Reports

Before installation and deployment, ensure that:

The server running MOM is installed and configured.

The OnePoint database is set up and running.

The report server is installed.

The Alert Tuning Reports application is installed using the AlertTuningReports.msi provided to the user.

To install the Alert Tuning Reports application

1. Double-click AlertTuningReports.msi. The Alert Tuning Reports wizard begins.

2. On the Welcome page of the wizard, click Next.

3. On the Select Installation Folder page, browse to the location where the application is to be installed (for instance, C:\Alert Tuning Reports). Indicate who the installation is for by selecting either Just me or Everyone. If the path field is left blank, the application will be installed in the default path C:\Program Files\Microsoft\Alert Tuning Reports\.

4. On the Confirm Installation page, click Next.

5. On the Enter Installation Details page, in the text box type the name of the server hosting the OnePoint database (for example, ffl-na-mom-01). If the MOM database is not OnePoint, type the correct database name.

6. If the MOM database is not running on the default instance of SQL Server, select the Select an Instance check box.

7. Select the instance name from the drop-down list, and then click Install.

8. On the Installation Complete page, click Close.

Validating Installation of Alert Tuning Reports

To verify successful installation of the reports

1. Navigate to the location where the application is installed. There should be a folder called MSMReports that contains the following files and folders:

ReportScripts – a folder containing the SQL script for the stored procedures to be uploaded onto the server
AlertCountByDates.rdl
AlertCountByDevice.rdl
AlertCountByProcessingRules.rdl
AlertTuningReports.rptproj
AlertTuningReports.rptproj.user
AlertTuningReports.sln
AlertTuningReports.suo
OnePoint.rds

2. From Control Panel, in Add or Remove Programs, check for Alert Tuning Reports.

3. Make sure the following stored procedures are present in the OnePoint database that the Alert Tuning Reports are using:

sp_AlertByProcessRules
sp_AlertCountByDevices
sp_GetNumberOfDays
sp_GetTopLevelComputerGroups
sp_PrintdaysBetweenStartDateandEndDate

4. Make sure the following user-defined function is present in the OnePoint database: fn_ProcessRuleGroupsMembers
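For environments with several management groups, this presence check can be scripted. The following is a minimal sketch (not part of the Solution Accelerator) that queries the standard INFORMATION_SCHEMA.ROUTINES view for the stored procedures and the user-defined function listed above; the server name and the use of Windows authentication are assumptions to adapt to your environment.

# Illustrative check that the Alert Tuning Reports routines exist in OnePoint.
import pyodbc

EXPECTED = {
    "sp_AlertByProcessRules",
    "sp_AlertCountByDevices",
    "sp_GetNumberOfDays",
    "sp_GetTopLevelComputerGroups",
    "sp_PrintdaysBetweenStartDateandEndDate",
    "fn_ProcessRuleGroupsMembers",
}

def missing_routines(server, database="OnePoint"):
    # Windows authentication is assumed; adjust the driver or credentials as needed.
    conn = pyodbc.connect(
        f"DRIVER={{SQL Server}};SERVER={server};DATABASE={database};"
        "Trusted_Connection=yes"
    )
    rows = conn.cursor().execute(
        "SELECT ROUTINE_NAME FROM INFORMATION_SCHEMA.ROUTINES"
    ).fetchall()
    installed = {row.ROUTINE_NAME for row in rows}
    return EXPECTED - installed

if __name__ == "__main__":
    missing = missing_routines("ffl-na-mom-01")  # server name from the install example
    if missing:
        print("Missing routines:", ", ".join(sorted(missing)))
    else:
        print("All Alert Tuning Reports routines are present.")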

Deploying Alert Tuning Reports

To view the reports on a Web page, the reports must be deployed on the report server. For information on how to do this, either proceed to the following steps, or refer to Reporting Services Books Online, available at http://www.microsoft.com/sql/reporting/.

To deploy the reports

1. Log on to the server that hosts the report server.

2. Open Report Manager. Or, from Internet Explorer, type the address of the site as http://localhost/Reports if you are accessing the reports from a local machine. Otherwise, if you are accessing them remotely, type the address of the site as http://<machinename>/Reports. This opens the following home page.

Figure 5. Report Manager home page

3. Click New Folder. To create a new folder with the name “Alert Tuning Reports,” type Alert Tuning Reports in the Name text box. Type an appropriate description, and then click OK.

4. On the home page, click the Alert Tuning Reports link, and then click Upload File.

5. In the File to upload box, browse to the location where you have installed the application, and select the report definition (RDL) file. For example, choose AlertCountByDates.rdl to upload the Alert Count By Dates report. In the Name box, type the appropriate name as shown in the following screenshot, and then click OK.

Note The three report RDLs must be uploaded individually. Uploading all the RDLs together is not supported.

Figure 6. Uploading the RDL file

6. From the home page that appears once the RDL is uploaded, click New Data Source. The following screenshot appears. (Steps 7 to 10 indicate the details regarding the data source that must be supplied to create the data source for the reports.)

Figure 7. Creating the data source

7. In the Name box, type the name of the data source (for example, OnePoint), and in the Description box, type a description if needed.

8. From the Connection Type drop-down list, set the connection to Microsoft SQL Server, and in the Connection String box, type the connection string used in the OnePoint.rds file. It should be data source=<SQL Server>; initial Catalog=OnePoint.

9. Click Credentials stored securely in the report server, and type the credentials (user name and password) of a user who has privileges to access the database.

10. Select the Use as Windows credentials when connecting to the data source check box, and then click Apply.

11. From the home page, click Alert Tuning Reports. From the Alert Tuning Reports page, click Show Details.

Figure 8. Showing details of the uploaded report

12. Click the Edit icon of the uploaded report.

13. Click the Properties tab. On the Report Properties page, click Data Sources on the left side of the screen. In the Location box, browse to and then associate the created data source (OnePoint) with this report, as shown in the following screenshot.

Figure 9. Associating the data source with the RDL file

14. Click OK, and the following screen appears.

Figure 10. The data source has been associated with the RDL file.

All the uploaded reports will need to be associated with the data source using the process just explained.

Validating Deployment of Alert Tuning Reports

To verify whether the reports have been successfully deployed

1. Log on to the report server.

2. Navigate to the site http://localhost/Reports. There should be a folder named Alert Tuning Reports.

3. Click Alert Tuning Reports. The following page will appear.

Figure 11. Alert Tuning page with the reports and data source uploaded

Uninstalling Alert Tuning Reports

To uninstall the Alert Tuning Reports application

1. In Control Panel, click Add or Remove Programs. Click Remove a program, and from the Currently Installed Programs list, click Alert Tuning Reports, and then click Remove.

2. When prompted to remove the reports, click Yes.

3. Navigate to the location where Alert Tuning Reports was installed. There should be only one file (reportconfigurer.InstallState) in the folder.

4. Remove the file manually.

This will remove Alert Tuning Reports from the server where it was installed and not from the report server. To remove the reports from the report server, follow the guidance provided in Reporting Services Books Online, available at http://www.microsoft.com/sql/reporting/.

Health Model/Health Specification Creation and Review

Overview

The second step in implementing alert tuning is the creation and review of the Health Model and Health Specification. The objective of this process is to perform a manual validation of the event lists and alerts as defined in the Health Model and Health Specification. The value of this activity is in its holistic view of the instrumentation from a service-owner perspective. This establishes a common understanding of how the Management Pack will function, and provides a first-round “validation on paper” of the strategy applied to the Management Pack.

This step may not be applicable in all situations, such as a vendor-provided Management Pack for which a Health Specification is not available. However, in many cases, IT operations might make substantial changes or additions to the vendor-provided Management Packs, and these changes should be reviewed in this step. Also, in the case of framework applications, development teams might create applications or extensions to the frameworks, and Management Packs are then created for the new functionality.

Creation Activities

The Health Model defines what it means for a system to be healthy (operating within normal conditions) or unhealthy (failed or degraded) as well as the transitions in and out of such states. Good information on a system’s health is necessary for the maintenance and diagnosis of running systems. The contents of the Health Model become the basis for system events and instrumentation on which monitoring and automated recovery are built. All too often, system information is supplied in a developer-centric way, which does not help the administrator know what is going on. Monitoring becomes unusable when this happens, and real problems become lost. The Health Model helps to determine what kinds of information should be provided and how the system or the administrator should respond to the information.

Users want to know at a glance if there is a problem in their systems. Many ask for a simple red or green indicator to identify a problem with an application or service, security, configuration, or resource. From this alert, they can then further investigate the affected machine or application. Users also want to know that when a condition is resolved or no longer true, the state will return to “OK.”

Creation of the Health Model includes the following activities:

1. Document all management instrumentation exposed by an application or service.

2. Document all service health states and transitions that the application can experience when running.

3. Determine the instrumentation—events, traces, performance counters, and Windows Management Instrumentation (WMI) objects and probes—necessary to detect, verify, diagnose, and recover from bad or degraded health states.

4. Document all dependencies, diagnostic steps, and possible recovery actions.

5. Identify which conditions will require intervention from an administrator.

6. Improve the model over time by incorporating feedback from customers, product support, and testing resources.

The Health Model is initially built from the management instrumentation exposed by an application. By analyzing this instrumentation and the system-failure modes, Service Monitoring and Control (SMC) can identify where the application lacks the proper instrumentation.
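The following is a minimal, purely illustrative sketch of how the documented health states, transitions, and supporting instrumentation might be captured as structured data during this activity. The component name, state names, event IDs, and recovery steps shown are hypothetical placeholders, not part of any shipped Health Model or Management Pack.

# Illustrative structure for recording a Health Model during this activity.
from dataclasses import dataclass, field
from typing import List

@dataclass
class HealthTransition:
    # A real Health Model documents every transition and the instrumentation
    # (events, traces, counters, WMI probes) that detects it.
    from_state: str
    to_state: str
    detected_by: List[str]          # for example, event IDs or performance counters
    diagnostic_steps: List[str]
    recovery_actions: List[str]
    requires_administrator: bool

@dataclass
class HealthModel:
    component: str
    states: List[str]
    transitions: List[HealthTransition] = field(default_factory=list)

# Hypothetical entry for an illustrative service component
model = HealthModel(
    component="Order Processing Service",
    states=["Healthy", "Degraded", "Failed"],
)
model.transitions.append(HealthTransition(
    from_state="Healthy",
    to_state="Degraded",
    detected_by=["Event ID 2001 (queue backlog)", "Queue Length counter above threshold"],
    diagnostic_steps=["Check connectivity to the downstream database"],
    recovery_actions=["Restart the service if the backlog persists"],
    requires_administrator=True,
))
print(f"{model.component}: {len(model.transitions)} documented transition(s)")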

Further information about the Health Model is contained in the Design for Operations white paper available at http://www.microsoft.com/windowsserver2003/techinfo/overview/designops.mspx.

It is common for an IT organization to purchase commercial off-the-shelf (COTS) software. A set of documented information that is identical to the Health Model also needs to be created for all COTS software. However, because COTS software is not developed internally, the term Health Specification is used here to differentiate it from the Health Model. The Health Specification material for COTS software is created by IT operations (such as the SMC staff) and not developers, and it is designed for COTS software and other purchased service components. In some instances, COTS software is accompanied by its MOM Management Pack, and the documentation of the Management Pack in this case serves as its Health Specification. If COTS software is not accompanied by its Management Pack, it requires staff from IT operations to manually create documentation based on the observed behavior of the COTS software once it is installed in the operational environment.

Review Activities

Conduct a thorough review of the Health Specification and/or Health Model to check for compliance with service-monitoring standards, accuracy, and actionability. Organize review teams and obtain the Health Specification or Health Model for the Management Pack that will be tuned. The following review activities need to occur:

1. Conduct an initial review of the Management Pack design and its approach, using the model as a basis. The development team performs this activity for internally created applications, whereas the operations team performs it for vendor Management Packs (for COTS).

2. Conduct a cursory review of the material for field overloading, which is the misuse of fields and attributes. This step will make sure that these are used appropriately. For example, the Name and Description fields should not contain unique identifiers, and unique identifiers should be unique and organized.

3. Conduct a line-by-line review of the material, giving special attention to the following areas:

   a. Names. Ensure they make sense and are applicable to the condition they are used for.

   b. Event IDs. Make sure they are not duplicated in this or any Management Packs that might be used together.

   c. Any documented suppression. Validate that it makes sense and applies correctly to the situation it is used for.

   d. Descriptive fields. Make sure the text is understandable and provides adequate information.

4. Conduct a review of the associated knowledge-base material, and assess its completeness and actionability.

5. Include the results of this activity in the Health Model, Health Specification, Management Pack, or any of its supporting documentation.

Isolated Lab Validation

Overview

The third step in implementing alert tuning involves using the isolated lab to test the effects of the Management Pack on the agent. The actual test duration in the lab is typically three days, although this can vary depending on resources and on Management Pack size and complexity. If there are conditions that require retesting or further investigation, the run might also be longer. This stage is important not only for optimization; it also serves to protect the pre-production environment used in the next stage.

Validation Activities

The following activities are performed in the isolated lab that was created in the Alert Tuning Preparation step:

1. Install the Management Pack into the isolated lab.

2. Manually tune the Management Pack script frequency to once per minute:

   a. On the MOM Operator console, click each rule group in the Management Pack.

   b. Go into each respective rule and sort the list by the Response column.

   c. Look at the properties for all rules that have script responses.

   d. If the provider type is a timed event, alter the frequency to every one minute.

3. Use System Monitor to capture the behavior of specific instances on the agents, as shown in Table 3.

Table 3. Capturing the Behavior of Instances on Agents

Performance Object    Instance                       Counter
Process               MOMService                     Working Set
Process               MOMService                     Private Bytes
Process               MOMService                     % Processor Time
Process               MOMService                     Handle Count
Process               All (Any) MOMHost Instances    Working Set
Process               All (Any) MOMHost Instances    Private Bytes
Process               All (Any) MOMHost Instances    % Processor Time
Process               All (Any) MOMHost Instances    Handle Count
Processor             All                            % Processor Time

Response to Deviation (applies to every counter listed): After the deviation has been investigated, feedback should be given to the development team, if the Management Pack or application was created internally, or to IT operations, if the Management Pack was created by the vendor for a COTS product.

4. Analyze the results of the MOM server health counters, as in Table 4, and share them with the review team. Any significant anomalies should be reported to the development team for tuning of the Management Pack. The development team should review these anomalies and correct any scripts before proceeding to the pre-production stage. Particular MOM server counters should be monitored on the Management Server in the isolated lab to assess how much MOM data is generated by the Management Pack in the lab. The Db Alert Insert Simple Count counter will provide a general sense of the quantity of certain types of data the Management Pack will produce. The Queue Space Percent Used counter will illustrate how well the isolated lab environment can process the data. The queue percentage should remain consistently low; if it does not, this could be an indicator that the Management Pack could overwhelm the production environment.

Note This evaluation is to be balanced with the fact that script execution is at an abnormally frequent interval.

Table 4. Analyzing the Results

Counter          Analysis

Working Set

The Working Set values should have no steady increase.

There should be no exorbitant usage.

For example, in a lab with a server running Microsoft Windows Server 2003, and with a Dual Pentium III-800 using the Microsoft Active Directory® Management Pack, over a three-day viewing interval, the value should be seen as a flat line. (If sufficiently zoomed-in or observed over a short time interval, this would look like a very wavy line.) The value should be approximately 15 megabytes (MB). However, this value can be justifiably higher based on the number of scripts in the Management Pack.

Private Bytes

The Private Bytes values will have an initial peak and then will stabilize with a horizontal line over the three-day time span.

There should be no sustained deviation. A line resembling a stair step followed by a horizontal line, and then repeating with an upwards progression, indicates a possible memory leak.

% Processor Time

The % Processor Time values will vary depending on actual hardware infrastructure. However, they should be low, around three percent to ten percent. Exceeding this range means that the agent components might adversely affect the overall performance.

The individual values should be compared against Processor (all instances); if Processor (all instances) is low overall, the conditions might be normal. However, if the individual values are high compared to the overall, it might indicate a problem with the Management Pack.

Handle Count

The Handle Count values will fluctuate immensely. At the initiation of a Management Pack script, they will increase, but they should then fall proportionately at the completion of script execution when handles are released.

Over time, if the Handle Count values continue to increase, this may indicate that the scripts are not properly releasing handles. Tracking their mean values over time is one measure of the existence of this problem.

These results should be used to create performance baselines for agent behavior. A baseline is important to determine anomalies in the pre-production and production environments.
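As one way to apply the heuristics in Table 4, the sketch below reads a counter log exported from System Monitor as CSV and flags counters whose values trend steadily upward over the run, the pattern Table 4 associates with memory or handle leaks. The file layout assumed here (a timestamp column followed by one column per counter) matches a typical System Monitor CSV export, but column names and sampling intervals will differ per environment, so treat this strictly as an illustrative starting point rather than part of the Solution Accelerator.

# Illustrative analysis of an exported counter log for sustained increases.
import csv
import sys

def sustained_increase(values, window=10):
    """Return True if consecutive window averages keep rising, a rough
    indicator of a leak rather than normal fluctuation."""
    if len(values) < window * 2:
        return False
    chunks = [values[i:i + window] for i in range(0, len(values) - window + 1, window)]
    averages = [sum(chunk) / len(chunk) for chunk in chunks]
    return all(later > earlier for earlier, later in zip(averages, averages[1:]))

def analyze(csv_path):
    with open(csv_path, newline="") as f:
        reader = csv.reader(f)
        header = next(reader)                 # first column is the sample time
        series = {name: [] for name in header[1:]}
        for row in reader:
            for name, value in zip(header[1:], row[1:]):
                try:
                    series[name].append(float(value))
                except ValueError:
                    pass                      # skip blank or non-numeric samples
    for name, values in series.items():
        if sustained_increase(values):
            print(f"Possible leak pattern: {name}")

if __name__ == "__main__":
    analyze(sys.argv[1])                      # path to the exported counter log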

Pre-Production Lab Review

Overview

The fourth step in implementing alert tuning is validating the pre-production environment. The objective of the pre-production validation is to assess and refine the Management Pack using stimulus in order to tune and reduce alerts that are not valid or actionable (noise).

Review Criteria

The primary means of assessing and refining a Management Pack is to put it into pre-production and then to review the alerts that are produced by the Management Pack. These actions ensure that the alerts effectively and accurately monitor for a given service. Not every alert that the Management Pack produces needs to be triaged from start to finish. However, this process is intended to confirm that each rule that is generating alerts is properly configured to watch for and provide an alert on actionable and valid issues. In addition, this process is intended to ensure that, for any given failure that would merit the creation of a ticket, ideally only one alert is generated by any given rule within the Management Pack. Each rule should be evaluated according to the following criteria:

Actionability. An alert is actionable if it tells you what went wrong and how to fix it. In order to be actionable, the alert text (such as the subject line and description) and the related knowledge-base content must do the following:

Provide a concise description of what failure has occurred.

Contain information that is precise and does not mislead; also, the name and description of the alert should be meaningful and make sense.

Give clear steps to identify and resolve the issue.

The underlying alert must be a condition that requires some action on the part of an administrator. If an alert is telling you that “everything is ok” or that “something has failed but it will fix itself,” the Management Pack should not be generating alerts on it.

Actionability is defined at the rule level; if one instance of an alert from a rule is found to be actionable, all alerts generated by that rule are considered actionable. A non-actionable alert would be missing one or more of the criteria just listed.

Validity. An alert is valid if the following are true:

The alert generated by the rule raises an issue that can be confirmed at the moment of the alert. Alerts that report something that occurred sometime in the past are not valid.

The alert generated raises an issue that has in fact occurred. If the alert's text indicates that a device is offline, but upon further investigation when the issue is triaged the actual state of the device does not support the alert, the alert is not valid.

Validity is something that can vary from alert to alert. A rule could potentially catch events that have only a degree of validity. In these instances where a rule is found to be periodically invalid, the issue should be raised as a bug against the rule, and documentation of valid and invalid instances of the alert should be provided.

Suppression. For any given failure that is to be detected by a Management Pack, there should be one, and only one, alert generated by the Management Pack stating that an issue has occurred. If the same rules generate multiple alerts that all point to the same central failure, effective suppression has not been achieved, and the issue should be addressed.

Table 5 offers a general guide on what action to take based on the result of the three evaluation criteria for the alerts generated by a rule.

Table 5. Qualitative Review Characteristics and Actions

Actionable; Suppression: Good

Valid – Characteristic: This is the target state that alert tuning is striving for. Prescription: There is no issue.

Invalid – Characteristic: Even if an alert is actionable and has good suppression, it must be valid. Prescription: Triage alert instances. Submit a bug to create better logic regarding when to provide an alert about these events.

Actionable; Suppression: Bad

Valid – Characteristic: If the rule is a performance threshold rule, it is possible that an alert is valid and actionable, but that the threshold value is too lax or too sensitive. If that is the case, the threshold should be re-evaluated and, if appropriate, the change should be proposed to the development team. Prescription: Alter the suppression rules to allow for better duplicate alert suppression; submit a bug to have the Management Pack altered.

Invalid – Characteristic: Even if an alert is actionable and has good suppression, it must be valid. Prescription: Triage alert instances. Submit a bug to create better logic regarding when to provide an alert about these events.

Non-Actionable; Poor Documentation

Valid – Characteristic: The alert is valid, but the alert’s subject line, description, and/or knowledge base content do not effectively explain the source of the problem and/or how to resolve the issue. Prescription: A bug should be submitted to the IT operations or development team to resolve this issue.

Invalid – Characteristic: A non-actionable, invalid alert is in the worst condition. Prescription: Disable alerting and/or the entire rule, and submit a bug to the IT operations or development team.

Non-Actionable; Don’t Care: Informational

Valid – Characteristic: The alert informs you of a state that is worth knowing about, but which requires no repair. If the alert does not correlate to something that requires action, there should not be an alert generated. An example of this is an alert that states “Backup has completed successfully.” These are events that you should store in MOM, but they do not require anyone’s action when they occur. Prescription: Alerting should be disabled for the rule, and a bug should be submitted to the IT operations or development team.

Invalid – Characteristic: A non-actionable, invalid alert is in the worst condition. Prescription: Disable alerting and/or the entire rule, and submit a bug to the IT operations or development team.

Non-Actionable; Don’t Care: Unimportant

Valid – Characteristic: The alert informs you of a state that is not worth knowing about and requires no repair. Prescription: The rule should be disabled, and a bug should be submitted to the IT operations or development team.

Invalid – Characteristic: A non-actionable, invalid alert is in the worst condition. Prescription: Disable alerting and/or the entire rule, and submit a bug to the IT operations or development team.
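The decision logic in Table 5 can also be expressed as a small helper for keeping triage notes consistent across reviewers. The sketch below simply mirrors the table's prescriptions; the parameter names and category labels are chosen here for readability and are not taken from any MOM tooling.

# Illustrative mapping of the Table 5 evaluation to a recommended action.
def prescription(valid, actionable, suppression_ok=True, nonactionable_reason=None):
    """nonactionable_reason is one of: 'poor_documentation', 'informational',
    'unimportant' (only used when the rule is valid but non-actionable)."""
    if not valid:
        if actionable:
            return ("Triage alert instances; submit a bug to create better logic "
                    "regarding when to provide an alert about these events.")
        return ("Disable alerting and/or the entire rule, and submit a bug to the "
                "IT operations or development team.")
    if actionable:
        if suppression_ok:
            return "There is no issue; this is the target state."
        return ("Alter the suppression rules to allow for better duplicate alert "
                "suppression; submit a bug to have the Management Pack altered.")
    if nonactionable_reason == "poor_documentation":
        return ("Submit a bug to the IT operations or development team to improve "
                "the alert text and knowledge base content.")
    if nonactionable_reason == "informational":
        return ("Disable alerting for the rule, and submit a bug to the IT "
                "operations or development team.")
    return "Disable the rule, and submit a bug to the IT operations or development team."

# Example: a valid, actionable rule that generates duplicate alerts for one failure
print(prescription(valid=True, actionable=True, suppression_ok=False))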

Preparation for Evaluation

Using the pre-production environment created in the Alert Tuning Preparation step, perform the following activities:

1. Install the Management Pack into the pre-production environment.

2. Configure performance counters on selected client machines to collect counters identical to the isolated lab.

3. Manually configure the Management Pack to e-mail results to a mail public folder.

4. Create an alert rule that will forward all alerts in e-mail to the corresponding public folder of the Management Pack. This step should be completed for every rule group in the Management Pack containing rules that will generate alerts. This is performed so that there is a convenient and common repository of alert details during this process. All the alert tuning roles should be granted access to this mail public folder. Perform the following actions to complete this step:

   a. Create a new alert rule. The Alert Processing Rule Properties wizard will begin.

   b. Select the box next to Only match alerts generated by rules in the following group. This setting will automatically resolve to Generated by this rule group. Click Next.

   c. Click Next again to accept Always processing data.

   d. In the response section, click Add, and then select Send a notification to a notification group.

   e. Click the Notification tab, and select the appropriate notification group from the drop-down list.

   f. Click the E-mail Format tab, and select Custom e-mail format.

   g. Change the subject to read $Alert Name$ [$Domain$\\$Computer$] and leave the Message section as is.

   h. Click OK to accept the notification steps, click Next, and then click Next again to skip over the Knowledge Base section.

   i. Name the rule “Alert Tuning Forward Alerts to Public Folder,” and then click Finish.

Evaluation and Analysis Activities

The activities associated with evaluating and analyzing the Management Pack in the pre-production environment are illustrated in Figure 12. The conclusion of the cycle is determined up front by the Alert Tuning project team and is based on time and exit criteria. A description of these activities is integrated into the comprehensive steps that are presented in this section.

Figure 12. Evaluation and analysis activities

Step 1. Accessing the “Alert Stream” Generated by a Management Pack

The primary impetus for this phase of the Management Pack review process is the alerts that the Management Pack generates. By reviewing these alerts, it will become apparent which rules need to be reviewed and refined.

Once a Management Pack has completed the isolated lab validation and is confirmed to be suitable for deployment into the pre-production environment, a mail public folder will be set up to which the alerts from the Management Pack will be forwarded.

Installing and Using the MOM Operator Console

Now that alerts are being sent to the reviewers, they will need to get access to the MOM Operator console. This allows them to read further details about the alerts and their associated events, and to validate the associated knowledge-base content.

To grant reviewer access to the MOM Operator console

1. Ensure that the participants are part of the MOM Pre-Production Security group as well as the Users and Operators groups.

2. Run the installation for the MOM Operator console. Refer to the MOM 2005 User Guide for detailed instructions on this installation.

Step 2. Prioritizing What to Evaluate

When prioritizing what to tune in a newly installed Management Pack, rather than tediously stepping through each e-mailed alert and validating them one by one (this discussion assumes that a common method of sending alerts is through e-mail), approach alerts on a rule-by-rule basis. Move from the highest overall alert generator to the lowest. To see which rules are in fact generating the most alerts, review the alert tuning reports bundled with this guide. Further information is contained in Appendix D: Alert Tuning Reports.

To view the alert tuning reports

1. Verify that the correct input parameters are set. For example, for the Alert Count by Processing Rules report, make sure you have chosen the correct start and end dates and the correct Management Pack to investigate.

2. The Alert Count by Processing Rules report shows the count of raised and total alerts broken down per rule and listed in descending order, from the highest number of raised alerts to the lowest. When approaching the alert mail being forwarded, focus on alerts from the top “raised alert” generators, and move down the list from there. The total alert count is the number of alerts generated by MOM without regard to suppression; it represents a raw count of alerts raised by a rule. With proper suppression, MOM can detect identical alerts and suppress the subsequent instances after the first one has been received. The number of alerts remaining after suppression has been applied is the raised alert count.

   Negative alerts (alerts that indicate a correction or “good” state) raised by rules that have state monitoring enabled will have a status of “No Problem.” State-monitoring alerts and conditions should not be included in the alert tuning process.

3. Supplementary reports are also included to help manage the pre-production environment and to aid in prioritization and opportunity targeting. These reports include:

   1. Alert Count by Device. This report is used to rank alert volumes generated by each machine participating in the pre-production environment. It is important for detecting abnormalities in the pre-production environment due to problematic systems or scheduled deployments. The abnormalities could also be the result of a known change. These abnormalities can generate excessive volume that could skew the results of the Alert Count by Processing Rules report.

   2. Alert Count by Day. This report is used to rank the alert volumes generated on a daily basis for the hosts participating in the pre-production environment. It is important for detecting abnormalities in the pre-production environment because of scheduled deployments, or a known change or fix. For example, if the sampling period is one week, and there is a known patch rollout on Wednesday and Thursday, the alerts generated by rules on the specific systems receiving a patch for those two days should be discounted.
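
As a complement to these reports, the same prioritization logic can be sketched outside MOM. The following Python example assumes a hypothetical export of alert records (rule name, computer, timestamp); it ranks rules by alert volume while discounting alerts raised on specific systems during a known change window, as described for the Alert Count by Day report. The record layout and host names are illustrative and are not produced by MOM itself.

# Sketch: rank rules by alert volume, discounting a known change window.
# Assumes a hypothetical list of alert records exported from MOM; the field
# names and values are illustrative, not a MOM interface.
from collections import Counter
from datetime import datetime

alerts = [
    {"rule": "MOM Agent Status Monitoring", "computer": "HOST01",
     "time": datetime(2004, 3, 10, 5, 40)},
    {"rule": "Disk Free Space Low", "computer": "HOST02",
     "time": datetime(2004, 3, 11, 2, 15)},
    # ... more exported alert records ...
]

# Known patch rollout: discount alerts from patched hosts on these days.
patched_hosts = {"HOST02"}
change_window = (datetime(2004, 3, 10), datetime(2004, 3, 12))

def counted(alert):
    in_window = change_window[0] <= alert["time"] < change_window[1]
    return not (in_window and alert["computer"] in patched_hosts)

ranking = Counter(a["rule"] for a in alerts if counted(a))
for rule, count in ranking.most_common():
    print(f"{count:6d}  {rule}")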

Step 3. Triaging a Given Rule’s Alerts

Using the report, find the rule that generates the highest number of alerts. Make sure that this rule has not yet been reviewed and that it has no outstanding issue based on the report.

To triage the rule’s alerts

1. Open one of the alerts generated by the rule. This can be found in the alert sent to the public folder or distribution group, as in the following sample alert e-mail. (A sketch for parsing these forwarded e-mails appears after this procedure.)

===============================================
From: mom-e@examplecompany.com [mailto:mom-e@examplecompany.com] 
Sent: Tuesday, March 10, 2004 5:40 AM
To: MP Reviewer
Subject: ::MOM MP:: MOM Agent Status Monitoring [PREPROD\HOST01]
 
Severity:  Error
Status:  New
Source:  Microsoft Operations Manager
Name:  MOM Agent Status Monitoring
Description:  The agent service on Computer HOST01 in domain PREPROD
may be unavailable.  The agent on this computer failed to heartbeat 
but it did respond to a ping within the allotted time.  The last 
heartbeat received from the agent was 2/10/2004 05:25:11.
Domain:  PREPROD
Agent:  HOST01
Time:  3/10/2004 05:40:20
Owner: 
(view with <Web server not defined. Object ID is {C4AFC9A4-C8B6-43C6-8B0C-
887493839CA1}>)
===============================================

2. Find the alert in the MOM Operator console. Now that there is an instance of an alert from the rule in question, the next step is to look the alert up in the MOM Operator console to get complete details on the issue.

   1. Open the MOM Operator console by going to Start -> All Programs -> Microsoft Operations Manager 2005 -> MOM Operator console. The MOM Operator console will open to the default view of Alerts – All.

   2. From the tabular menu on the right-hand side of the screen, change from the Views tab to the My Views tab. Here you will create a custom view that shows just the alert you are interested in, based on the globally unique identifier (GUID) of the alert.

   3. Right-click somewhere in the empty space of the My Views tab. When the context menu appears, select New -> Alerts View. The Alert View Properties wizard will open.

   4. In the Which type of alert view do you want to create list, select Alerts that satisfy specified criteria, and then click Next.

   5. In the Which alerts do you want to view list, scroll down and select with specified GUID. Ensure that all other check boxes are cleared. In the bottom section of the window, the line “with specified GUID” will appear. Click the specified link. In the text box, paste the alert’s GUID from the alert e-mail. (The GUID can be found at the very end of the sample alert e-mail, where it reads “Object ID is {<GUID>}.”) After you have pasted the GUID into the text field, click OK, and then click Next.

   6. Type whatever name and description you want to give the view, and then click Finish. The view has now been created and added to your custom views.

   7. In the tree view within My Views, click the view you just created to see the alert. The result screen should look like the following.

Figure 13. Alert details

   8. Once an alert has been selected, further details become available in the lower section of the screen, including properties, events, and knowledge-base information. Review the alert name and the details forwarded with the alert.

   9. Confirm that the name and the description of the alert are meaningful and make sense. In addition, confirm that the events associated with this alert should indeed be associated with it. Try to answer the question “Is this alert actionable?” and confirm that the details of the alert are not too vague. Check to see if this alert is actually a standalone failure or if it can be associated with another failure or alert. This would be a factor of effective correlation.

   10. Check the rule configuration to see if it has Enable State Alert Properties flagged. This indicates that the rule is used for state monitoring. Check the logic for the state change and validate its accuracy.

      Note: This step requires that reviewers be MOM administrators and know how to use the MOM Administrator console. These properties cannot be viewed through the MOM Operator console.

   11. Refer to the qualitative review criteria for validity and suppression in the “Pre-Production Lab Review” section, earlier in this document.

3. Review the knowledge base associated with the alert. One of the greatest benefits of using MOM, and specifically the MOM Operator console, is that knowledge-base content can be directly associated with the alerts being generated. This knowledge-base content is provided by the development groups who develop the Management Pack; it comes ready to use with the Management Pack. Read through the knowledge-base content provided, and ensure it answers the following questions:

   What is the problem?

   How bad is it, will it get worse, and how does it affect the health or effectiveness of the product or computer?

   What are the steps to confirm the failed state?

   What are the steps to resolve the failed state?

   If any one of these points is missing from the knowledge base, the alert is not actionable, and a bug needs to be submitted against the Management Pack to have the knowledge-base content altered.

4. Take the steps prescribed in the knowledge base to triage and resolve the issue.

5. Once you have looked through the knowledge base and generally confirmed its validity, go through it once more, and follow the steps prescribed to triage the issue on the machine that generated the alert. In addition, after confirming that the issue exists, follow the proposed resolution steps from the knowledge base.

6. If you find issues at any point while you are looking at the knowledge base, the alert is considered “not actionable.” This should be raised as an issue to the Alert Tuning team (IT operations), and a bug should be created against the Management Pack.

7. Refer to the qualitative review criteria for actionability and suppression in the “Pre-Production Lab Review” section, earlier in this document.

Once the actionability, validity and suppression have been determined, refer to the MOM 2005 documentation for detailed instructions on how to change alert suppression and the alert knowledge base, and on how to disable or delete a rule.
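
Because triage starts from the forwarded alert e-mail, it can help to pull the key fields out of the message automatically. The following is a minimal Python sketch that parses an e-mail in the format of the sample shown in step 1 and extracts the alert name, computer, and the GUID needed for the “with specified GUID” view; the parsing logic and the saved file name are assumptions based on that sample, not a MOM interface.

# Sketch: extract fields from a forwarded MOM alert e-mail.
# The field layout is assumed from the sample e-mail shown earlier; the
# multi-line Description field is not handled here.
import re

def parse_alert_mail(body: str) -> dict:
    fields = {}
    for key in ("Severity", "Status", "Name", "Domain", "Agent"):
        match = re.search(rf"^{key}:\s*(.+)$", body, re.MULTILINE)
        if match:
            fields[key.lower()] = match.group(1).strip()
    # The GUID appears at the end of the message: "Object ID is {<GUID>}".
    # The GUID may wrap across lines, so strip any whitespace inside it.
    guid_match = re.search(r"Object ID is \{([^}]+)\}", body)
    fields["guid"] = re.sub(r"\s+", "", guid_match.group(1)) if guid_match else None
    return fields

with open("alert_mail.txt") as f:      # hypothetical saved copy of the e-mail
    alert = parse_alert_mail(f.read())
print(alert["name"], alert["agent"], alert["guid"])  # paste the GUID into the view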

Step 4. Review Completion

The trigger to conclude the review is time-based. This is agreed upon up front during the Alert Tuning Preparation step. At the end of the review, the team should collect all artifacts for final analysis.

Preparation and Deployment Review

Overview

The fifth step in implementing alert tuning is conducting the Preparation and Deployment review. The objective is to perform a final analysis of all results and to prepare for deployment of the tuned Management Pack into production.

Review Activities (Exit Criteria)

Using the results from the pre-production environment, analyze the final report for the following conditions:

Alert-to-Ticket Ratio (ATR)

Maximum acceptable performance levels for an agent affected by the Management Pack

Percentage of the Management Pack that will be put into production

Alert-to-Ticket Ratio (ATR)

The ATR metric-based criteria are as follows:

What is the maximum number of alerts that a Management Pack can generate that corresponds to one trouble ticket being created?

What is the target percentage of rules that meet the ATR?

The ATR value is derived from the pre-production qualitative assessment. Alerts that are valid, have good suppression, and are actionable are considered to have an ATR of 1:1. This means that after the Management Pack is implemented in production, each alert generated by its rules will result in a unique ticket. Alerts that are valid but have less than ideal suppression or actionability will have a less optimal ATR value, such as 2:1 or 3:1.

The suggested ATR value is no more than two alerts for a given ticket. The suggested target percentage of rules that meet ATR is 90 percent or higher. The ATR metric is only applicable to rules that generate alerts. This needs to be accounted for when calculating ATR metrics.

This metric result provides workload guidance in preparation for the production implementation of the Management Pack. For example, if the Management Pack generally succeeds and achieves the 1:1 ATR, alerts will require minimal additional MOM operator handling. However, if alerts for the Management Pack are determined to be valid and actionable but do not meet the 1:1 ATR (for example, 2:1 or another acceptable value), MOM operators will have to perform additional investigation and ticketing for each incident.
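
To make the ATR arithmetic concrete, the following Python sketch computes the per-rule ATR from alert and ticket counts and checks the suggested exit criteria (no more than two alerts per ticket for at least 90 percent of alert-generating rules). The rule names and counts are made up for illustration; in practice they come from the pre-production assessment and the ticketing system.

# Sketch: compute per-rule ATR and check the suggested exit criteria.
# Counts below are illustrative only.
per_rule = {
    # rule name: (alerts raised, tickets created)
    "MOM Agent Status Monitoring": (12, 12),   # 1:1
    "Disk Free Space Low":         (20, 10),   # 2:1
    "Service Restart Detected":    (30,  6),   # 5:1
}

MAX_ATR = 2.0          # suggested: no more than two alerts per ticket
TARGET_PERCENT = 90.0  # suggested: 90 percent or more of rules meet the ATR

meeting = 0
for rule, (alerts, tickets) in per_rule.items():
    atr = alerts / tickets if tickets else float("inf")
    ok = atr <= MAX_ATR
    meeting += ok
    print(f"{rule}: ATR {atr:.1f}:1 {'meets' if ok else 'misses'} target")

percent = 100.0 * meeting / len(per_rule)
print(f"{percent:.0f}% of alert-generating rules meet the ATR target "
      f"(target: {TARGET_PERCENT:.0f}%)")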

Maximum Acceptable Performance Levels

Over the duration of the pre-production run for the Management Pack, performance counters should be captured for the key MOM services on the agents to which the Management Pack applies. If these counters exceed baseline values, further investigation may be needed to confirm that the usage is within acceptable bounds. Keep in mind that baselines from the pre-production environment differ from the isolated lab results because the clients used in the pre-production environment also carry production-level workloads. The isolated lab also has the script frequency tuned to run every minute, which is very different from the settings used in production.
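
As an illustration of this baseline comparison, the following Python sketch flags captured counter averages for an agent that exceed the pre-production baseline values. The counter names, baseline figures, and tolerance are assumptions for the example, not values supplied by MOM.

# Sketch: flag agent performance counters that exceed baseline values.
# Counter names, baselines, and the 20% tolerance are illustrative only.
baseline = {
    "MOM Service % Processor Time": 5.0,
    "MOM Service Private Bytes (MB)": 60.0,
}
observed = {
    "MOM Service % Processor Time": 7.5,
    "MOM Service Private Bytes (MB)": 58.0,
}
TOLERANCE = 1.20  # investigate anything more than 20% above baseline

for counter, base in baseline.items():
    value = observed.get(counter)
    if value is not None and value > base * TOLERANCE:
        print(f"Investigate {counter}: {value} vs. baseline {base}")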

Percentage of Management Pack to Be Put into Production

Based on the exit review, the service owners should project what percentage of the Management Pack they intend to put into production. This includes the following activities:

Issues stemming from actionability should result in the Management Pack being corrected in its naming, description, or knowledge base.

Issues stemming from validity should result in the rule being corrected or disabled where appropriate.

Issues stemming from suppression should be corrected prior to production implementation.

The existing agent configuration in the pre-production environment can be removed or uninstalled after the review cycle at the discretion of the Alert Tuning team. This pre-production agent base can be kept in use for future cycles of the alert tuning process.

Deployment of the Tuned Management Pack

Once you are comfortable with the performance of the Management Pack, the sixth step is to export it from the pre-production environment and import it to the production environment. It is not necessary to uninstall the multihomed agent from the production environment, but you can do so if you deem it necessary. Follow the MOM 2005 documentation for information on how to uninstall a multihomed agent.

Run-Time Alert Tuning

Overview

The seventh and final step in implementing alert tuning is conducting run-time reviews and improvements. The objective is to provide ongoing tuning of the Management Pack once it has been introduced to a production environment, through ongoing assessment, tuning optimization and feedback to development. Although the activities are mostly identical to the pre-production stage (and thus the details are not repeated in this section), the run-time review process runs on an as-needed basis, driven by the IT organization’s policy for continuous improvement.

For more information about what to consider during the run-time review, see the Service Monitoring and Control SMF, available at http://www.microsoft.com/technet/itsolutions/cits/mo/smf/smfsmc.mspx.

Run-Time Alert Tuning Activities

The activities associated with the run-time review are sequenced in Figure 14. The timing of this continuous cycle is determined by the IT operations team within its internal operating level agreements (OLAs) and service monitoring policies.

Figure 14. Run-time alert tuning activities

Step 1. Conduct Regularly Scheduled Reviews

Even though the Management Pack has been reviewed in the alert tuning process, it can still generate a fair number of alerts when it is first installed in a full-scale production environment. However, this number will not be nearly as high as it would be if the Management Pack had not been reviewed.

The review process in production requires close interaction between various personnel in IT operations. The reviews are primarily based on operator feedback. Operators in their daily duties are fully aware of the alerts that are problematic or that require additional tuning. The reviews can be conducted on a monthly basis, depending on volume.

Step 2. Assess for Validity, Actionability, and Suppression

After discussing operator-observed conditions, investigate the rules that might require tuning, including the following steps:

1. Find the alert in the MOM 2005 Operator console. Now that there is an instance of an alert from the rule in question, the next task is to look up the alert in the MOM 2005 Operator console to get complete details on the issue. Perform the following actions:

   1. Review the alert and the details of the instance.

   2. Review the details of the alert. Confirm that the name and the description of the alert are meaningful and make sense. In addition, confirm that the events associated with this alert should indeed be associated with it. The key is to review the alert from the perspective of a person who has little expertise in supporting this technology. Try to answer the question “Is this alert actionable?” and confirm that the details of the alert are not too vague. Check to see if this alert is actually a standalone failure or if it can be associated with another failure or alert. This would be a factor of effective suppression.

   3. Check the rule configuration to see if it has Enable State Alert Properties flagged. This indicates that the rule is used for state monitoring. Check the logic for the state change, and validate its accuracy.

   4. Refer to the qualitative review criteria for validity and suppression in the “Pre-Production Lab Review” section, earlier in this document.

2. Knowledge-base review. Review the knowledge base associated with the alert.

   One of the greatest benefits of using MOM, and specifically the MOM Operator console, is that knowledge-base content can be directly associated with the alerts being generated. This knowledge-base content is provided by the development groups who develop the Management Pack; it comes ready to use with the Management Pack. Read through the knowledge-base content provided, and ensure it answers the following questions:

   What is the problem?

   How bad is it, will it get worse, and how does it affect the health or effectiveness of the product or computer?

   What are the steps to confirm the failed state?

   What are the steps to resolve the failed state?

   If any one of these points is missing from the knowledge base, the alert is not actionable; a bug needs to be submitted against the Management Pack to have the knowledge-base content altered. The following actions need to be performed:

   1. Take the steps prescribed in the knowledge base to triage and resolve the issue.

   2. Once the steps in the knowledge base have been followed and the alert’s validity is generally confirmed, follow the prescribed steps again to triage the issue on the machine that generated the alert. After confirming that the issue exists, follow the proposed resolution steps from the knowledge base.

   3. If you find issues at any point while you are looking at the knowledge base, the alert is considered “not actionable.” This should be raised as an issue to the Alert Tuning team (IT operations), and a bug should be created against the Management Pack.

   4. Refer to the qualitative review criteria for actionability and suppression in the “Pre-Production Lab Review” section, earlier in this document.

Step 3. Provide Feedback

During run time, IT operations (Service Monitoring and Control) will typically perform the previous activities. The results of the analysis should be presented to the Service Monitoring and Control team for review and further analysis. Feedback will then be used for the next SMC Implement cycle. This helps ensure continuous improvement of the Management Pack and MOM infrastructure. Examples of improvement can include removal of specific alert conditions, additional suppression, or editing of alert attributes such as the description or the knowledge base.

Appendices

Appendix A: Resources

Overview of Microsoft Operations Framework (MOF) and Microsoft Solutions Framework (MSF), available at http://www.microsoft.com/technet/itsolutions/cits/mo/mof/default.mspx

Service Monitoring and Control (SMC) SMF, available at http://www.microsoft.com/technet/itsolutions/cits/mo/smf/smfsmc.mspx

Appendix B: Key Performance Indicators

The following statistics should be reviewed to understand the performance of SMC as well as to identify opportunities for improvement. Each value is mapped over predefined timeframes (such as daily, weekly, or monthly).

Alert-to-Ticket Ratio (ATR). This is a key statistic that indicates the quality of SMC alerts. The goal is to achieve a 1:1 ratio between alerts and tickets. This indicates that each alert is valid and has a well-defined and well-documented problem set associated with it.

Number of Tickets with No Alerts. A high count of tickets with no alerts is an indication that monitoring missed critical events. This statistic can be used as a starting point for improving instrumentation and rules.

Number of Events per Alert. As rules and correlation improve, this count should increase. Often, multiple events are triggered; however, there is typically only one true source of the issue. A high events-per-alert count can also indicate opportunities for reducing the number of exposed events.

Number of Invalid Alerts. Alerts that are generated with incorrect fault determination should be carefully reviewed and corrected. The number of invalid alerts might increase during the initial deployment of new infrastructure components and services; however, it should drastically decrease with better rules and event filtering.

Number of Non-Actionable Alerts. This refers to alerts that are generated with insufficient descriptions or poor documentation that does not allow a corrective action to be determined. The number of non-actionable alerts might increase, especially during the introduction of a new Management Pack; however, it should drastically decrease (ideally to zero) after the alert tuning process.
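
The following Python sketch shows how several of these indicators might be derived from exported alert and ticket records over a reporting period. The record layout is an assumption for illustration and is not a MOM or ticketing-system interface.

# Sketch: derive a few SMC key performance indicators from exported records.
# The record layout and values below are assumed for illustration.
alerts = [
    {"id": "A1", "events": 3, "valid": True,  "actionable": True,  "ticket": "T1"},
    {"id": "A2", "events": 5, "valid": False, "actionable": True,  "ticket": None},
    {"id": "A3", "events": 1, "valid": True,  "actionable": False, "ticket": "T2"},
]
tickets = ["T1", "T2", "T3"]  # T3 was opened with no corresponding alert

ticketed = {a["ticket"] for a in alerts if a["ticket"]}
atr = len(alerts) / len(ticketed) if ticketed else float("inf")
tickets_without_alerts = len(set(tickets) - ticketed)
events_per_alert = sum(a["events"] for a in alerts) / len(alerts)
invalid = sum(not a["valid"] for a in alerts)
non_actionable = sum(not a["actionable"] for a in alerts)

print(f"ATR: {atr:.1f}:1")
print(f"Tickets with no alerts: {tickets_without_alerts}")
print(f"Events per alert: {events_per_alert:.1f}")
print(f"Invalid alerts: {invalid}")
print(f"Non-actionable alerts: {non_actionable}")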

Appendix C: Template for Project Status Reports

Appendix D: Alert Tuning Reports

The Alert Tuning Solution Accelerator contains three reports that provide information on the alert volumes produced by MOM. These values are used to identify the tuning opportunity for a given processing rule. They can also be used to improve accuracy of alert tuning by quantifying anomalous conditions during the tuning cycle. The reports included are:

Alert Count By Processing Rules

Alert Count By Dates

Alert Count By Device

This appendix gives instructions on how to navigate and access the reports and their data.

Accessing the Reports

After installing and deploying the reports by following the prescriptive guidance given in the “Install and Deploy Alert Tuning Reports” section, view the reports at http://localhost/reports.

Figure 15. Home page

Double-click Alert Tuning Reports, and it will display the three bundled reports. Click Show Details, and the following Web page appears.

Figure 16. Alert Tuning Reports page

The features visible in the preceding figure are used to control or change the properties of the folder. For example, to delete the folder, select the check box at the far left of the screen, and then click Delete above it. The Edit option opens the following page, where you can change the security- and name-related properties of the reports folder.

Figure 17. Alert Tuning Reports Properties page

All the reports contain the count of raised alerts and total alerts for different scenarios. These counts are defined as follows:

The raised alert count represents the number of distinct alerts that MOM captures.

The total alert count includes the repeat counts of all the distinct alerts, in addition to the raised alert count.
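
A small example can make the distinction between the two counts concrete. The following Python sketch uses made-up alert records with MOM-style repeat counts to compute both values; the record fields are illustrative rather than an actual report query.

# Sketch: raised vs. total alert counts, using MOM-style repeat counts.
# Each record is one raised (distinct) alert plus the number of suppressed
# duplicates recorded against it; the records themselves are made up.
raised_alerts = [
    {"rule": "MOM Agent Status Monitoring", "repeat_count": 4},
    {"rule": "MOM Agent Status Monitoring", "repeat_count": 0},
    {"rule": "Disk Free Space Low",         "repeat_count": 9},
]

raised_count = len(raised_alerts)                                # 3 distinct alerts
total_count = sum(1 + a["repeat_count"] for a in raised_alerts)  # 3 raised + 13 repeats = 16

print(f"Raised alerts: {raised_count}, total alerts: {total_count}")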

Alert Count By Processing Rules

The Alert Count By Processing Rules report displays the count of raised and total alerts that a rule captures between a start and end date. The rules are filtered on the basis of the Management Pack that the user selects. To go to this report, click the AlertCountByProcessingRules link on the Alert Tuning Reports page. The screen shows an Enter Start Date box, an Enter End Date box, a Select Management Pack drop-down box, and a View Report button.

To view the report

1. Type a valid start date in the Enter Start Date box (mm:dd:yyyy format).

2. Type a valid end date in the Enter End Date box (mm:dd:yyyy format).

3. Select one of the Management Packs that are listed in the Select Management Pack box.

Figure 18. AlertCountByProcessingRules Data Input page

4. Click View Report.

The results are displayed with three columns, as shown in the following screenshot:

Processing Rule. The name of the processing rule, under the selected Management Pack, that generated alerts between the chosen start and end dates.

Raised Alerts. The number of alerts raised after suppression has been applied.

Total Alerts. The number of alerts generated without regard to suppression, including the repeat counts of the raised alerts.

The rows are sorted by raised alerts in descending order.

Figure 19. AlertCountByProcessingRule Results page

Alert Count By Dates

The Alert Count By Dates report gives the distribution of the raised alerts and the total alerts for a particular Management Pack for all the days between the chosen start and end date. The distribution is done initially based on the week ranges, but it is also possible to drill down to the day ranges. Moreover, this report gives a comparative picture of the number of alerts raised for a particular Management Pack and the number raised for all the Management Packs taken together for a day. To view this report, click the AlertCountByDates link on the Alert Tuning Reports page.

To run and view this report

1. In the Enter Start Date box, type a start date (mm:dd:yyyy format).

2. In the Enter End Date box, type an end date (mm:dd:yyyy format).

3. From the Select Management Pack drop-down list, select the Management Pack for which you want to see the report. This list contains all the Management Pack names that are associated with the MOM server to which the application is pointing.

Figure 20. AlertCountByDates Data Input page

4. Click View Report at the top-right corner.

5. The default view of the report is shown in the following figure. It contains the week ranges between the start and the end date; it also shows the total number of alerts for the selected Management Pack and for all the Management Packs together.

Figure 21. AlertCountByDates Results page

6. Click the plus (+) symbol next to any of the week ranges to expand it and to view the daily alert counts for the selected Management Pack and for all Management Packs taken together.

Figure 22. AlertCountByDates Results page, expanded

7. Click the Date column (for example, 6/16/2004) to navigate to the Alert Count By Processing Rules report. The report opens as shown in the following screenshot.

Figure 23. AlertCountByProcessingRules page from AlertCountByDates report

This report displays the alert details for the selected Management Pack on that date.

Alert Count By Device

The Alert Count By Device report gives the number of alerts raised by a computer that runs the MOM agent service. The agent computers for the MOM server are grouped into computer groups; each computer group consists of computers with similar properties. This report lists the computers on the basis of the computer group that is selected. It also shows the raised and total alerts for all the computers under the selected computer group, for a given Management Pack, between two selected dates. Clicking the AlertCountByDevice link on the Alert Tuning Reports page opens the page shown in the following figure.

To run this report

1. Select the Management Pack name from the Select Management Pack drop-down list.

2. Select a computer group from the Select Computer Group drop-down list.

3. In the Enter Start Date box, type a start date (mm:dd:yyyy format).

4. In the Enter End Date box, type an end date (mm:dd:yyyy format).

Figure 24. AlertCountByDevice Data Input page

5. Click View Report to the far right of the screen (not shown in the preceding figure) to view the report for the condition that you have chosen. The report appears as follows.

Figure 25. AlertCountByDevice Results page

The Computer column shows all the computers under the selected computer group that have raised alerts between the start and end dates and for the selected Management Pack. The rows in this report are sorted on the basis of the raised alerts in descending order.


 
