|Chapter 10 - Monitoring Exchange|
|Monitoring and Managing Microsoft Exchange 2000 Server|
|by Mike Daugherty|
|Digital Press 2001|
The goals of System Health Monitoring are:
Rapid detection and notification of actual failures . This allows the administrator to restore the system to proper working order as quickly as possible, thus minimizing the user impact.
Problem resolution . Monitoring provides detailed knowledge of problem, giving the administrator an important head start for restoring the system.
Predicting impending failures . This enables the administrator to correct the impending problem before it affects users.
At a minimum, the scope of system health monitoring should include the Exchange server hardware, Windows 2000 operating system software, Exchange 2000 software, antivirus software, backup software, IIS, and any other software that is essential for reliable operation of the messaging environment. If the network operations staff is not already monitoring key network components such as DNS servers and SMTP relay systems, these should be added to the list of monitored components.
Exchange server events can be generated from system-detected events written to the Windows 2000 event log, events detected by system and network management tools, or events generated by Exchange management software agents as a result of a monitored state change or threshold violation.
Important messaging system health indicators are not all generated by messaging system monitors . The following sources of information are often the first place where a potential problem can be seen.
Events written to the Windows 2000 event log . Critical events should be sent to the monitoring console. This may be done by configuration of the Windows 2000 event log to generate SNMP traps to the management systems, or by use of management agents that generate alerts using a proprietary or custom mechanism.
Events detected by system and network management tools . Often, events detected by agents monitoring the underlying systems or network can have a direct impact on the Exchange messaging environment. The Exchange operations team needs to be informed of critical events detected by system management tools (e.g., Compaq Insight Manager) or by network monitoring tools (e.g., OpenView Network Node Manager). This will allow correlation of system and network faults with Exchange alerts. This correlation of events can be done either manually or automatically using event correlation software.
Exchange 2000 state changes and threshold violations . Events generated by the Exchange management software agents because of detection of a monitored state change or threshold violation may indicate a server fault or a resource that is being depleted.
The process of monitoring the Exchange messaging environment includes periodically checking the Exchange 2000 services, servers, queues, connections, and underlying network for problems, and then alerting the appropriate people to correct the problems. Out-of-the-box, the Windows 2000 Server and Exchange 2000 products come with the following primary monitoring components:
Exchange System Manager
Windows 2000 Performance Monitor
Windows 2000 Event Viewer