I l @ ve RuBoard |
This book is intended for system administrators and operators who are responsible for maintaining the integrity and availability of mission-critical UNIX systems. The book provides a description of the fault monitoring tools and techniques available for UNIX servers, including systems that are configured as high availability clusters. This book can therefore be a handy quick reference for an operator trying to troubleshoot a problem in the customer environment, by pointing out where to find key diagnostic messages and describing how to take recovery actions. A system administrator responsible for the initial configuration and administration of UNIX systems will also find this book useful because it describes the procedures to follow to set up the appropriate levels of system monitoring. The product descriptions can also help in making purchasing decisions as the customer determines the appropriate amount of event monitoring needed in their environment. An overview of the tasks performed by an operator is provided, with details on how events are received and processed . The remainder of the book focuses on the types of events that can be received, how they are detected , how operators receive event notifications, and how problems can be investigated and recovery performed. The goal is to introduce the necessary tools, but not to show how every possible problem can be solved . This book provides numerous descriptions of how fault management tools and products can be used to solve a variety of problems. Many of the chapters are focused on specific computer components , such as disks or databases, to be helpful to operators with specific roles. Here is a description of the individual chapters:
The Glossary contains the important terms used in the book, and their definitions. Although it is assumed that most customers concerned about fault management will implement high availability solutions, this book does not describe how to create highly available computing environments. Readers needing additional information on high availability may check Hewlett-Packard's external Web site on high availability (http://www.hp.com/go/ha) or read Clusters for High Availability by Peter Weygant. In general, this book does not discuss the configuration and installation of the hardware and software components of your UNIX system. You should rely on your vendors' product manuals for this. Many of the examples used in this book were created on HP-UX servers. Other UNIX platforms behave similarly, and we note when tools are supported only on certain UNIX platforms. |
I l @ ve RuBoard |