Sun Fire 15K/12K Auto Diagnosis and Recovery
This article describes the new System Management Services (SMS) 1.4 software features that enhance Sun Fire 15K/12K system availability. It is intended for support personnel who have a basic knowledge of Sun Fire 15K/12K systems.
In a mission-critical environment, high availability is achieved through system resiliency, appropriate configuration, serviceability, and efficient, automated restoration processes. The SMS 1.4 software enhancements address all of these elements and increase the availability, serviceability, and diagnosability of Sun Fire 15K/12K systems.
The SMS software can detect domain hangs or stops and recover from them by resetting and rebooting the domain. When a domain panics repeatedly, the power-on self-test (POST) runs at increasing diagnostic levels, which allows the system to identify and isolate persistent hardware faults.
Standardized messages, component health status, and automatic diagnosis are powerful features for users and service providers. When combined with dynamic reconfiguration (DR), automatic diagnosis on Sun Fire 15K/12K systems greatly increases availability and decreases the scheduled downtime for maintenance.
This article contains the following topics:
"Solaris OE Enhancements"
"About the Author"
"Ordering Sun Documents"
"Accessing Sun Documentation Online"
To take advantage of the availability enhancements in the SMS 1.4 software, the Sun Fire 15K/12K system controllers (SCs) and domains must run Solaris Operating Environment (Solaris OE) version 8 (2/02) or version 9 (12/03).
Additionally, you must install the following patches:
If you are using Solaris 8 OE (2/02), install patches 115829-01, 115831-01, and 108528-27 or newer.
If you are using Solaris 9 OE (12/03), install patch 112233-11 or newer.
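The patch requirements above can be checked mechanically. The following Python sketch is illustrative only: the function names and the REQUIRED table layout are hypothetical, and it assumes that "or newer" applies to every listed patch, meaning the same six-digit base number with an equal or higher two-digit revision. It compares a list of installed patch IDs against the patches named in this article and reports any that are missing or too old.

```python
# Illustrative sketch; helper names are hypothetical, not part of SMS or Solaris.
# Patch IDs are the ones listed in this article, keyed by Solaris OE version.
REQUIRED = {
    "8": ["115829-01", "115831-01", "108528-27"],
    "9": ["112233-11"],
}


def parse_patch(patch_id):
    """Split a patch ID such as '108528-27' into (base, revision)."""
    base, _, rev = patch_id.strip().partition("-")
    if not (base.isdigit() and rev.isdigit()):
        raise ValueError("not a patch ID: %r" % patch_id)
    return base, int(rev)


def installed_revisions(patch_ids):
    """Map each patch base number to the highest revision installed."""
    highest = {}
    for patch_id in patch_ids:
        base, rev = parse_patch(patch_id)
        highest[base] = max(rev, highest.get(base, -1))
    return highest


def missing_patches(solaris_version, installed_ids):
    """Return the required patches that are absent or older than required.

    Assumption: a higher revision of the same base number satisfies
    the requirement ("or newer").
    """
    highest = installed_revisions(installed_ids)
    missing = []
    for required in REQUIRED[solaris_version]:
        base, min_rev = parse_patch(required)
        if highest.get(base, -1) < min_rev:
            missing.append(required)
    return missing


if __name__ == "__main__":
    # Example data only; replace with the patch IDs found on the system.
    print(missing_patches("8", ["115829-01", "108528-26"]))
```

On a Solaris system, the list of installed patch IDs could be gathered from the output of showrev -p or patchadd -p. Parsing that output is not shown here, and the check would need to run on each SC and domain.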