System action: By default, the SAS RAID Module system information indicator is turned on and the operator is not notified of this alert by e-mail. Alert clears when the other controller is removed or restarted.
Operator response: Remove or reboot the other controller.
This alert can be masked so that it does not trigger the SAS RAID Module system information indicator.
Controller critical alerts
The alerts in this section are generated when a critical failure has occurred.
1103 Controller critical failure detected - baseboard management controller failed
Explanation: The controller continually monitors the health of the Baseboard Management Controller (BMC)
subsystem and controls the power for the controller. If this subsystem fails, the controller might not be able to control indicator lights, check environmental conditions, store critical state information, and retrieve system identity data.
When this alert is generated, the controller can no longer continue normal operation.
System action: By default, the SAS RAID Module system information indicator is turned on and the operator is notified of this alert by e-mail. If the failure occurs in a redundant controller, the remaining controller enters survivor mode and assumes control of all disk drives and associated LUNs while the failed controller enters service mode. If this failure occurs on a controller that is already in survivor mode, the controller attempts to flush all dirty cache and orderly dismount all volumes prior to entering service mode.
Operator response: Reboot the controller. All alerts are cleared. The controller reevaluates the system health status and regenerates all alerts that apply. If alert 1103 is generated again after the reboot, use one of the following procedures to replace the RAID Controller.
Replace the RAID Controller in a dual controller configuration using the RAID Controller command line interface
Perform the following steps to concurrently replace a single RAID Controller in a dual controller configuration using the RAID Controller command line interface (CLI).
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices that are connected to any of the four SAS ports on the RAID Controller that is being replaced.
3. Log into the CLI of the RAID Controller.
4. At the <CLI> prompt, enter alert –get and press Enter to query the active alert list and verify alert 1103 is in the list.
<CLI> alert -get Existing Alerts :
_____________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|============================================================================================================|
| 1103 | 5 | 20090928165948 | 0 | Warning | 5005076b07419fff | 0 | Unmasked |
|---|
| Msg: Controller critical failure detected – Baseboard management controller failed |
|============================================================================================================|
5. The controller that failed with the BMC failure will automatically prepare itself for service by rebooting to the SERVICE state. At the <CLI> prompt, enter alert –get and press Enter to verify alert 2100 is in the active alert list, or verify the amber LED on the failed controller is in the SOLID state.
5102 • 1103
<CLI> alert -get Existing Alerts :
_______________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|==============================================================================================================|
| 2100 | 6 | 20090930162105 | 0 | Info | 5005076b0741417f | 0 | Unmasked |
|---|
| Msg: Controller ready for service YK12J089A4BG |
|==============================================================================================================|
6. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
7. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
8. The replacement controller’s firmware levels will be automatically updated to the same firmware levels running on the survivor controller. Depending on the state of the system, this may take several minutes to complete 9. At the <CLI> prompt, enter alert -get and press Enter to verify alert 1103 is no longer in the active alert list.
10. Connect any externally attached devices that were previously connected to any of the four SAS ports on the RAID Controller.
11. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click Off next to Location. The blue chassis identification LED will turn off.
Replace the RAID Controller in a single controller configuration using the RAID Controller command line interface
Perform the following steps to non-concurrently replace a single RAID Controller in a single controller or survivor configuration using the RAID Controller command line interface (CLI).
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices that are connected to any of the four SAS ports on the RAID Controller that is being replaced.
3. Log into the CLI of the RAID Controller.
4. At the <CLI> prompt, enter alert –get and press Enter to query the active alert list and verify alert 1103 is in the list.
<CLI> alert -get Existing Alerts :
_____________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
1103
<CLI> alert -get Existing Alerts :
_______________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|==============================================================================================================|
| 2101 | 6 | 20090930162105 | 0 | Info | 5005076b0741417f | 0 | Unmasked |
|---|
| Msg: Controller ready for service - RAID system is down YK12J089A4BG |
|==============================================================================================================|
6. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
7. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
8. At the <CLI> prompt, enter alert -get and press Enter to verify alert 1103 is no longer in the active alert list.
9. Connect any externally attached devices that were previously connected to any of the four SAS ports on the RAID Controller.
10. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click Off next to Location. The blue chassis identification LED will turn off.
Replace the RAID Controller in a dual controller configuration using the IBM Storage Configuration Manager
Perform the following steps to concurrently replace a single RAID Controller in a dual controller configuration using the IBM Storage Configuration Manager.
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices connected to any of the four SAS ports on the controller that is being replaced.
3. Log into the IBM Storage Configuration Manager.
4. From the navigation panel, select BC-S SAS RAID Module → Health → Physical View.
5. Click Controllers.
6. Click the controller that generated alert 1103.
7. The controller that failed with the BMC failure will automatically prepare itself for service by rebooting to the SERVICE state.
a. Select the Alerts tab.
b. Click View alert and verify alert 2100 is in the list, or verify that the amber LED on the controller is in the SOLID state.
8. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
9. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
1103
10. The replacement controller’s firmware levels will be automatically updated to the same firmware levels running on the survivor controller. Depending on the state of the system, this may take several minutes to complete 11. Select the Properties tab and verify the controller status is Normal (Online).
12. Ensure that no additional warnings or critical alerts are generated for the replacement controller.
13. Connect any externally attached devices that were previously connected to any of the four SAS ports on the controller.
14. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click Off next to Location. The blue chassis identification LED will turn off.
Replace the RAID Controller in a single controller configuration using the IBM Storage Configuration Manager
Perform the following steps to concurrently replace a single RAID Controller in a single controller or survivor configuration using the IBM Storage Configuration Manager.
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices connected to any of the four SAS ports on the controller that is being replaced.
3. Log into the IBM Storage Configuration Manager.
4. From the navigation panel, select BC-S SAS RAID Module → Health → Physical View.
5. Click Controllers.
6. Click the controller that generated alert 1103.
7. The controller that failed with the BMC failure will automatically prepare itself for service by rebooting to the SERVICE state.
a. Select the Alerts tab.
b. Click View alert and verify alert 2101 is in the list, or verify that the amber LED on the controller is in the SOLID state.
8. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
9. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
10. Select the Properties tab and verify the controller status is Normal (Online).
11. Ensure that no additional warnings or critical alerts are generated for the replacement controller.
12. Connect any externally attached devices that were previously connected to any of the four SAS ports on the controller.
13. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
1103
1104 Controller critical failure detected - SES/SAS expander failed
Explanation: The controller continually monitors the health of the SES (System Enclosure Services) controller / SAS expander subsystem. If this subsystem fails, the controller might not be able to control indicator lights, check environmental conditions, and access drives. When this alert is generated, the controller can no longer continue normal operation.
System action: By default, the SAS RAID Module system information indicator is turned on and the operator is notified of this alert by e-mail. If the failure occurs in a redundant controller, the remaining controller enters survivor mode and assumes control of all disk drives and associated LUNs while the failed controller enters service mode. If this failure occurs on a controller that is already in survivor mode, the controller attempts to flush all dirty cache and orderly dismount all volumes prior to entering service mode.
Operator response: Reboot the controller. All alerts are cleared. The controller reevaluates the system health status and regenerates all alerts that apply. If alert 1104 is generated again after the reboot, use one of the following procedures to replace the RAID Controller.
Replace the RAID Controller in a dual controller configuration using the RAID Controller command line interface
Perform the following steps to concurrently replace a single RAID Controller in a dual controller configuration using the RAID Controller command line interface (CLI).
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices that are connected to any of the four SAS ports on the RAID Controller that is being replaced.
3. Log into the CLI of the RAID Controller.
4. At the <CLI> prompt, enter alert –get and press Enter to query the active alert list and verify alert 1104 is in the list.
<CLI> alert -get Existing Alerts :
_____________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|============================================================================================================|
| 1104 | 5 | 20090928165948 | 0 | Warning | 5005076b07419fff | 0 | Unmasked |
|---|
| Msg: Controller critical failure detected – SES/SAS expander failed |
|============================================================================================================|
5. The controller that failed with the SES/SAS expander failure will automatically prepare itself for service by rebooting to the SERVICE state. At the <CLI> prompt, enter alert –get and press Enter to verify alert 2100 is in the active alert list, or verify the amber LED on the failed controller is in the SOLID state.
<CLI> alert -get Existing Alerts :
_______________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|==============================================================================================================|
| 2100 | 6 | 20090930162105 | 0 | Info | 5005076b0741417f | 0 | Unmasked |
|---|
| Msg: Controller ready for service YK12J089A4BG |
|==============================================================================================================|
6. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
1104
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
8. The replacement controller’s firmware levels will be automatically updated to the same firmware levels running on the survivor controller. Depending on the state of the system, this may take several minutes to complete 9. At the <CLI> prompt, enter alert -get and press Enter to verify alert 1104 is no longer in the active alert list.
10. Connect any externally attached devices that were previously connected to any of the four SAS ports on the RAID Controller.
11. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click Off next to Location. The blue chassis identification LED will turn off.
Replace the RAID Controller in a single controller configuration using the RAID Controller command line interface
Perform the following steps to non-concurrently replace a single RAID Controller in a single controller or survivor configuration using the RAID Controller command line interface (CLI).
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices that are connected to any of the four SAS ports on the RAID Controller that is being replaced.
3. Log into the CLI of the RAID Controller.
4. At the <CLI> prompt, enter alert –get and press Enter to query the active alert list and verify alert 1104 is in the list.
<CLI> alert -get Existing Alerts :
_____________________________________________________________________________________________________________
|AlertCode| Id | Time | SlotID | Severity | WWN | Ackable | MaskState |
|============================================================================================================|
| 1104 | 5 | 20090928165948 | 0 | Warning | 5005076b07419fff | 0 | Unmasked |
|---|
| Msg: Controller critical failure detected – SES/SAS expander failed. |
|============================================================================================================|
5. At the <CLI> prompt, enter shutdown –ctlr <CONTROLLER 0 OR 1> –state servicemode –readytoremove and press Enter to prepare the controller with the SES/SAS expander failure for service. The SlotID associated with alert 1104 indicates which controller should be prepared for service. The controller that is prepared for service will reboot in the SERVICE state.
<CLI> shutdown -ctlr 0 -state servicemode -readytoremove Shutdown Command accepted.
6. At the <CLI> prompt, enter alert -get and press Enter to query the active alert list and verify alert 2101 is in the list, or verify the amber LED on the failed controller is in the SOLID state.
<CLI> alert -get
1104
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
8. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
b. Close the release handle (rotate the handle down).
9. At the <CLI> prompt, enter alert -get and press Enter to verify alert 1104 is no longer in the active alert list.
10. Connect any externally attached devices that were previously connected to any of the four SAS ports on the RAID Controller.
11. Log into the Advanced Management Module and turn off the blue chassis identification LED.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click Off next to Location. The blue chassis identification LED will turn off.
Replace the RAID Controller in a dual controller configuration using the IBM Storage Configuration Manager
Perform the following steps to concurrently replace a single RAID Controller in a dual controller configuration using the IBM Storage Configuration Manager.
1. Log into the chassis using the Advanced Management Module and illuminate the blue chassis identification LED to ensure the repair action is conducted on the correct chassis.
a. Log into the Advanced Management Module.
b. Click Monitors → LEDs → Media Tray and Rear Panel LEDs.
c. Click On or Blink next to Location. Depending on the selection, the blue chassis identification LED will be in the SOLID or BLINKING state.
2. Remove any externally attached devices connected to any of the four SAS ports on the controller that is being replaced.
3. Log into the IBM Storage Configuration Manager.
4. From the navigation panel, select BC-S SAS RAID Module → Health → Physical View.
5. Click Controllers.
6. Click the controller that generated alert 1104.
7. The controller that failed with the SES/SAS expander failure will automatically prepare itself for service by rebooting to the SERVICE state.
a. Select the Alerts tab.
b. Click View alert and verify alert 2100 is in the list, or verify that the amber LED on the controller is in the SOLID state.
8. Remove the RAID Controller.
a. Open the release handle on the RAID Controller (rotate the handle up) to disengage the RAID Controller from the bay.
b. Slide the RAID Controller out of the bay.
9. Install the replacement RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.
a. Slide the RAID Controller into the same bay you removed the old RAID Controller.