DS8000 Service Documentation Version 7.5

MAP4F20 SRCs BE40020x unavailable resource recovery

This procedure returns an unavailable CEC enclosure or I/O enclosure to an operational state.

MAP4F20 Section-1

About this task

In Table 1, find the system reference code (SRC) from the serviceable event that sent you here, and perform the action listed.

Table 1. Actions for BE40020x SRCs
SRC       Definition                                                                     Go to
BE400201  CEC found unavailable during an I/O enclosure repair.                          MAP4F20 Section-2
BE400202  CEC found unavailable during a CEC RIO repair.                                 MAP4F20 Section-3
BE400203  I/O enclosure found unavailable during a CEC enclosure repair.                 MAP4F20 Section-4
BE400204  I/O enclosure found unavailable during an I/O enclosure power/cooling repair.  MAP4F20 Section-5
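
For readers who keep scripted service notes, the Table 1 lookup can be sketched as a small Python mapping. This is an illustration only: the names `SRC_TO_SECTION` and `section_for_src` are hypothetical and are not part of the HMC or any DS8000 tooling.

```python
# Hypothetical helper: maps each BE40020x SRC from Table 1 to its MAP4F20 section.
SRC_TO_SECTION = {
    "BE400201": "MAP4F20 Section-2",
    "BE400202": "MAP4F20 Section-3",
    "BE400203": "MAP4F20 Section-4",
    "BE400204": "MAP4F20 Section-5",
}


def section_for_src(src: str) -> str:
    """Return the MAP4F20 section for an SRC, or raise if the SRC is not in Table 1."""
    key = src.strip().upper()  # tolerate stray whitespace and lowercase input
    if key not in SRC_TO_SECTION:
        raise KeyError(f"SRC {key} is not covered by MAP4F20 Table 1")
    return SRC_TO_SECTION[key]
```

An SRC outside the BE400201 to BE400204 range raises an error rather than returning a default, because this MAP does not define an action for it.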

MAP4F20 Section-2

About this task

This procedure addresses a situation in which a failure in an I/O enclosure causes a CEC to become unavailable.

Use this procedure after the original I/O enclosure failure has been successfully repaired. This procedure makes the CEC available and restores the storage facility to dual-CEC operation.

  • You completed replacing one or more of the following I/O enclosure FRUs:
    • I/O enclosure PCIe/SPCN card
    • I/O enclosure I/O backplane assembly
    • I/O enclosure PCIe cable
  • The serviceable event that you just repaired has been automatically closed.
  • A new serviceable event was created with "SRC BE400201 = CEC found unavailable during an I/O enclosure repair." A CEC enclosure is unavailable, quiesced, or powered off.

Procedure

  1. Are there any open serviceable events with SRCs BE1E2167, BE1E2543, or BE1E2551?
    • Yes, do not repair these serviceable events; close them. These are expected when an I/O enclosure has serviceable events and a CEC enclosure is unavailable. Go to the next step.
    • No, go to the next step.
  2. Are there any other open serviceable events with CEC enclosure FRUs?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  3. The CEC enclosure must be reset using the CEC enclosure processor module exchange procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not actually replace the FRU.
    1. Read all these substeps before going to substep 2, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the CEC enclosure processor module from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.
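
The decision flow in the Section-2 procedure can be sketched as follows. This is a minimal illustration, not HMC code: the function name, the event dictionary keys `src` and `has_cec_fru`, and the returned action strings are all assumptions made for the example. The input list is assumed to exclude the BE400201 event that led to this MAP.

```python
# SRCs that the first step treats as expected side effects: close them, do not repair them.
EXPECTED_SRCS = {"BE1E2167", "BE1E2543", "BE1E2551"}


def section2_next_action(open_events):
    """Return (events_to_close, action) for a list of other open serviceable events.

    Each event is a dict with the hypothetical keys "src" (str)
    and "has_cec_fru" (bool).
    """
    # Expected SRCs are closed without repair.
    to_close = [e for e in open_events if e["src"] in EXPECTED_SRCS]
    remaining = [e for e in open_events if e["src"] not in EXPECTED_SRCS]

    # Any other open event that lists CEC enclosure FRUs must be repaired first.
    if any(e["has_cec_fru"] for e in remaining):
        return to_close, "exit MAP and repair open serviceable events"

    # Otherwise the CEC enclosure is reset through a pseudo repair.
    return to_close, "pseudo repair of CEC enclosure processor module"
```

The expected SRCs are filtered out before the CEC FRU check so that they never cause an exit from the MAP, which matches the instruction to close them rather than repair them.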

MAP4F20 Section-3

About this task

This procedure addresses a situation in which a failure in a CEC enclosure-to-CEC enclosure RIO interface causes a CEC to become unavailable.

Use this procedure after the original CEC enclosure RIO interface failure has been successfully repaired. This procedure makes the CEC available and restores the storage facility to dual-CEC operation.

  • You completed replacing a CEC enclosure RIO card FRU.
  • The serviceable event that you just repaired has been automatically closed.
  • A new serviceable event was created with "SRC BE400202 = CEC found unavailable during a CEC RIO repair." A CEC enclosure is unavailable or in service mode.

Perform the following actions:

Procedure

  1. Are there any other open serviceable events that list CEC enclosure FRUs and that you have not already attempted to repair?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The CEC enclosure must be reset using the CEC enclosure processor module exchange procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not actually replace the FRU.
    1. Read all these substeps before going to substep 2, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the CEC enclosure processor module from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.
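
The answers that a pseudo repair calls for can be captured as a short checklist. The sketch below is an assumption-laden illustration: the prompt wording is paraphrased rather than copied from the HMC screens, and the names `PSEUDO_REPAIR_ANSWERS` and `fru_selection_path` are hypothetical.

```python
# Paraphrased prompts paired with the answers the pseudo repair substeps specify.
# The prompt text is NOT the literal HMC screen wording.
PSEUDO_REPAIR_ANSWERS = [
    ("Result of using the service procedure in the MAP", "Problem not fixed"),
    ("Did you exchange any parts?", "No"),
    ("Did you isolate the problem?", "Yes"),
]


def fru_selection_path(listed: bool, listed_after_show_more: bool) -> str:
    """Return how to select the FRU, following the fallback order in the substeps."""
    if listed:
        return "select the FRU from the FRU list"
    if listed_after_show_more:
        return "select Show more FRUs, then select the FRU"
    # Last resort when the FRU never appears in the list.
    return "use MAP1230 Replace a FRU without using a serviceable event"
```

The combination of "Problem not fixed", "No", and "Yes" is what steers the HMC into the FRU exchange process even though no part is physically replaced.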

MAP4F20 Section-4

About this task

This procedure addresses situations in which a failure in a CEC enclosure causes an I/O enclosure to become unavailable.

Situation 1:

Use this procedure after the original CEC enclosure failure has been successfully repaired. This procedure returns the I/O enclosure to an operational state.
  • You completed replacing a CEC enclosure FRU.
  • The serviceable event for the CEC enclosure has been automatically closed.
  • A new serviceable event was created with "SRC BE400203 = I/O enclosure found unavailable during a CEC enclosure repair." An I/O enclosure is unavailable, quiesced, or powered off.

Situation 2:

Use this procedure when the repair of the original CEC enclosure failure did not complete, for example, because it failed in a deactivation phase. This procedure returns the I/O enclosure to an operational state.
  • A CEC enclosure FRU repair failed.
  • The serviceable event for the CEC enclosure is not closed.
  • A new serviceable event was created with "SRC BE400203 = I/O enclosure found unavailable during a CEC enclosure repair." An I/O enclosure is unavailable or in service mode.

Perform the following actions:

Procedure

  1. Are there any other open serviceable events that you have not already attempted to repair and that list either of these I/O enclosure FRUs: I/O enclosure backplane assembly or I/O enclosure PCIe/SPCN card?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The I/O enclosure must be reset using the I/O enclosure backplane assembly replace procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not actually replace the FRU. In this situation, it is not necessary to disconnect and reconnect the cables as part of the pseudo repair.
    1. Read all these substeps before going to substep 2, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the I/O enclosure backplane assembly from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.
  3. If you are in MAP4F20 Section-4 because of situation 2 (at the top of Section-4), that is, a CEC enclosure FRU repair failed, retry the CEC enclosure FRU repair now. Take the CEC enclosure FRU location code from the FRU list of the original serviceable event, and then use MAP1230 Replace a FRU without using a serviceable event to replace this FRU.
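
The way the two situations change the Section-4 procedure can be sketched as a function that returns the ordered actions. This is a hedged illustration only: `section4_plan`, its parameters, and the action strings are hypothetical names chosen for the example.

```python
def section4_plan(situation: int, other_io_fru_events_open: bool) -> list:
    """Return the ordered actions for MAP4F20 Section-4.

    situation: 1 if the CEC enclosure repair succeeded, 2 if it failed.
    other_io_fru_events_open: True if other unrepaired open events list the
        I/O enclosure backplane assembly or the I/O enclosure PCIe/SPCN card.
    """
    if situation not in (1, 2):
        raise ValueError("situation must be 1 or 2")

    # Other open events with these I/O enclosure FRUs take priority.
    if other_io_fru_events_open:
        return ["exit MAP and repair open serviceable events"]

    # Reset the I/O enclosure through a pseudo repair; no cable disconnect is needed.
    plan = ["pseudo repair of I/O enclosure backplane assembly"]

    # Only situation 2 still has an unfinished CEC enclosure repair to retry.
    if situation == 2:
        plan.append("retry CEC enclosure FRU repair using MAP1230")
    return plan
```

For example, `section4_plan(2, False)` yields the pseudo repair followed by the MAP1230 retry, while `section4_plan(1, False)` yields the pseudo repair alone.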

MAP4F20 Section-5

About this task

This procedure addresses a situation in which multiple power or cooling failures in an I/O enclosure cause the I/O enclosure to become unavailable.

Use this procedure after the I/O enclosure power or cooling failures have been successfully repaired. This procedure returns the I/O enclosure to an operational state.

  • You completed replacing one or more of the following I/O enclosure power or cooling FRUs:
    • I/O enclosure PCIe/SPCN card
    • I/O enclosure backplane assembly
    • I/O enclosure fan
    • I/O enclosure power supply
  • The serviceable events for the I/O enclosure power or cooling FRUs have been automatically closed.
  • A new serviceable event was created with "SRC BE400204 = I/O enclosure found unavailable during an I/O enclosure power/cooling repair." An I/O enclosure is unavailable, quiesced, or powered off.

Procedure

  1. Are there any other open serviceable events that list I/O enclosure FRUs and that you have not already attempted to repair?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The I/O enclosure must be reset using the I/O enclosure backplane assembly replace procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not actually replace the FRU. In this situation, it is not necessary to disconnect and reconnect the cables as part of the pseudo repair.
    1. Read all these substeps before going to substep 2, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the I/O enclosure backplane assembly from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.