Exadata X8-M flash disk replacement

-- Log in to the storage cell and check the alert history

[root@Server300SC03-man ~]# cellcli

CellCLI> list alerthistory

         5_1     2021-07-23T01:28:22+02:00       critical        "A processor component is suspected of causing a fault with a 100% certainty.  Component Name : /SYS/SP  Fault class    : fault.chassis.device.fail  Fault message  : http://support.oracle.com/msg/SPX86A-8007-J7"

         6       2021-08-04T10:58:31+02:00       critical        "Read-only file system detection thread was hung. Cell was power cycled to restore access to the cell."

         7_1     2021-08-04T23:16:00+02:00       critical        "Configuration check discovered the following problems:    System Disk Errors:  [ERROR] Insufficient number of devices for the system containers. The system containers and its devices are running in degraded mode. "

         8_1     2021-08-19T11:14:39+02:00       critical        "M.2 disk failed.  Status        : FAILED  Manufacturer  : INTEL  Model Number  : SSDSCKKB240G8  Size          : 140GB  Serial Number : PHYH023507Q9240J  Firmware      : XC311132  Slot Number   : M.2 Slot: 0 "
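On a cell with a long alert history, it can help to reduce the listing to the critical entries only. The following is a minimal sketch, not part of the original procedure: `critical_alerts` is a hypothetical helper name, and it assumes the whitespace-separated column layout shown above (name, timestamp, severity, message).

```shell
#!/bin/sh
# Hypothetical helper: reads `list alerthistory` output on stdin and prints
# the name and timestamp of every alert whose severity column is "critical".
# Assumes the column order shown in the listing above.
critical_alerts() {
    awk '$3 == "critical" { print $1, $2 }'
}

# Possible usage on the storage cell (not executed here):
#   cellcli -e list alerthistory | critical_alerts
```

CellCLI can also filter on attributes itself, for example `list alerthistory where severity = 'critical'`; the awk version is mainly useful when working from saved output.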

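-- Locate the RAID device of the system disks (pressing Tab after /dev/md24 lists the array and its partitions)

[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24

md24     md24p1   md24p10  md24p11  md24p12  md24p13  md24p14  md24p15  md24p16  md24p2   md24p3   md24p4   md24p5   md24p6   md24p7   md24p8   md24p9

-- Check the state of the array that contained the failed disk: it is degraded, with one of the two mirror members removed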

[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24

/dev/md24:

         Container : /dev/md/imsm0, member 0

        Raid Level : raid1

        Array Size : 104857600 (100.00 GiB 107.37 GB)

     Used Dev Size : 104857600 (100.00 GiB 107.37 GB)

      Raid Devices : 2

     Total Devices : 1

             State : active, degraded

    Active Devices : 1

   Working Devices : 1

    Failed Devices : 0

     Spare Devices : 0

Consistency Policy : resync

              UUID : 40916b67:bffac6f4:696e5a0d:4f5fb015

    Number   Major   Minor   RaidDevice State

       0       8      208        0      active sync   /dev/sdn

       -       0        0        1      removed
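The degraded state can also be checked in a script rather than by reading the output by eye. This is a rough sketch under the assumption that the `mdadm -Q -D` output keeps the `State`, `Raid Devices` and `Active Devices` labels shown above; `md_health` is a hypothetical name introduced here for illustration.

```shell
#!/bin/sh
# Hypothetical helper: reads `mdadm -Q -D <device>` output on stdin and
# prints "degraded", "healthy", or "unknown" (when no device count is found).
md_health() {
    awk -F' : ' '
        /^ *State :/          { state  = $2 }        # e.g. "active, degraded"
        /^ *Raid Devices :/   { raid   = $2 + 0 }    # members the mirror expects
        /^ *Active Devices :/ { active = $2 + 0 }    # members currently in use
        END {
            if (raid == 0)                                print "unknown"
            else if (state ~ /degraded/ || active < raid) print "degraded"
            else                                          print "healthy"
        }'
}

# Possible usage on the storage cell (not executed here):
#   mdadm -Q -D /dev/md24 | md_health
```

For the output above, with `State : active, degraded` and one of two devices active, the helper reports `degraded`.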

-- The Oracle engineer replaces the failed M.2 flash disk while the storage server stays online. Afterwards, check the array again: both mirror members should be active

[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24

/dev/md24:

         Container : /dev/md/imsm0, member 0

        Raid Level : raid1

        Array Size : 104857600 (100.00 GiB 107.37 GB)

     Used Dev Size : 104857600 (100.00 GiB 107.37 GB)

      Raid Devices : 2

     Total Devices : 2

             State : active

    Active Devices : 2

   Working Devices : 2

    Failed Devices : 0

     Spare Devices : 0

Consistency Policy : resync

              UUID : 40916b67:bffac6f4:696e5a0d:4f5fb015

    Number   Major   Minor   RaidDevice State

       0       8      208        0      active sync   /dev/sdn

       2       8      224        1      active sync   /dev/sdo
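To confirm that every mirror slot has a member again, the device table at the end of the output can be condensed to one line per slot. A small sketch, assuming the table header contains the word `RaidDevice` and the column order shown above; `md_members` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: reads `mdadm -Q -D <device>` output on stdin and
# prints one line per RAID slot with its state and, if present, its device.
md_members() {
    awk '
        /RaidDevice/ { in_table = 1; next }   # header row of the member table
        in_table && NF >= 5 {
            # Columns: Number Major Minor RaidDevice State... [device]
            line = "slot " $4 ":"
            for (i = 5; i <= NF; i++) line = line " " $i
            print line
        }'
}

# Possible usage on the storage cell (not executed here):
#   mdadm -Q -D /dev/md24 | md_members
```

Applied to the output after the replacement, this would list slot 0 on /dev/sdn and slot 1 on /dev/sdo, both `active sync`; before the replacement, slot 1 shows as `removed`.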

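-- With both mirror members active again, revalidate the cell configuration and recheck the alert history

[root@Server300SC03-man ~]# cellcli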

CellCLI> alter cell validate configuration

Cell Server300SC03_man successfully altered

CellCLI> list alerthistory

         5_1     2021-07-23T01:28:22+02:00       critical        "A processor component is suspected of causing a fault with a 100% certainty.  Component Name : /SYS/SP  Fault class    : fault.chassis.device.fail  Fault message  : http://support.oracle.com/msg/SPX86A-8007-J7"

         6       2021-08-04T10:58:31+02:00       critical        "Read-only file system detection thread was hung. Cell was power cycled to restore access to the cell."

         7_1     2021-08-04T23:16:00+02:00       critical        "Configuration check discovered the following problems:    System Disk Errors:  [ERROR] Insufficient number of devices for the system containers. The system containers and its devices are running in degraded mode. "

         7_2     2021-08-19T11:34:58+02:00       clear           "The configuration check was successful."

         8_1     2021-08-19T11:14:39+02:00       critical        "M.2 disk failed.  Status        : FAILED  Manufacturer  : INTEL  Model Number  : SSDSCKKB240G8  Size          : 140GB  Serial Number : PHYH023507Q9240J  Firmware      : XC311132  Slot Number   : M.2 Slot: 0 "

         8_2     2021-08-19T11:16:55+02:00       clear           "M.2 disk was replaced.  Status        : NORMAL  Manufacturer  : INTEL  Model Number  : SSDSCKKB240G8  Size          : 224GB  Serial Number : PHYH01820139240J  Firmware      : XC311120  Slot Number   : M.2 Slot: 0 "
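In this listing, alerts 7 and 8 each have a matching `clear` entry, while 5_1 and 6 do not. A quick way to spot such entries in a longer history is sketched below. It assumes that names of the form `N_M` belong to alert sequence `N` and that the third column holds the severity; `uncleared_alerts` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: reads `list alerthistory` output on stdin and prints
# the base ID of every alert that has no "clear" entry in the listing.
uncleared_alerts() {
    awk '
        $3 ~ /^(critical|warning|info|clear)$/ {
            id = $1; sub(/_.*/, "", id)              # 7_2 -> 7, 6 -> 6
            if (!(id in seen)) { seen[id] = 1; order[++n] = id }
            if ($3 == "clear") cleared[id] = 1
        }
        END {
            for (i = 1; i <= n; i++)
                if (!(order[i] in cleared)) print order[i]
        }'
}

# Possible usage on the storage cell (not executed here):
#   cellcli -e list alerthistory | uncleared_alerts
```

Alerts reported by this helper may still describe an open problem, or may be one-off events that never receive a clear message, so they are worth reviewing before the history is dropped.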

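-- Drop the alert history. Note that "all" also removes alerts 5_1 and 6, which have no matching clear entry

CellCLI> drop alerthistory all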

Alert 5_1 successfully dropped

Alert 6 successfully dropped

Alert 7_1 successfully dropped

Alert 7_2 successfully dropped

Alert 8_1 successfully dropped

Alert 8_2 successfully dropped

CellCLI> list alerthistory
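As an alternative to `drop alerthistory all`, only the alert sequences that have been cleared could be dropped, leaving the others in place for review. The sketch below builds such a command from a saved listing and only prints it. It relies on two assumptions: that `drop alerthistory` accepts a comma-separated list of alert names, and that all members of a sequence (for example 7_1 and 7_2) are dropped together. `drop_cleared_cmd` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: reads `list alerthistory` output on stdin and prints a
# CellCLI command that drops only the alert sequences containing a "clear".
# The command is printed, not executed.
drop_cleared_cmd() {
    awk '
        $3 ~ /^(critical|warning|info|clear)$/ {
            id = $1; sub(/_.*/, "", id)              # sequence ID: 7_2 -> 7
            names[++n] = $1; base[n] = id
            if ($3 == "clear") cleared[id] = 1
        }
        END {
            list = ""
            for (i = 1; i <= n; i++)
                if (base[i] in cleared)
                    list = list (list == "" ? "" : ", ") names[i]
            if (list != "") print "drop alerthistory " list
        }'
}

# Possible usage with a listing saved before the drop (file name is illustrative):
#   drop_cleared_cmd < alerthistory.txt
```

For the history shown earlier, this would produce a command covering 7_1, 7_2, 8_1 and 8_2 while keeping 5_1 and 6.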

