Exadata X8-M flash disk replacement
-- Log in to the storage cell and check the alert history
[root@Server300SC03-man ~]# cellcli
CellCLI> list alerthistory
5_1 2021-07-23T01:28:22+02:00 critical "A processor component is suspected of causing a fault with a 100% certainty. Component Name : /SYS/SP Fault class : fault.chassis.device.fail Fault message : http://support.oracle.com/msg/SPX86A-8007-J7"
6 2021-08-04T10:58:31+02:00 critical "Read-only file system detection thread was hung. Cell was power cycled to restore access to the cell."
7_1 2021-08-04T23:16:00+02:00 critical "Configuration check discovered the following problems: System Disk Errors: [ERROR] Insufficient number of devices for the system containers. The system containers and its devices are running in degraded mode. "
8_1 2021-08-19T11:14:39+02:00 critical "M.2 disk failed. Status : FAILED Manufacturer : INTEL Model Number : SSDSCKKB240G8 Size : 140GB Serial Number : PHYH023507Q9240J Firmware : XC311132 Slot Number : M.2 Slot: 0 "
-- Tab completion on /dev/md24 lists the array device and its partitions
[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24
md24 md24p1 md24p10 md24p11 md24p12 md24p13 md24p14 md24p15 md24p16 md24p2 md24p3 md24p4 md24p5 md24p6 md24p7 md24p8 md24p9
-- Check the array for the disk that is reported as failed
[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24
/dev/md24:
Container : /dev/md/imsm0, member 0
Raid Level : raid1
Array Size : 104857600 (100.00 GiB 107.37 GB)
Used Dev Size : 104857600 (100.00 GiB 107.37 GB)
Raid Devices : 2
Total Devices : 1
State : active, degraded
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Consistency Policy : resync
UUID : 40916b67:bffac6f4:696e5a0d:4f5fb015
Number Major Minor RaidDevice State
0 8 208 0 active sync /dev/sdn
- 0 0 1 removed
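The degraded state shown above can also be checked from a script. The following is a minimal sketch, not part of the original procedure: `md_check` is a hypothetical helper name, and it assumes the `mdadm -Q -D` output keeps the `Raid Devices`, `Active Devices` and `State` labels seen in this listing.

```shell
# Hypothetical helper: reads `mdadm -Q -D <device>` output on stdin,
# prints a one-line summary and returns 1 if the array looks degraded.
md_check() {
  awk -F' : ' '
    /Raid Devices/   { raid = $2 + 0 }
    /Active Devices/ { active = $2 + 0 }
    /^ *State :/     { state = $2; sub(/ +$/, "", state) }
    END {
      printf "state=%s raid=%d active=%d\n", state, raid, active
      # Degraded if the state says so or a member is missing.
      rc = (state ~ /degraded/ || active < raid) ? 1 : 0
      exit rc
    }'
}
```

A possible use on the cell would be `mdadm -Q -D /dev/md24 | md_check`, which for the output above would print `state=active, degraded raid=2 active=1` and return a non-zero status.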
-- An Oracle engineer replaces the flash disk online, while the storage server stays up; afterwards, check the array again
[root@Server300SC03-man ~]# mdadm -Q -D /dev/md24
/dev/md24:
Container : /dev/md/imsm0, member 0
Raid Level : raid1
Array Size : 104857600 (100.00 GiB 107.37 GB)
Used Dev Size : 104857600 (100.00 GiB 107.37 GB)
Raid Devices : 2
Total Devices : 2
State : active
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Consistency Policy : resync
UUID : 40916b67:bffac6f4:696e5a0d:4f5fb015
Number Major Minor RaidDevice State
0 8 208 0 active sync /dev/sdn
2 8 224 1 active sync /dev/sdo
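To compare the member table before and after the replacement, the rows at the bottom of the `mdadm -Q -D` output can be reduced to one line per RAID slot. This is only a sketch under the assumption that the table keeps the `Number Major Minor RaidDevice State` column order shown here; `md_members` is a hypothetical helper name.

```shell
# Hypothetical helper: reads `mdadm -Q -D <device>` output on stdin and
# prints one line per RAID slot with its state and device (or "-").
md_members() {
  awk '
    # Member rows: Number Major Minor RaidDevice State... [device]
    NF >= 5 && $1 ~ /^(-|[0-9]+)$/ && $2 ~ /^[0-9]+$/ &&
    $3 ~ /^[0-9]+$/ && $4 ~ /^[0-9]+$/ {
      # The last field is the device path only when the slot is populated.
      dev = ($NF ~ /^\/dev\//) ? $NF : "-"
      printf "slot=%s state=%s device=%s\n", $4, $5, dev
    }'
}
```

Run as `mdadm -Q -D /dev/md24 | md_members`, a slot without a device path shows up with `device=-`, which makes a missing member easy to spot.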
-- Validate the cell configuration and review the alert history
[root@Server300SC03-man ~]# cellcli
CellCLI> alter cell validate configuration
Cell Server300SC03_man successfully altered
CellCLI> list alerthistory
5_1 2021-07-23T01:28:22+02:00 critical "A processor component is suspected of causing a fault with a 100% certainty. Component Name : /SYS/SP Fault class : fault.chassis.device.fail Fault message : http://support.oracle.com/msg/SPX86A-8007-J7"
6 2021-08-04T10:58:31+02:00 critical "Read-only file system detection thread was hung. Cell was power cycled to restore access to the cell."
7_1 2021-08-04T23:16:00+02:00 critical "Configuration check discovered the following problems: System Disk Errors: [ERROR] Insufficient number of devices for the system containers. The system containers and its devices are running in degraded mode. "
7_2 2021-08-19T11:34:58+02:00 clear "The configuration check was successful."
8_1 2021-08-19T11:14:39+02:00 critical "M.2 disk failed. Status : FAILED Manufacturer : INTEL Model Number : SSDSCKKB240G8 Size : 140GB Serial Number : PHYH023507Q9240J Firmware : XC311132 Slot Number : M.2 Slot: 0 "
8_2 2021-08-19T11:16:55+02:00 clear "M.2 disk was replaced. Status : NORMAL Manufacturer : INTEL Model Number : SSDSCKKB240G8 Size : 224GB Serial Number : PHYH01820139240J Firmware : XC311120 Slot Number : M.2 Slot: 0 "
-- Drop the old alerts and confirm that the alert history is empty
CellCLI> drop alerthistory all
Alert 5_1 successfully dropped
Alert 6 successfully dropped
Alert 7_1 successfully dropped
Alert 7_2 successfully dropped
Alert 8_1 successfully dropped
Alert 8_2 successfully dropped
CellCLI> list alerthistory
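Before running `drop alerthistory all`, it may be worth listing which alerts never received a matching `clear` entry. The sketch below is an illustration, not part of the original procedure: `open_alerts` is a hypothetical helper name, and it assumes that names such as 7_1 and 7_2 belong to the same alert series and that a `clear` severity closes that series.

```shell
# Hypothetical helper: reads `list alerthistory` output on stdin and prints
# the number of every alert series that has no "clear" entry.
open_alerts() {
  awk '
    # Only rows whose first field looks like an alert name (N or N_M).
    $1 ~ /^[0-9]+(_[0-9]+)?$/ {
      id = $1
      sub(/_.*/, "", id)                 # series number, e.g. 7 for 7_1 and 7_2
      if (!(id in seen)) { order[++n] = id; seen[id] = 1 }
      if ($3 == "clear") cleared[id] = 1
    }
    END {
      for (i = 1; i <= n; i++)
        if (!(order[i] in cleared)) print order[i]
    }'
}
```

It could be fed with `cellcli -e "list alerthistory" | open_alerts`. For the alert history shown in this post it would report series 5 and 6, since only 7 and 8 have a `clear` entry.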