|
|
Question : HP SAN drive fail
|
|
|
|
we have a HP SAN MSA1000 in our company. last 2 weeks we saw a message that indicate one of our hard dirve has falied and we should change it (the current array controller has a missing or bad phydsical drive attached to Box 1 Bay 13") we replace hard drive with new one and SAN rebuild it and it worked for one week but today we saw this message again, if we change hard drive it will work for one week but after that this message apper again. Array Diagnostic Report is SLOT SUMMARY: Slot Num Slot Type Array Controllers and Host Adapters Detected -------- --------- -------------------------------------------- SLOT 1 PCI FCA2214 2Gb PCI-X FC HBA for Windows and Linux | -(ID 65536) MSA1000 Array Controller
SLOT 1 (ID 65536) MSA1000 Array Controller ERROR REPORT:
Logical drive 4 status = Interim recovery (volume functional, but not fault tolerant) SCSI Port 2, drive ID 5 failed - REPLACE (A drive insertion has been detected) SCSI Port 1 Drive ID 0 RIS copies within this drive do not match SCSI Port 1 Drive ID 1 RIS copies within this drive do not match SCSI Port 1 Drive ID 2 RIS copies within this drive do not match SCSI Port 1 Drive ID 3 RIS copies within this drive do not match SCSI Port 1 Drive ID 4 RIS copies within this drive do not match SCSI Port 1 Drive ID 5 RIS copies within this drive do not match SCSI Port 1 Drive ID 8 RIS copies within this drive do not match Error occurred reading RIS copy from SCSI Port 2 Drive ID 0 Error occurred reading RIS copy from SCSI Port 2 Drive ID 1 Error occurred reading RIS copy from SCSI Port 2 Drive ID 2 Error occurred reading RIS copy from SCSI Port 2 Drive ID 3 Error occurred reading RIS copy from SCSI Port 2 Drive ID 4 Error occurred reading RIS copy from SCSI Port 2 Drive ID 5 Error occurred reading RIS copy from SCSI Port 2 Drive ID 8
SUBSYSTEM INFORMATION:
Backplane Slot: 0 Chassis Serial Num: SGM06219MQ This Controller World Wide Name: 0x500805f3001d7911 Array Serial Number: PB9840NX3T8039 Cache Serial Number: Not Available Other Controller World Wide Name: Not Available Array Serial Number: Not Available Cache Serial Number: Not Available Storage Subsystem System Ctlr Board Serial Num: SGM06219MQ Pwr Backplane Bd Serial Num: PB8B40FHAT808Y Drv Backplane Bd Serial Num: PB8B40FHAT808Y Fan Module Bd Serial Num: SGM06219MQ 1st Power Supply Serial Num: SGM06219MQ 2nd Power Supply Serial Num: SGM06219MQ Pwr Bay Fan Mod Serial Num: SGM06219MQ
CONTROLLER IDENTIFICATION: Configured Logical Drives: 4 Configuration Signature: 0xa17f9b6b Controller Firmware Rev: 4.48 Controller ROM Revision: 4.48 Controller Hardware Rev: 0x00 Boot Block Version: 0.10 Board ID: 0xe0100e11 Cable or Config Error: 0x00 (No) Invalid Host RAM Address: No CPU Revision: 0x02 CPU to PCI ASIC Rev: 0x03 Cache Controller ASIC Rev: 0xff PCI to Host ASIC Rev: 0x02 Marketing Revision: 0x41 (Rev A) Expand Disable Code: 0x01 SCSI Chip Count: 4 MaximumBlocks: 00008190 Controller Clock Count: 243906991 Max SCSI IDs per Bus: 16 Big Drive Map: 0x013f 0x011f 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Ext Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Non-Disk Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Fibre Chip Count: 1 LKS: 0x00 AMS: 0x00 FS: 0xc0 0x00 0x00 0x00 0x00 0x00 0x2e34 0x3834 0x01 0x00 0x00 0x00 Recovery ROM Inactive Image Revision: 4.48 Recovery ROM Active Image Flags Status: 0x01
LOGICAL DRIVE STATUS:
Logical Drive 1: Drive Status: OK Blocks to Rebuild: 0 Blocks Re-mapped: Spare Status Flags: 0x00 Spare to Replaced Map: See Big Spare to Replace Map: Media Was Exchanged: No Cache Failure: No Expand Failure: 0x00 Unit Flags: 0x00 Big Remap Count: Bus ID Count ID Count ID Count ID Count ID Count 1 4 0007h Big Drive Failure Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Replacement Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Active Spare Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Spare to Replace Map: No spares have replaced any drives Big Spare Marked OK Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000
Logical Drive 2: Drive Status: OK Blocks to Rebuild: 0 Blocks Re-mapped: Spare Status Flags: 0x00 Spare to Replaced Map: See Big Spare to Replace Map: Media Was Exchanged: No Cache Failure: No Expand Failure: 0x00 Unit Flags: 0x00 Big Remap Count: All Counts Zero Big Drive Failure Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Replacement Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Active Spare Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Spare to Replace Map: No spares have replaced any drives Big Spare Marked OK Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000
Logical Drive 3: Drive Status: OK Blocks to Rebuild: 0 Blocks Re-mapped: Spare Status Flags: 0x00 Spare to Replaced Map: See Big Spare to Replace Map: Media Was Exchanged: No Cache Failure: No Expand Failure: 0x00 Unit Flags: 0x00 Big Remap Count: All Counts Zero Big Drive Failure Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Replacement Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Active Spare Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Spare to Replace Map: No spares have replaced any drives Big Spare Marked OK Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000
Logical Drive 4: Drive Status: Interim recovery (volume functional, but not fault tolerant) Blocks to Rebuild: 0 Blocks Re-mapped: Spare Status Flags: 0x00 Spare to Replaced Map: See Big Spare to Replace Map: Media Was Exchanged: No Cache Failure: No Expand Failure: 0x00 Unit Flags: 0x00 Big Remap Count: Bus ID Count ID Count ID Count ID Count ID Count 1 4 0007h Big Drive Failure Map: 0x0000 0x0020 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Replacement Drive Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Active Spare Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 Big Spare to Replace Map: No spares have replaced any drives Big Spare Marked OK Map: 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000 0x0000
SCSI Port 2, Drive ID 5 Factory: Serial #, Firmware Rev, and Mfg/Model #: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 30 31 30 34 00 00 00 00 43 4f 4d 50 ....0104....COMP 41 51 20 42 44 33 30 30 38 39 42 42 41 20 20 20 AQ BD30089BBA 20 20 20 20 00 00 00 00 00 00 00 00 00 00 00 00 ............ 00 00 00 00 .... Since Power: Serial #, Firmware Rev, and Mfg/Model #: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 .... Threshold Flags: 0x0003 Serial Number Control: 0x0054 Firmware Revision Control: 0x0248 Mfg/Model Number Control: 0x0268
Factory Since Power Threshold Control Serv. Time 0000197c 00000000 ffffffff 2184 Read Blks 0000000030afa067 0000000000000000 0108 Hrd Read 00000000 00000000 ffffffff 2184 Rtry Read 00000000 00000000 ffffffff 2184 ECC Read 0000000000000000 0000000000000000 ffffffffffffffff 2188 Write Blks 000000003d702fb6 0000000000000000 0108 Hrd Write 00000000 00000000 ffffffff 2184 Rtry Write 00000000 00000000 ffffffff 2184 Seeks 0000000000000190 0000000000000000 0108 Seek Errs 0000000000000000 0000000000000000 ffffffffffffffff 2188 Spin Cyls 00000001 00000000 ffffffff 2184 Spin Time 0090 0000 ffff 2282 Test 1 ffff 0000 ffff 2a82 Test 2 0004 0000 ffff 2282 Test 3 003b 0000 ffff 2282 Test 4 0074 0000 ffff 2282 Spare Blks ffffffff 00000000 0a04 Re-mapped 00000000 00000000 ffffffff 2584 DRQ Tmots ffff 0000 ffff 2982 Timeouts 0000 0000 ffff 2182 Rebuilds 0001 0000 ffff 2182 Spn Retrs ffff 0000 ffff 2982 Fl Rd Recv 0000 0000 ffff 2182 Fl Wt Recv 0000 0000 ffff 2182 Format Err 0000 0000 ffff 2182 POST Err ffff 0000 ffff 2982 Drv Nt Ry 00000000 00000000 ffffffff 2184 Reallc Abt ffffffff 00000000 ffffffff 2984 IRQ Gltchs ffffffff 00000000 ffffffff 2984 Bus Flts 00000000 00000000 ffffffff 2184 Hot Plgs 00000000 00000000 ffffffff 2184 Tk Rwt Err ffff 0000 ffff 2982 Rmp Wt Err 0000 0000 ffff 2182 Bg Fw Rev 0000000000000000 0000000000000000 0a48 Med Flrs 0000 0000 ffff 2182 Hrdw Errs 0000 0000 ffff 2182 Abt Cmd Fl 0000 0000 ffff 2182 Spn Up Fl 0000 0000 ffff 2182 Bd Tgt Cnt 0000 0000 ffff 2182 Pred Fails 00000000 00000000 ffffffff 2184
DRIVE ERROR LOG: Error Log Header: Parameter Length = 0x14 Entry Size = 0x0014 Current Entry = 0x02 Total Errors Logged = 0x00000016 Error Log Data:
SCSI CAM Sense Sense Stat Stat Key Code Qual Block(Vl) Time Op Info ---- ---- ----- ----- ---- --------- ---- -- ---- 00 0e 00 00 00 004479df(0) 0000197c 28 0000 00 0e 00 00 00 0cd1f640(0) 0000197c 2a 0000 00 0e 00 00 00 179ab840(0) 0000197c 28 0000 00 0e 00 00 00 179ab940(0) 0000197c 2a 0000 00 0e 00 00 00 05aee940(0) 0000197c 2a 0000 00 0e 00 00 00 0cd1f540(0) 0000197c 2a 0000 00 0e 00 00 00 05aee9c0(0) 0000197c 2a 0000 00 0e 00 00 00 1fa2fa40(0) 0000197c 2a 0000 00 0e 00 00 00 1c3343c0(0) 0000197c 28 0000 00 0e 00 00 00 0cd1f2c0(0) 0000197c 2a 0000 00 0e 00 00 00 1fb24fc0(0) 0000197c 28 0000 00 0e 00 00 00 1c3344bf(0) 0000197c 28 0000 00 0e 00 00 00 179ab6c0(0) 0000197c 28 0000 00 0e 00 00 00 19655840(0) 0000197c 28 0000 00 0e 00 00 00 80001000(0) 0000197c 12 0000 00 0e 00 00 00 1a52d7c0(0) 0000197c 28 0000 00 0e 00 00 00 0cd1f5c0(0) 0000197c 2a 0000 00 0e 00 00 00 179ab740(0) 0000197c 28 0000 00 0e 00 00 00 0044785f(0) 0000197c 28 0000 00 0e 00 00 00 1a9e3dc0(0) 0000197c 28 0000
SCSI Port 2, Drive ID 5
Drive Model: COMPAQ BD30089BBA Product Rev: 0104 Serial Number: DA60P9100J8C Drive Type: 0x00 Parallel SCSI Block Size: 512 bytes/sector Total Blocks: 585937500 sectors/disk Reserved Blocks: 1088 reserved sectors/disk SCSI Inquiry Bits: 0x3A Stamped for M&P: no Last Failure Reason: 0x30 (A drive insertion has been detected) Phys Drive Flags: 0x04 0x21 0x00 0x00 Wide SCSI transfers Enabled Asynchronous SCSI Enabled S.M.A.R.T. Supported S.M.A.R.T. Disabled Configured as part of Logical Drive SCSI LUN: 0 Spi Speed Rules: 0x00000000 Physical Connector: I2 (controller connector attached to drive) Physical Box on Bus: 1 (number of the physical enclosure in which drive resides)
Physical Bay in Box: 13 (number of the physical drive bay in the enclosure)
SCSI Port 2, Drive ID 5: RIS Copy 0: Not Available RIS Copy 1: Not Available
|
|
|
|
Answer : HP SAN drive fail
|
|
i8042prt is relatively simple and created by MS... this driver should never show such BSOD! Are you using PS/2 keyboard? Could you (just for test) change it to USB one?
|
|
|
|