Re: Help with recovering a RAID5 array
From: Mathias Burén <hidden>
Date: 2013-05-02 13:49:39
On 2 May 2013 14:29, Stefan Borggraefe [off-list ref] wrote:
On Thursday, 2 May 2013, 14:17:04, Mathias Burén wrote:
On 2 May 2013 14:14, Stefan Borggraefe [off-list ref] wrote:
On Thursday, 2 May 2013, 13:30:22, Mathias Burén wrote:
I won't scold you for using RAID5 instead of RAID6 with this number of drives, and especially drives of this size. Could you please post the output of smartctl -a for each device (from smartmontools)? That way we can verify which HDDs are broken before proceeding.

Mathias

Hello Mathias,

RAID6 would clearly have been the safer option, but we needed the extra space and only had this number of drives available. Here is the requested output:

smartctl -a /dev/sdc
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.2.0-37-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor: Hitachi
Product: HUS724040ALE640
Revision: MJAO
User Capacity: 4.000.787.030.016 bytes [4,00 TB]
Logical block size: 512 bytes
Logical Unit id: 0x5000cca22bd08a85
Serial number: PK2331PAH5D0YT
Device type: disk
Local Time is: Thu May 2 15:09:16 2013 CEST
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: OK

[...]

Hm, are these behind some controller of sorts? What about smartctl -x?

We use an Adaptec 71605 controller. smartctl -x does not provide any more useful information, I suppose. This time I am posting the output of only one drive as an example; they all give a similar result.

smartctl -x /dev/sdc
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.2.0-37-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor: Hitachi
Product: HUS724040ALE640
Revision: MJAO
User Capacity: 4.000.787.030.016 bytes [4,00 TB]
Logical block size: 512 bytes
Logical Unit id: 0x5000cca22bd08a85
Serial number: PK2331PAH5D0YT
Device type: disk
Local Time is: Thu May 2 15:20:55 2013 CEST
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: OK
Current Drive Temperature: <not available>
Error Counter logging not supported
Device does not support Self Test logging
Device does not support Background scan results logging
scsiPrintSasPhy Log Sense Failed [unsupported field in scsi command]

What about the state of the software RAID5? It would be great if I could bring it back to a state where the filesystem on it is fully working again, without having to copy the 20 TB of data to it again (copying this amount of data takes some time :( ).

--
Best regards,
Stefan Borggraefe

Ugh, Adaptec, not my favourite controller. Do you have arcconf installed? You could run arcconf getconfig 1 (or whatever your controller number is) to grab some information about your controller and the HDDs connected to it. Do you have /dev/sg? devices? If so, smartctl -a might work on them.

Regarding the software RAID, I would double-check the health of your HDDs before attempting anything.

Mathias
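
For anyone wanting to script the /dev/sg? check suggested above, here is a minimal sketch in Python. It only assumes that smartctl is on the PATH and that the script runs with enough privileges to open the device nodes. The function names (sg_devices, parse_health, check) are made up for illustration, and whether the sg nodes actually map to the physical disks behind the Adaptec controller is something you would have to verify on your own system.

```python
import glob
import re
import subprocess

# Field names we care about, lowercased. The first two are what smartctl
# prints for SCSI/SAS-style devices (as in the output posted in this thread);
# the last is the wording used for ATA devices, in case passthrough works.
WANTED = {
    "serial number",
    "smart health status",
    "smart overall-health self-assessment test result",
}


def sg_devices(pattern="/dev/sg*"):
    """Return SCSI generic device nodes, sorted by their numeric suffix."""
    def suffix(path):
        m = re.search(r"(\d+)$", path)
        return int(m.group(1)) if m else -1
    return sorted(glob.glob(pattern), key=suffix)


def parse_health(output):
    """Extract the serial number and health line from smartctl -a output."""
    fields = {}
    for line in output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() in WANTED:
            fields[key.strip().lower()] = value.strip()
    return fields


def check(device):
    """Run 'smartctl -a' on one device and return the parsed fields."""
    try:
        # smartctl's exit status is a bit mask, so a non-zero code does not
        # necessarily mean the call failed; we only look at stdout here.
        result = subprocess.run(
            ["smartctl", "-a", device], capture_output=True, text=True
        )
    except FileNotFoundError:
        return {}  # smartctl is not installed
    return parse_health(result.stdout)


if __name__ == "__main__":
    for dev in sg_devices():
        print(dev, check(dev))
```

An empty result for a device means smartctl printed neither field, which on this controller may simply mean the node is not a disk or that SMART data is not passed through.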