Thread (6 messages) 6 messages, 3 authors, 2020-07-30

Re: Software RAID6 broke after power outage

From: Cory Derenburger <hidden>
Date: 2020-07-22 16:29:50

Thanks Wols,

The version on Linux Mint I've been running is quite old.  Once the
server was last configured it did not have updates.  It was put on a
shelf and (mostly) left alone to serve files reliably for years.

$ mdadm --version
mdadm - v3.2.5 - 18th May 2012

uname -a
Linux LIZZY 3.16.0-38-generic #52~14.04.1-Ubuntu SMP Fri May 8
09:43:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

Here is the lsdrv information
./lsdrv
**Warning** The following utility(ies) failed to execute:
  sginfo
Some information may be missing.

Controller platform [None]
└platform floppy.0
 └fd0 0.00k [2:0] Empty/Unknown
PCI [ahci] 00:11.0 SATA controller: Advanced Micro Devices, Inc.
[AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]
├scsi 0:0:0:0 ATA      MKNSSDEC60GB     {ME150901AS2073580}
│└sda 55.90g [8:0] Partitioned (dos)
│ ├sda1 7.92g [8:1] ext4 {ef60a590-af5c-41f6-9166-3988d6646092}
│ │└Mounted as /dev/disk/by-uuid/ef60a590-af5c-41f6-9166-3988d6646092 @ /
│ ├sda2 1.00k [8:2] Partitioned (dos)
│ ├sda5 36.76g [8:5] ext4 {22fbf184-d791-45c9-8de9-62ee4f0a1776}
│ │└Mounted as /dev/sda5 @ /home
│ └sda6 1.91g [8:6] swap {4326b017-dea7-489d-850a-29c814ea6a99}
├scsi 1:0:0:0 ATA      Hitachi HUA72302 {YFGK3VXD}
│└sdb 1.82t [8:16] Partitioned (dos)
│ └sdb1 1.82t [8:17] MD  (none/) (w/ sdd1,sde1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
│  └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
│                   Empty/Unknown
├scsi 2:0:0:0 ATA      WDC WD20EARS-00M {WD-WCAZA1597296}
│└sdc 1.82t [8:32] Partitioned (dos)
│ └sdc1 1.82t [8:33] Empty/Unknown
├scsi 3:0:0:0 ATA      Hitachi HUA72302 {YFHK9JAA}
│└sdd 1.82t [8:48] Partitioned (dos)
│ └sdd1 1.82t [8:49] MD  (none/) (w/ sdb1,sde1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
│  └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
│                   Empty/Unknown
└scsi 5:0:0:0 ATA      Hitachi HUA72302 {YFG7LWBA}
 └sde 1.82t [8:64] Partitioned (dos)
  └sde1 1.82t [8:65] MD  (none/) (w/ sdb1,sdd1) spare 'LIZZY:0'
{605a2a08-65dc-e76a-967b-4f9e8fc79011}
   └md0 0.00k [9:0] MD v1.2  () inactive, None (None) None {None}
                    Empty/Unknown
PCI [pata_atiixp] 00:14.1 IDE interface: Advanced Micro Devices, Inc.
[AMD/ATI] SB7x0/SB8x0/SB9x0 IDE Controller
└scsi 6:0:0:0 PIONEER  DVD-RW  DVR-116D {PIONEER_DVD-RW_DVR-116D}
 └sr0 1.00g [11:0] Empty/Unknown
PCI [ahci] 02:00.0 SATA controller: Marvell Technology Group Ltd.
Device 9215 (rev 11)
└scsi 8:x:x:x [Empty]
Other Block Devices
├loop0 0.00k [7:0] Empty/Unknown
├loop1 0.00k [7:1] Empty/Unknown
├loop2 0.00k [7:2] Empty/Unknown
├loop3 0.00k [7:3] Empty/Unknown
├loop4 0.00k [7:4] Empty/Unknown
├loop5 0.00k [7:5] Empty/Unknown
├loop6 0.00k [7:6] Empty/Unknown
├loop7 0.00k [7:7] Empty/Unknown
├ram0 64.00m [1:0] Empty/Unknown
├ram1 64.00m [1:1] Empty/Unknown
├ram2 64.00m [1:2] Empty/Unknown
├ram3 64.00m [1:3] Empty/Unknown
├ram4 64.00m [1:4] Empty/Unknown
├ram5 64.00m [1:5] Empty/Unknown
├ram6 64.00m [1:6] Empty/Unknown
├ram7 64.00m [1:7] Empty/Unknown
├ram8 64.00m [1:8] Empty/Unknown
├ram9 64.00m [1:9] Empty/Unknown
├ram10 64.00m [1:10] Empty/Unknown
├ram11 64.00m [1:11] Empty/Unknown
├ram12 64.00m [1:12] Empty/Unknown
├ram13 64.00m [1:13] Empty/Unknown
├ram14 64.00m [1:14] Empty/Unknown
└ram15 64.00m [1:15] Empty/Unknown


smartctrl for the drives
# smartctl --xall /dev/sdb
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFGK3VXD
LU WWN Device Id: 5 000cca 223c7c8d4
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:43:42 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (20116) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 336) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   133   133   054    -    90
  3 Spin_Up_Time            POS---   100   100   024    -    492
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   123   123   020    -    31
  9 Power_On_Hours          -O--C-   096   096   000    -    32011
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    641
193 Load_Cycle_Count        -O--C-   100   100   000    -    641
194 Temperature_Celsius     -O----   176   176   000    -    34 (Min/Max 23/39)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31921         -
# 2  Extended offline    Completed without error       00%     31753         -
# 3  Extended offline    Completed without error       00%     31585         -
# 4  Extended offline    Completed without error       00%     31417         -
# 5  Extended offline    Completed without error       00%     31249         -
# 6  Extended offline    Completed without error       00%     31081         -
# 7  Extended offline    Completed without error       00%     30913         -
# 8  Extended offline    Completed without error       00%     30745         -
# 9  Extended offline    Completed without error       00%     30577         -
#10  Extended offline    Completed without error       00%     30409         -
#11  Extended offline    Completed without error       00%     30241         -
#12  Extended offline    Completed without error       00%     30073         -
#13  Extended offline    Completed without error       00%     29905         -
#14  Extended offline    Completed without error       00%     29737         -
#15  Extended offline    Completed without error       00%     29569         -
#16  Extended offline    Completed without error       00%     29401         -
#17  Extended offline    Completed without error       00%     29233         -
#18  Extended offline    Completed without error       00%     29065         -
#19  Extended offline    Completed without error       00%     28897         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    34 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/39 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (37)

Index    Estimated Time   Temperature Celsius
  38    2020-07-21 10:36    33  **************
 ...    ..(  3 skipped).    ..  **************
  42    2020-07-21 10:40    33  **************
  43    2020-07-21 10:41    34  ***************
  44    2020-07-21 10:42    33  **************
 ...    ..( 11 skipped).    ..  **************
  56    2020-07-21 10:54    33  **************
  57    2020-07-21 10:55    34  ***************
  58    2020-07-21 10:56    33  **************
 ...    ..(  5 skipped).    ..  **************
  64    2020-07-21 11:02    33  **************
  65    2020-07-21 11:03    34  ***************
  66    2020-07-21 11:04    33  **************
  67    2020-07-21 11:05    33  **************
  68    2020-07-21 11:06    34  ***************
  69    2020-07-21 11:07    33  **************
 ...    ..(  8 skipped).    ..  **************
  78    2020-07-21 11:16    33  **************
  79    2020-07-21 11:17    34  ***************
  80    2020-07-21 11:18    33  **************
  81    2020-07-21 11:19    33  **************
  82    2020-07-21 11:20    34  ***************
  83    2020-07-21 11:21    33  **************
 ...    ..( 11 skipped).    ..  **************
  95    2020-07-21 11:33    33  **************
  96    2020-07-21 11:34    34  ***************
  97    2020-07-21 11:35    33  **************
 ...    ..( 11 skipped).    ..  **************
 109    2020-07-21 11:47    33  **************
 110    2020-07-21 11:48    34  ***************
 111    2020-07-21 11:49    33  **************
 ...    ..( 10 skipped).    ..  **************
 122    2020-07-21 12:00    33  **************
 123    2020-07-21 12:01    34  ***************
 124    2020-07-21 12:02    33  **************
 125    2020-07-21 12:03    33  **************
 126    2020-07-21 12:04    34  ***************
 127    2020-07-21 12:05    33  **************
 ...    ..(  9 skipped).    ..  **************
   9    2020-07-21 12:15    33  **************
  10    2020-07-21 12:16    34  ***************
  11    2020-07-21 12:17    33  **************
 ...    ..(  2 skipped).    ..  **************
  14    2020-07-21 12:20    33  **************
  15    2020-07-21 12:21    34  ***************
  16    2020-07-21 12:22    33  **************
  17    2020-07-21 12:23    33  **************
  18    2020-07-21 12:24    34  ***************
  19    2020-07-21 12:25    33  **************
  20    2020-07-21 12:26    33  **************
  21    2020-07-21 12:27    34  ***************
  22    2020-07-21 12:28    33  **************
 ...    ..(  3 skipped).    ..  **************
  26    2020-07-21 12:32    33  **************
  27    2020-07-21 12:33    34  ***************
 ...    ..(  9 skipped).    ..  ***************
  37    2020-07-21 12:43    34  ***************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32011  Power-on Hours
  1  0x018  6       7094397844  Logical Sectors Written
  1  0x020  6         32420890  Number of Write Commands
  1  0x028  6     183722166461  Logical Sectors Read
  1  0x030  6        194678316  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32006  Spindle Motor Power-on Hours
  3  0x010  4            32006  Head Flying Hours
  3  0x018  4              641  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4               12  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               34  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               31~ Average Long Term Temperature
  5  0x020  1               39  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               37~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               35~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              169  Number of Hardware Resets
  6  0x010  4              129  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS

# smartctl --xall /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (AF)
Device Model:     WDC WD20EARS-00MVWB0
Serial Number:    WD-WCAZA1597296
LU WWN Device Id: 5 0014ee 25a653961
Firmware Version: 51.0AB51
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 2.6, 3.0 Gb/s
Local Time is:    Tue Jul 21 12:45:57 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Disabled
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (38460) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 371) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x3035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR-K   199   199   051    -    2027
  3 Spin_Up_Time            POS--K   167   167   021    -    6641
  4 Start_Stop_Count        -O--CK   100   100   000    -    16
  5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0
  7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
  9 Power_On_Hours          -O--CK   057   057   000    -    31954
 10 Spin_Retry_Count        -O--CK   100   253   000    -    0
 11 Calibration_Retry_Count -O--CK   100   253   000    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    14
192 Power-Off_Retract_Count -O--CK   200   200   000    -    11
193 Load_Cycle_Count        -O--CK   001   001   000    -    1121775
194 Temperature_Celsius     -O---K   121   115   000    -    29
196 Reallocated_Event_Count -O--CK   200   200   000    -    0
197 Current_Pending_Sector  -O--CK   200   200   000    -    0
198 Offline_Uncorrectable   ----CK   200   200   000    -    0
199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
200 Multi_Zone_Error_Rate   ---R--   199   198   000    -    371
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      6  Ext. Comprehensive SMART error log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS      16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS       1  Device vendor specific log
0xc0       GPL,SL  VS       1  Device vendor specific log
0xc1       GPL     VS      93  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
Device Error Count: 89 (device log contains only the most recent 24 errors)
        CR     = Command Register
        FEATR  = Features Register
        COUNT  = Count (was: Sector Count) Register
        LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
        LH     = LBA High (was: Cylinder High) Register    ]   LBA
        LM     = LBA Mid (was: Cylinder Low) Register      ] Register
        LL     = LBA Low (was: Sector Number) Register     ]
        DV     = Device (was: Device/Head) Register
        DC     = Device Control Register
        ER     = Error register
        ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 89 [16] occurred at disk power-on lifetime: 31954 hours (1331
days + 10 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 c8 00 00 00 00 08 08 40 08     04:18:54.610  READ FPDMA QUEUED
  60 00 08 00 c0 00 00 00 00 08 00 40 08     04:18:54.610  READ FPDMA QUEUED
  60 00 08 00 b8 00 00 e8 e0 88 a0 40 08     04:18:54.609  READ FPDMA QUEUED
  60 00 08 00 b0 00 00 e8 e0 88 00 40 08     04:18:54.204  READ FPDMA QUEUED
  b0 00 da 00 00 00 00 00 c2 4f 00 00 08     04:16:10.310  SMART RETURN STATUS

Error 88 [15] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 30 00 00 00 00 08 08 40 08     03:55:50.295  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:50.295  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:50.293  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:50.290  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:50.290  SET
FEATURES [Set transfer mode]

Error 87 [14] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 28 00 00 00 00 08 08 40 08     03:55:50.136  READ FPDMA QUEUED
  60 00 08 00 20 00 00 00 00 08 20 40 08     03:55:50.136  READ FPDMA QUEUED
  60 00 08 00 18 00 00 00 00 0a 00 40 08     03:55:50.135  READ FPDMA QUEUED
  60 00 08 00 10 00 00 00 00 0b f8 40 08     03:55:50.135  READ FPDMA QUEUED
  60 00 08 00 08 00 00 00 00 0b f0 40 08     03:55:50.135  READ FPDMA QUEUED

Error 86 [13] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 e0 00 00 00 00 08 08 40 08     03:55:49.960  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.960  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.958  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.955  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.955  SET
FEATURES [Set transfer mode]

Error 85 [12] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 d8 00 00 00 00 08 08 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 d0 00 00 00 00 08 78 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 c8 00 00 00 00 08 38 40 08     03:55:49.798  READ FPDMA QUEUED
  60 00 08 00 c0 00 00 00 00 08 18 40 08     03:55:49.780  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.780  SET
FEATURES [Enable SATA feature]

Error 84 [11] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 b8 00 00 00 00 08 08 40 08     03:55:49.624  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.624  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.622  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.619  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.619  SET
FEATURES [Set transfer mode]

Error 83 [10] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 b0 00 00 00 00 08 08 40 08     03:55:49.468  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.468  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.466  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.463  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.463  SET
FEATURES [Set transfer mode]

Error 82 [9] occurred at disk power-on lifetime: 31953 hours (1331
days + 9 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 00 00 08 09 40 00  Error: UNC at LBA = 0x00000809 = 2057

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 08 00 a8 00 00 00 00 08 08 40 08     03:55:49.312  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     03:55:49.312  SET
FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     03:55:49.310  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     03:55:49.307  IDENTIFY DEVICE
  ef 00 03 00 46 00 00 00 00 00 00 a0 08     03:55:49.307  SET
FEATURES [Set transfer mode]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31866         -
# 2  Extended offline    Completed without error       00%     31698         -
# 3  Extended offline    Completed without error       00%     31530         -
# 4  Extended offline    Completed without error       00%     31363         -
# 5  Extended offline    Completed without error       00%     31195         -
# 6  Extended offline    Completed without error       00%     31027         -
# 7  Extended offline    Completed without error       00%     30859         -
# 8  Extended offline    Completed without error       00%     30691         -
# 9  Extended offline    Completed without error       00%     30523         -
#10  Extended offline    Completed without error       00%     30356         -
#11  Extended offline    Completed without error       00%     30188         -
#12  Extended offline    Completed without error       00%     30020         -
#13  Extended offline    Completed without error       00%     29852         -
#14  Extended offline    Completed without error       00%     29685         -
#15  Extended offline    Completed without error       00%     29517         -
#16  Extended offline    Completed without error       00%     29349         -
#17  Extended offline    Completed without error       00%     29182         -
#18  Extended offline    Completed without error       00%     29014         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       258 (0x0102)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                    29 Celsius
Power Cycle Min/Max Temperature:     26/29 Celsius
Lifetime    Min/Max Temperature:     26/35 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -41/85 Celsius
Temperature History Size (Index):    478 (465)

Index    Estimated Time   Temperature Celsius
 466    2020-07-21 04:48    29  **********
 ...    ..( 14 skipped).    ..  **********
   3    2020-07-21 05:03    29  **********
   4    2020-07-21 05:04    31  ************
 ...    ..( 21 skipped).    ..  ************
  26    2020-07-21 05:26    31  ************
  27    2020-07-21 05:27    32  *************
 ...    ..(  3 skipped).    ..  *************
  31    2020-07-21 05:31    32  *************
  32    2020-07-21 05:32    31  ************
  33    2020-07-21 05:33    32  *************
  34    2020-07-21 05:34    31  ************
  35    2020-07-21 05:35    31  ************
  36    2020-07-21 05:36    32  *************
  37    2020-07-21 05:37    32  *************
  38    2020-07-21 05:38    31  ************
 ...    ..( 72 skipped).    ..  ************
 111    2020-07-21 06:51    31  ************
 112    2020-07-21 06:52    30  ***********
 113    2020-07-21 06:53    31  ************
 ...    ..( 18 skipped).    ..  ************
 132    2020-07-21 07:12    31  ************
 133    2020-07-21 07:13    30  ***********
 134    2020-07-21 07:14    31  ************
 ...    ..( 56 skipped).    ..  ************
 191    2020-07-21 08:11    31  ************
 192    2020-07-21 08:12    32  *************
 193    2020-07-21 08:13    31  ************
 194    2020-07-21 08:14    32  *************
 195    2020-07-21 08:15    31  ************
 196    2020-07-21 08:16    32  *************
 197    2020-07-21 08:17    32  *************
 198    2020-07-21 08:18    32  *************
 199    2020-07-21 08:19    31  ************
 200    2020-07-21 08:20    31  ************
 201    2020-07-21 08:21    32  *************
 202    2020-07-21 08:22    31  ************
 ...    ..(  2 skipped).    ..  ************
 205    2020-07-21 08:25    31  ************
 206    2020-07-21 08:26    32  *************
 207    2020-07-21 08:27    32  *************
 208    2020-07-21 08:28    31  ************
 209    2020-07-21 08:29    31  ************
 210    2020-07-21 08:30    31  ************
 211    2020-07-21 08:31    30  ***********
 212    2020-07-21 08:32    31  ************
 ...    ..(  6 skipped).    ..  ************
 219    2020-07-21 08:39    31  ************
 220    2020-07-21 08:40     ?  -
 221    2020-07-21 08:41    26  *******
 ...    ..( 13 skipped).    ..  *******
 235    2020-07-21 08:55    26  *******
 236    2020-07-21 08:56    27  ********
 ...    ..(  9 skipped).    ..  ********
 246    2020-07-21 09:06    27  ********
 247    2020-07-21 09:07    28  *********
 ...    ..( 29 skipped).    ..  *********
 277    2020-07-21 09:37    28  *********
 278    2020-07-21 09:38    29  **********
 279    2020-07-21 09:39    28  *********
 ...    ..( 37 skipped).    ..  *********
 317    2020-07-21 10:17    28  *********
 318    2020-07-21 10:18    29  **********
 319    2020-07-21 10:19    28  *********
 ...    ..(  7 skipped).    ..  *********
 327    2020-07-21 10:27    28  *********
 328    2020-07-21 10:28    29  **********
 329    2020-07-21 10:29    29  **********
 330    2020-07-21 10:30    29  **********
 331    2020-07-21 10:31    28  *********
 332    2020-07-21 10:32    28  *********
 333    2020-07-21 10:33    29  **********
 334    2020-07-21 10:34    29  **********
 335    2020-07-21 10:35    28  *********
 ...    ..(  2 skipped).    ..  *********
 338    2020-07-21 10:38    28  *********
 339    2020-07-21 10:39    29  **********
 340    2020-07-21 10:40    28  *********
 ...    ..( 58 skipped).    ..  *********
 399    2020-07-21 11:39    28  *********
 400    2020-07-21 11:40    29  **********
 401    2020-07-21 11:41    29  **********
 402    2020-07-21 11:42    28  *********
 ...    ..( 25 skipped).    ..  *********
 428    2020-07-21 12:08    28  *********
 429    2020-07-21 12:09    29  **********
 430    2020-07-21 12:10    28  *********
 431    2020-07-21 12:11    28  *********
 432    2020-07-21 12:12    29  **********
 433    2020-07-21 12:13    29  **********
 434    2020-07-21 12:14    29  **********
 435    2020-07-21 12:15    28  *********
 ...    ..(  3 skipped).    ..  *********
 439    2020-07-21 12:19    28  *********
 440    2020-07-21 12:20    29  **********
 441    2020-07-21 12:21    28  *********
 442    2020-07-21 12:22    29  **********
 ...    ..( 22 skipped).    ..  **********
 465    2020-07-21 12:45    29  **********

SCT Error Recovery Control command not supported

Device Statistics (GP Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x000a  2           23  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x8000  4        15858  Vendor specific

# smartctl --xall /dev/sdd
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFHK9JAA
LU WWN Device Id: 5 000cca 223d5f593
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:47:13 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (19618) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 327) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   134   134   054    -    88
  3 Spin_Up_Time            POS---   100   100   024    -    498
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   125   125   020    -    30
  9 Power_On_Hours          -O--C-   096   096   000    -    32010
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    648
193 Load_Cycle_Count        -O--C-   100   100   000    -    648
194 Temperature_Celsius     -O----   181   181   000    -    33 (Min/Max 23/37)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31920         -
# 2  Extended offline    Completed without error       00%     31752         -
# 3  Extended offline    Completed without error       00%     31584         -
# 4  Extended offline    Completed without error       00%     31416         -
# 5  Extended offline    Completed without error       00%     31248         -
# 6  Extended offline    Completed without error       00%     31080         -
# 7  Extended offline    Completed without error       00%     30912         -
# 8  Extended offline    Completed without error       00%     30744         -
# 9  Extended offline    Completed without error       00%     30576         -
#10  Extended offline    Completed without error       00%     30408         -
#11  Extended offline    Completed without error       00%     30240         -
#12  Extended offline    Completed without error       00%     30072         -
#13  Extended offline    Completed without error       00%     29904         -
#14  Extended offline    Completed without error       00%     29736         -
#15  Extended offline    Completed without error       00%     29568         -
#16  Extended offline    Completed without error       00%     29400         -
#17  Extended offline    Completed without error       00%     29232         -
#18  Extended offline    Completed without error       00%     29064         -
#19  Extended offline    Completed without error       00%     28896         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    33 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/37 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (6)

Index    Estimated Time   Temperature Celsius
   7    2020-07-21 10:40    33  **************
   8    2020-07-21 10:41    33  **************
   9    2020-07-21 10:42    32  *************
  10    2020-07-21 10:43    33  **************
  11    2020-07-21 10:44    33  **************
  12    2020-07-21 10:45    32  *************
  13    2020-07-21 10:46    33  **************
  14    2020-07-21 10:47    32  *************
  15    2020-07-21 10:48    32  *************
  16    2020-07-21 10:49    33  **************
  17    2020-07-21 10:50    32  *************
  18    2020-07-21 10:51    33  **************
 ...    ..( 11 skipped).    ..  **************
  30    2020-07-21 11:03    33  **************
  31    2020-07-21 11:04    32  *************
  32    2020-07-21 11:05    33  **************
 ...    ..( 15 skipped).    ..  **************
  48    2020-07-21 11:21    33  **************
  49    2020-07-21 11:22    32  *************
  50    2020-07-21 11:23    33  **************
  51    2020-07-21 11:24    33  **************
  52    2020-07-21 11:25    33  **************
  53    2020-07-21 11:26    32  *************
  54    2020-07-21 11:27    32  *************
  55    2020-07-21 11:28    33  **************
 ...    ..(  2 skipped).    ..  **************
  58    2020-07-21 11:31    33  **************
  59    2020-07-21 11:32    32  *************
  60    2020-07-21 11:33    32  *************
  61    2020-07-21 11:34    33  **************
  62    2020-07-21 11:35    32  *************
  63    2020-07-21 11:36    32  *************
  64    2020-07-21 11:37    33  **************
  65    2020-07-21 11:38    32  *************
  66    2020-07-21 11:39    33  **************
  67    2020-07-21 11:40    32  *************
  68    2020-07-21 11:41    32  *************
  69    2020-07-21 11:42    33  **************
  70    2020-07-21 11:43    32  *************
  71    2020-07-21 11:44    32  *************
  72    2020-07-21 11:45    33  **************
  73    2020-07-21 11:46    33  **************
  74    2020-07-21 11:47    32  *************
  75    2020-07-21 11:48    33  **************
  76    2020-07-21 11:49    32  *************
  77    2020-07-21 11:50    33  **************
 ...    ..( 45 skipped).    ..  **************
 123    2020-07-21 12:36    33  **************
 124    2020-07-21 12:37    34  ***************
 125    2020-07-21 12:38    34  ***************
 126    2020-07-21 12:39    33  **************
 ...    ..(  7 skipped).    ..  **************
   6    2020-07-21 12:47    33  **************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32010  Power-on Hours
  1  0x018  6       7053496316  Logical Sectors Written
  1  0x020  6         30154975  Number of Write Commands
  1  0x028  6     183776882028  Logical Sectors Read
  1  0x030  6        197249758  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32005  Spindle Motor Power-on Hours
  3  0x010  4            32005  Head Flying Hours
  3  0x018  4              648  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4                0  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               33  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               30~ Average Long Term Temperature
  5  0x020  1               37  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               34~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               33~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              175  Number of Hardware Resets
  6  0x010  4              130  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS



# smartctl --xall /dev/sde
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.16.0-38-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Hitachi HUA723020ALA641
Serial Number:    YFG7LWBA
LU WWN Device Id: 5 000cca 223c3757b
Firmware Version: MK7OA840
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Tue Jul 21 12:47:56 2020 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Disabled
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an
interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (20614) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 344) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     PO-R--   100   100   016    -    0
  2 Throughput_Performance  P-S---   134   134   054    -    87
  3 Spin_Up_Time            POS---   100   100   024    -    493
  4 Start_Stop_Count        -O--C-   100   100   000    -    7
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         PO-R--   100   100   067    -    0
  8 Seek_Time_Performance   P-S---   133   133   020    -    27
  9 Power_On_Hours          -O--C-   096   096   000    -    32010
 10 Spin_Retry_Count        PO--C-   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    7
192 Power-Off_Retract_Count -O--CK   100   100   000    -    647
193 Load_Cycle_Count        -O--C-   100   100   000    -    647
194 Temperature_Celsius     -O----   181   181   000    -    33 (Min/Max 23/37)
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O---K   100   100   000    -    0
198 Offline_Uncorrectable   ---R--   100   100   000    -    0
199 UDMA_CRC_Error_Count    -O-R--   200   200   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x04       GPL     R/O      7  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x08       GPL     R/O      2  Power Conditions log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x20       GPL     R/O      1  Streaming performance log [OBS-8]
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x24       GPL     R/O     63  Current Device Internal Status Data log
0x80       GPL     R/W     63  Host vendor specific log
0x81-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
Device Error Count: 12 (device log contains only the most recent 4 errors)
        CR     = Command Register
        FEATR  = Features Register
        COUNT  = Count (was: Sector Count) Register
        LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
        LH     = LBA High (was: Cylinder High) Register    ]   LBA
        LM     = LBA Mid (was: Cylinder Low) Register      ] Register
        LL     = LBA Low (was: Sector Number) Register     ]
        DV     = Device (was: Device/Head) Register
        DC     = Device Control Register
        ER     = Error register
        ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 12 [3] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:36.175  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:36.171  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:36.168  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:36.165  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:36.162  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 11 [2] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:32.148  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:32.143  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:32.140  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:32.137  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:32.133  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 10 [1] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:28.564  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:28.560  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:28.556  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:28.553  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:28.550  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

Error 9 [0] occurred at disk power-on lifetime: 14988 hours (624 days
+ 12 hours)
  When the command that caused the error occurred, the device was
active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 03 ba 00 00 15 53 13 7d 05 00  Error: UNC 954 sectors at
LBA = 0x1553137d = 357766013

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 04 00 00 00 15 53 13 37 e0 08  2d+22:13:24.974  READ DMA EXT
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:24.971  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]
  ec 03 00 00 00 00 00 00 00 00 00 a0 08  2d+22:13:24.967  IDENTIFY DEVICE
  ef 00 03 00 46 e0 88 af 00 00 00 a0 08  2d+22:13:24.967  SET
FEATURES [Set transfer mode]
  27 00 00 00 00 00 00 00 00 00 00 e0 08  2d+22:13:24.963  READ NATIVE
MAX ADDRESS EXT [OBS-ACS-3]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     31921         -
# 2  Extended offline    Completed without error       00%     31753         -
# 3  Extended offline    Completed without error       00%     31585         -
# 4  Extended offline    Completed without error       00%     31417         -
# 5  Extended offline    Completed without error       00%     31249         -
# 6  Extended offline    Completed without error       00%     31081         -
# 7  Extended offline    Completed without error       00%     30913         -
# 8  Extended offline    Completed without error       00%     30745         -
# 9  Extended offline    Completed without error       00%     30576         -
#10  Extended offline    Completed without error       00%     30409         -
#11  Extended offline    Completed without error       00%     30241         -
#12  Extended offline    Completed without error       00%     30073         -
#13  Extended offline    Completed without error       00%     29905         -
#14  Extended offline    Completed without error       00%     29737         -
#15  Extended offline    Completed without error       00%     29569         -
#16  Extended offline    Completed without error       00%     29401         -
#17  Extended offline    Completed without error       00%     29233         -
#18  Extended offline    Completed without error       00%     29065         -
#19  Extended offline    Completed without error       00%     28897         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
SCT Support Level:                   1
Device State:                        SMART Off-line Data Collection
executing in background (4)
Current Temperature:                    33 Celsius
Power Cycle Min/Max Temperature:     27/34 Celsius
Lifetime    Min/Max Temperature:     23/37 Celsius
Under/Over Temperature Limit Count:   0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/70 Celsius
Temperature History Size (Index):    128 (111)

Index    Estimated Time   Temperature Celsius
 112    2020-07-21 10:40    33  **************
 ...    ..(111 skipped).    ..  **************
  96    2020-07-21 12:32    33  **************
  97    2020-07-21 12:33    34  ***************
 ...    ..(  5 skipped).    ..  ***************
 103    2020-07-21 12:39    34  ***************
 104    2020-07-21 12:40    33  **************
 ...    ..(  6 skipped).    ..  **************
 111    2020-07-21 12:47    33  **************

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page Offset Size         Value  Description
  1  =====  =                =  == General Statistics (rev 1) ==
  1  0x008  4                7  Lifetime Power-On Resets
  1  0x010  4            32010  Power-on Hours
  1  0x018  6       7079444176  Logical Sectors Written
  1  0x020  6         32145267  Number of Write Commands
  1  0x028  6     183726144100  Logical Sectors Read
  1  0x030  6        193643146  Number of Read Commands
  3  =====  =                =  == Rotating Media Statistics (rev 1) ==
  3  0x008  4            32005  Spindle Motor Power-on Hours
  3  0x010  4            32005  Head Flying Hours
  3  0x018  4              647  Head Load Events
  3  0x020  4                0  Number of Reallocated Logical Sectors
  3  0x028  4              176  Read Recovery Attempts
  3  0x030  4                0  Number of Mechanical Start Failures
  4  =====  =                =  == General Errors Statistics (rev 1) ==
  4  0x008  4                0  Number of Reported Uncorrectable Errors
  4  0x010  4                0  Resets Between Cmd Acceptance and Completion
  5  =====  =                =  == Temperature Statistics (rev 1) ==
  5  0x008  1               33  Current Temperature
  5  0x010  1               33~ Average Short Term Temperature
  5  0x018  1               31~ Average Long Term Temperature
  5  0x020  1               37  Highest Temperature
  5  0x028  1               23  Lowest Temperature
  5  0x030  1               35~ Highest Average Short Term Temperature
  5  0x038  1               25~ Lowest Average Short Term Temperature
  5  0x040  1               33~ Highest Average Long Term Temperature
  5  0x048  1               25~ Lowest Average Long Term Temperature
  5  0x050  4                0  Time in Over-Temperature
  5  0x058  1               60  Specified Maximum Operating Temperature
  5  0x060  4                0  Time in Under-Temperature
  5  0x068  1                0  Specified Minimum Operating Temperature
  6  =====  =                =  == Transport Statistics (rev 1) ==
  6  0x008  4              184  Number of Hardware Resets
  6  0x010  4              129  Number of ASR Events
  6  0x018  4                0  Number of Interface CRC Errors
                              |_ ~ normalized value

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2           25  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2           22  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS






On Wed, Jul 22, 2020 at 2:14 AM Wols Lists [off-list ref] wrote:
On 22/07/20 08:41, Cory Derenburger wrote:
quoted
My server lost power this morning. The server is running Linux Mint
(14?) on a battery backup and I believe it shutdown before losing
power. Upon restarting the server the computer hung for a while, and
after resetting and booting up in recovery mode my RAID is now
nonfunctional.

The server was set up years ago with a RAID 6 array built with mdadm.
To be honest I don't really know what is wrong with the array, it
seems to be an issue with disk sdc. I wanted to reach out for help to
confirm the issue and get some guidance before proceeding (or making
things worse).

Any assistance that can help me determine what steps to take to get
this server back up and running would be greatly appreciated. It's
been 4+ since I have touched RAID, and only attempted a recovery once.
If anyone can help I would be super appreciative.
https://raid.wiki.kernel.org/index.php/Linux_Raid#When_Things_Go_Wrogn
https://raid.wiki.kernel.org/index.php/Asking_for_help

I see you've included some stuff which is helpful, but can you do
everything that last page asks for. In particular, lsdrv.
quoted
Below I'm including outputs from various commands for the 3rd disk
which seems to be the culprit

dmesg - boot section section where first errors begin occurring
[    2.637856] md: bind<sdd1>
[    2.646987] random: nonblocking pool is initialized
[    2.647432] md: bind<sde1>
[    2.651429] md: bind<sdb1>
[    2.863538] ata3.00: exception Emask 0x0 SAct 0x10 SErr 0x0 action 0x0
[    2.863594] ata3.00: irq_stat 0x40000008
[    2.863643] ata3.00: failed command: READ FPDMA QUEUED
[    2.863695] ata3.00: cmd 60/08:20:08:08:00/00:00:00:00:00/40 tag 4
ncq 4096 in
[    2.863695]          res 41/40:00:09:08:00/00:00:00:00:00/40 Emask
0x409 (media error) <F>
[    2.863775] ata3.00: status: { DRDY ERR }
[    2.863822] ata3.00: error: { UNC }
[    2.873407] ata3.00: configured for UDMA/133
[    2.873476] sd 2:0:0:0: [sdc] Unhandled sense code
[    2.873525] sd 2:0:0:0: [sdc]
[    2.873571] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[    2.873619] sd 2:0:0:0: [sdc]
[    2.873665] Sense Key : Medium Error [current] [descriptor]
[    2.873819] Descriptor sense data with sense descriptors (in hex):
[    2.873901]         72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[    2.874544]         00 00 08 09
[    2.874764] sd 2:0:0:0: [sdc]
[    2.874811] Add. Sense: Unrecovered read error - auto reallocate failed
[    2.874895] sd 2:0:0:0: [sdc] CDB:
[    2.874941] Read(10): 28 00 00 00 08 08 00 00 08 00
[    2.875428] end_request: I/O error, dev sdc, sector 2057
[    2.875478] Buffer I/O error on device sdc1, logical block 1

cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
[raid4] [raid10]
md0 : inactive sdb1[0](S) sde1[3](S) sdd1[2](S)
      5860147464 blocks super 1.2

{not sure why these drives are now showing as spares}
This is very common when an array fails to assemble properly.
Unfortunately, when there's one error, it often triggers a cascade of
fake errors, and this is probably the case here.
quoted
Below running mdstat for sdc.  Checking sdb, sdd, sde appear fine.

mdadm --examine /dev/sdc
/dev/sdc:   MBR Magic : aa55
Partition[0] :   3907027120 sectors at         2048 (type fd)

mdadm --examine /dev/sdc1
mdadm: No md superblock detected on /dev/sdc1.

fdisk -l
Disk /dev/sdb: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x38389fdc

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sdc: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xd108824d

   Device Boot      Start         End      Blocks   Id  System
/dev/sdc1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sdd: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x6207659a

   Device Boot      Start         End      Blocks   Id  System
/dev/sdd1            2048  3907029167  1953513560   fd  Linux raid autodetect

Disk /dev/sde: 2000.4 GB, 2000398934016 bytes
81 heads, 63 sectors/track, 765633 cylinders, total 3907029168 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xd9a4afcf

   Device Boot      Start         End      Blocks   Id  System
/dev/sde1            2048  3907029167  1953513560   fd  Linux raid autodetect


Is there other information needed to determine the issue?  Where do I
go from here?
How old is linux mint? Have you kept it up-to-date? Unfortunately, it
seems a lot of older systems suffer issues when the kernel is heavily
patched and mdadm is not updated, and this regularly surfaces on this
list where Ubuntu is concerned ...

mdadm --version
uname -a

Make sure you have a "latest and greatest" rescue disk to hand, and
we'll see what the others say.

Cheers,
Wol
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help