[mythtv-users] only on Sunday

Stephen Worthington stephen_agent at jsw.gen.nz
Mon Dec 10 09:46:59 UTC 2018


On Sun, 9 Dec 2018 22:42:34 -0500, you wrote:

>Normal use is 5 or 6 hours in the evening recording and watching
>recordings, on Sunday I start recording NFL at 1 and at some point
>thereafter my disk "storage2" drops out and reconnects. I've had it
>connected to different sata ports on the mobo but always "storage2" not the
>other two drives "storage1&3". A replacement mobo is on the way, but does
>this log indicate that it will solve the problem?
>
>http://paste.ubuntu.com/p/Rrnd3VmsNT/
>
>the disk dropped in and out around 7 thru 10 pm
>TIA  Daryl

Even when /dev/sdb does connect, it only connects at 1.5 Gbit/s, the
slowest possible SATA speed, which is too slow for a modern drive to
perform properly.  When it connects, it has lots of retries first.  So
my first guess from those logs would be a bad SATA cable - those
errors look similar to what I have had with a bad cable.  Try swapping
that drive to the SATA and power cable you are using for one of the
good drives and see if that helps.  If the problem stays with the
cables, you know what the problem is.  If it stays with the drive,
then it is likely faulty electronics on the drive.
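
If you would rather check the negotiated link speed of every port
with a script than read the log by eye, here is a rough Python
sketch.  It assumes the kernel's usual "SATA link up N Gbps" message.
The function name slow_links, the 3.0 Gbit/s cut-off and the sample
lines are my own illustration, not taken from your log.

```python
import re

# Matches the kernel's libata link message, for example:
# "ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)"
LINK_UP = re.compile(r"(ata\d+): SATA link up ([\d.]+) Gbps")


def slow_links(kernel_log_text, minimum_gbps=3.0):
    """Return (port, speed) pairs for links that came up below minimum_gbps.

    minimum_gbps is an assumed cut-off, chosen only for illustration.
    """
    slow = []
    for match in LINK_UP.finditer(kernel_log_text):
        port, speed = match.group(1), float(match.group(2))
        if speed < minimum_gbps:
            slow.append((port, speed))
    return slow


# Made-up sample lines in the usual kernel format (not from the posted log).
sample = """\
ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
"""
print(slow_links(sample))
```

Feed it the text of the kernel log (for example the output of dmesg)
and any port it lists is worth a cable swap.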

This does not look good either:

Device: /dev/sdc [SAT], SMART Usage Attribute: 194 Temperature_Celsius
changed from 253 to 171

If those numbers are real, then that hard drive is so far over its
maximum temperature limit that it is likely dead - it is way over the
safe non-operational temperature for a hard drive.  I would expect the
bearings to seize at that sort of temperature.  But before believing
those temperature numbers, try running this command:

sudo update-smart-drivedb

to get the latest smartmontools drive database.  If that drive is
fairly new, it may be that it is not in the existing database and
SMART is not reading its temperature correctly.

And then there are these too:

Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius
changed from 74 to 66

The maximum temperature specifications for drives vary, usually 60,
65 or 70 degrees Celsius, so it might be within specifications, just.
But it is not a good idea to be running a drive that hot - you would
expect it to have a shorter lifetime.  You really want all the drive
temperatures to stay well below 50 degrees Celsius for most of the
life of the drive.
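
As a rough illustration of those thresholds, here is a small Python
sketch.  The function name and the three labels are my own invention.
The 50 and 60 degree defaults come from the figures above, and 60 is
only the lowest of the common maximums, so check the data sheet for
the actual drive before trusting the last label.

```python
def classify_temperature(celsius, comfortable_below=50, lowest_common_maximum=60):
    """Roughly grade a drive temperature given in degrees Celsius.

    The defaults are assumptions: 50 is the "stay well below" figure and
    60 is the lowest of the usual 60/65/70 maximum ratings.
    """
    if celsius < comfortable_below:
        return "ok"
    if celsius <= lowest_common_maximum:
        return "hot"
    return "may exceed maximum"


for reading in (42, 55, 66):
    print(reading, classify_temperature(reading))
```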


Here are all the temperature reports I filtered out of the log:

[P:\]grep "/dev/sda" log.txt
Dec  9 13:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 74 to 66
Dec  9 13:50:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 66 to 64
Dec  9 14:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 64 to 63
Dec  9 14:50:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 63 to 62
Dec  9 15:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 62 to 61
Dec  9 17:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 61 to 62
Dec  9 19:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 62 to 63

[P:\]grep "/dev/sdb" log.txt | grep 194
Dec  9 13:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 22 to 37
Dec  9 13:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 37 to 40
Dec  9 14:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 40 to 42
Dec  9 14:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43
Dec  9 15:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 43 to 44
Dec  9 16:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 44 to 43
Dec  9 17:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42
Dec  9 18:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43
Dec  9 18:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42
Dec  9 19:50:53 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41

[P:\]grep "/dev/sdc" log.txt
Dec  9 13:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 253 to 171
Dec  9 13:50:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 171 to 162
Dec  9 14:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 162 to 157
Dec  9 14:50:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 157 to 153
Dec  9 15:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 153 to 150
Dec  9 17:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 150 to 153
Dec  9 19:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 153 to 157
Dec  9 21:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 157 to 153
Dec  9 21:50:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 153 to 150
Dec  9 22:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 150 to 153

[P:\]grep "/dev/sdd" log.txt | grep 194
Dec  9 13:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 22 to 38
Dec  9 13:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 38 to 40
Dec  9 14:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 40 to 41
Dec  9 14:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42
Dec  9 15:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 44
Dec  9 16:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 44 to 43
Dec  9 17:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42
Dec  9 18:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43
Dec  9 18:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42
Dec  9 20:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41
Dec  9 21:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42
Dec  9 21:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41
Dec  9 22:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42
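
If you want to do the same filtering in a script instead of with
grep, something like this Python sketch should work.  It is only a
sketch: the function name temperature_series is hypothetical, and the
pattern assumes the smartd message format shown in the lines above
(it allows for the line wrapping that email adds).

```python
import re
from collections import defaultdict

# smartd's report for attribute 194; \s+ tolerates lines wrapped by email.
TEMP_CHANGE = re.compile(
    r"Device: (/dev/\w+) \[SAT\], SMART\s+Usage Attribute: 194 "
    r"Temperature_Celsius\s+changed from (\d+) to (\d+)"
)


def temperature_series(log_text):
    """Map each device to the list of attribute 194 values it reported."""
    series = defaultdict(list)
    for device, old, new in TEMP_CHANGE.findall(log_text):
        if not series[device]:
            # The first report also tells us the value before the change.
            series[device].append(int(old))
        series[device].append(int(new))
    return dict(series)


# A few of the lines quoted above, wrapped the same way.
sample = """\
Dec  9 13:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 74 to 66
Dec  9 13:50:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 66 to 64
Dec  9 13:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 22 to 37
"""
for device, values in temperature_series(sample).items():
    print(device, values)
```

Having the values as a list per drive makes it easy to spot the
highest reading or to plot the trend.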

The /dev/sdb temperatures (where it was able to report them), and the
/dev/sdd temperatures are following an expected pattern - it looks
like the PC was off before the start of the log you posted, and the
drives were at the ambient temperature of the room (22 deg. C).  Then
they climb to a normal operational temperature over time, and after
that hover around that temperature, going up a bit when they are busy
(or the room temperature changes).

I do not understand the /dev/sda temperatures.  They initially drop
rapidly, as though a fan has come on, and then stabilize around a
very high temperature.  The ridiculous numbers for /dev/sdc follow the
same pattern as /dev/sda.

