<div dir="auto"><div dir="auto"></div><div class="gmail_extra"><br><div class="gmail_quote">On Dec 10, 2018 4:47 AM, "Stephen Worthington" <<a href="mailto:stephen_agent@jsw.gen.nz" target="_blank" rel="noreferrer">stephen_agent@jsw.gen.nz</a>> wrote:<br type="attribution"><blockquote class="m_-8752919662836093806quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="m_-8752919662836093806elided-text">On Sun, 9 Dec 2018 22:42:34 -0500, you wrote:<br>
<br>
>Normal use is 5 or 6 hours in the evening recording and watching<br>
>recordings, on Sunday I start recording NFL at 1 and at some point<br>
>thereafter my disk "storage2" drops out and reconnects. I've had it<br>
>connected to different sata ports on the mobo but always "storage2" not the<br>
>other two drives "storage1&3". A replacement mobo is on the way, but does<br>
>this log indicate that it will solve the problem?<br>
><br>
><a href="http://paste.ubuntu.com/p/Rrnd3VmsNT/" rel="noreferrer noreferrer noreferrer" target="_blank">http://paste.ubuntu.com/p/Rrnd3VmsNT/</a><br>
><br>
>the disk dropped in and out around 7 thru 10 pm<br>
>TIA Daryl<br>
<br></div>
Even when /dev/sdb does connect, it only connects at 1.5 Mbit/s, the<br>
slowest possible SATA speed. Which is too slow for a modern drive to<br>
perform properly. When it connects, it has lots of retries first. So<br>
my first guess from those logs would be a bad SATA cable - those<br>
errors look similar to what I have had with a bad cable. Try swapping<br>
that drive to the SATA and power cable you are using for one of the<br>
good drives and see if that helps. If the problem stays with the<br>
cables, you know what the problem is. If it stays with the drive,<br>
then it is likely faulty electronics on the drive.<br>
<br>
This does not look good either:<br>
<br>
Device: /dev/sdc [SAT], SMART Usage Attribute: 194 Temperature_Celsius<br>
changed from 253 to 171<br>
<br>
If those numbers are real, then that hard drive is so far over its<br>
maximum temperature limit that it is likely dead - it is way over the<br>
safe non-operational temperature for a hard drive. I would expect the<br>
bearings to seize at that sort of temperature. But before believing<br>
those temperature numbers, try running this command:<br>
<br>
sudo update-smart-drivedb<br>
<br>
to get the latest smartd database. If that drive is fairly new, it<br>
may be that it is not in the existing database and SMART is not<br>
reading its temperature correctly.<br>
<br>
And then there are these too:<br>
<br>
Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius<br>
changed from 74 to 66<br>
<br>
The maximum temperature specifications for drives varies, usually 60,<br>
65 or 70 degrees Celsius, so it might be within specifications, just.<br>
But it is not a good idea to be running a drive that hot - you would<br>
expect it to have a shorter lifetime. You really want all the drive<br>
temperatures to stay well below 50 degrees Celsius for most of the<br>
life of the drive.<br>
<br>
<br>
Here are all the temperature reports I filtered out of the log:<br>
<br>
[P:\]grep "/dev/sda" log.txt<br>
Dec 9 13:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 74 to 66<br>
Dec 9 13:50:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 66 to 64<br>
Dec 9 14:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 64 to 63<br>
Dec 9 14:50:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 63 to 62<br>
Dec 9 15:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 62 to 61<br>
Dec 9 17:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 61 to 62<br>
Dec 9 19:20:52 trieli smartd[875]: Device: /dev/sda [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 62 to 63<br>
<br>
[P:\]grep "/dev/sdb" log.txt | grep 194<br>
Dec 9 13:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 22 to 37<br>
Dec 9 13:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 37 to 40<br>
Dec 9 14:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 40 to 42<br>
Dec 9 14:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43<br>
Dec 9 15:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 43 to 44<br>
Dec 9 16:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 44 to 43<br>
Dec 9 17:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42<br>
Dec 9 18:20:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43<br>
Dec 9 18:50:52 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42<br>
Dec 9 19:50:53 trieli smartd[875]: Device: /dev/sdb [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41<br>
<br>
[P:\]grep "/dev/sdc" log.txt<br>
Dec 9 13:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 253 to 171<br>
Dec 9 13:50:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 171 to 162<br>
Dec 9 14:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 162 to 157<br>
Dec 9 14:50:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 157 to 153<br>
Dec 9 15:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 153 to 150<br>
Dec 9 17:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 150 to 153<br>
Dec 9 19:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 153 to 157<br>
Dec 9 21:20:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 157 to 153<br>
Dec 9 21:50:53 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 153 to 150<br>
Dec 9 22:20:52 trieli smartd[875]: Device: /dev/sdc [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 150 to 153<br>
<br>
[P:\]grep "/dev/sdd" log.txt | grep 194<br>
Dec 9 13:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 22 to 38<br>
Dec 9 13:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 38 to 40<br>
Dec 9 14:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 40 to 41<br>
Dec 9 14:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42<br>
Dec 9 15:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 44<br>
Dec 9 16:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 44 to 43<br>
Dec 9 17:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42<br>
Dec 9 18:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 43<br>
Dec 9 18:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 43 to 42<br>
Dec 9 20:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41<br>
Dec 9 21:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42<br>
Dec 9 21:50:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 42 to 41<br>
Dec 9 22:20:53 trieli smartd[875]: Device: /dev/sdd [SAT], SMART<br>
Usage Attribute: 194 Temperature_Celsius changed from 41 to 42<br>
<br>
The /dev/sdb temperatures (where it was able to report them), and the<br>
/dev/sdd temperatures are following an expected pattern - it looks<br>
like the PC was off before the start of the log you posted, and the<br>
drives were at the ambient temperature of the room (22 deg. C). Then<br>
they climb to a normal operational temperature over time, and after<br>
that hover around that temperature, going up a bit when they are busy<br>
(or the room temperature changes).<br>
<br>
I do not understand the /dev/sda temperatures. It initially drops<br>
rapidly, as though a fan has come on, and then stabilizes around a<br>
very high temperature. The ridiculous numbers for /dev/sdc follow the<br>
same pattern as /dev/sda.d<div class="m_-8752919662836093806elided-text"><br>
_______________________________________________<br>
mythtv-users mailing list<br>
<a href="mailto:mythtv-users@mythtv.org" rel="noreferrer noreferrer" target="_blank">mythtv-users@mythtv.org</a><br>
<a href="http://lists.mythtv.org/mailman/listinfo/mythtv-users" rel="noreferrer noreferrer noreferrer" target="_blank">http://lists.mythtv.org/mailman/listinfo/mythtv-users</a><br>
<a href="http://wiki.mythtv.org/Mailing_List_etiquette" rel="noreferrer noreferrer noreferrer" target="_blank">http://wiki.mythtv.org/Mailing_List_etiquette</a><br>
MythTV Forums: <a href="https://forum.mythtv.org" rel="noreferrer noreferrer noreferrer" target="_blank">https://forum.mythtv.or</a>g<br></div></blockquote></div></div><div dir="auto"><br></div><div class="gmail_extra" dir="auto"><div class="gmail_quote"><blockquote class="m_-8752919662836093806quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="m_-8752919662836093806elided-text">
</div></blockquote></div>I have changed sata cables, and sata ports, and it is only "storage2" that drops out, and only after extended use, but its a brand new drive, a lemon, maybe? I bought it because I received a warning of eminent failure.</div><div class="gmail_extra" dir="auto">Last Sunday, same problem, but no test switching of cables or ports and Monday thru Saturday operation just fine. So I'm leaning towards it being a heat issue.</div><div class="gmail_extra" dir="auto">I have one SSD for the OS and three HD storage drives, "disks" shows the temp of the ssd at 99 degrees celcius, and the others range between 35 and 44. All drives report as "OK" too.</div><div class="gmail_extra" dir="auto">Yes, the system powered on just before the syslog. I will try the other suggstions when I return home.</div></div>