<br><br><div class="gmail_quote">On 24 February 2011 04:13, John Drescher <span dir="ltr"><<a href="mailto:drescherjm@gmail.com">drescherjm@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
On Wed, Feb 23, 2011 at 1:07 PM, John Drescher <<a href="mailto:drescherjm@gmail.com">drescherjm@gmail.com</a>> wrote:<br>
> On Wed, Feb 23, 2011 at 12:53 PM, Rob Smith <<a href="mailto:kormoc@mythtv.org">kormoc@mythtv.org</a>> wrote:<br>
>> On Wed, Feb 23, 2011 at 9:20 AM, John Drescher <<a href="mailto:drescherjm@gmail.com">drescherjm@gmail.com</a>> wrote:<br>
>>> I use it for Seagate drives a lot. Most of the important values are san<br>
>><br>
>> Given they claim they don't support general SMART standards, nor do<br>
>> they give any documentation on what models use what ids for what<br>
>> fields, how do you know that those numbers are sane?<br>
>><br>
> By observation. I have dozens of these drives at work.<br>
><br>
<br>
The counts look to be correct.<br>
<br>
I mean 05 - Reallocated Sectors Count raw value is correct. So is<br>
Reallocation Event Count, Current Pending Sector Count, Uncorrectable<br>
Sector Count, UltraDMA CRC Error Count. Some other values are<br>
definitely not standard but the important counts are correct.<br>
<br>
Here is an example that tells me the drive had 71 reallocated sectors.<br>
<br>
datastore2 ~ # smartctl --all /dev/sda<br>
smartctl version 5.38 [x86_64-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen<br>
Home page is <a href="http://smartmontools.sourceforge.net/" target="_blank">http://smartmontools.sourceforge.net/</a><br>
<br>
=== START OF INFORMATION SECTION ===<br>
Model Family: Seagate Barracuda 7200.11<br>
Device Model: ST3750330AS<br>
Serial Number: 3QK086XB<br>
Firmware Version: SD15<br>
User Capacity: 750,156,374,016 bytes<br>
Device is: In smartctl database [for details use: -P show]<br>
ATA Version is: 8<br>
ATA Standard is: ATA-8-ACS revision 4<br>
Local Time is: Wed Feb 23 08:15:37 2011 EST<br>
SMART support is: Available - device has SMART capability.<br>
SMART support is: Enabled<br>
<br>
=== START OF READ SMART DATA SECTION ===<br>
SMART overall-health self-assessment test result: PASSED<br>
<br>
General SMART Values:<br>
Offline data collection status: (0x82) Offline data collection activity<br>
was completed without error.<br>
Auto Offline Data Collection: Enabled.<br>
Self-test execution status: ( 0) The previous self-test routine completed<br>
without error or no self-test has ever<br>
been run.<br>
Total time to complete Offline<br>
data collection: ( 634) seconds.<br>
Offline data collection<br>
capabilities: (0x7b) SMART execute Offline immediate.<br>
Auto Offline data collection<br>
on/off support.<br>
Suspend Offline collection upon new<br>
command.<br>
Offline surface scan supported.<br>
Self-test supported.<br>
Conveyance Self-test supported.<br>
Selective Self-test supported.<br>
SMART capabilities: (0x0003) Saves SMART data before entering<br>
power-saving mode.<br>
Supports SMART auto save timer.<br>
Error logging capability: (0x01) Error logging supported.<br>
General Purpose Logging supported.<br>
Short self-test routine<br>
recommended polling time: ( 1) minutes.<br>
Extended self-test routine<br>
recommended polling time: ( 170) minutes.<br>
Conveyance self-test routine<br>
recommended polling time: ( 2) minutes.<br>
SCT capabilities: (0x103b) SCT Status supported.<br>
SCT Feature Control supported.<br>
SCT Data Table supported.<br>
<br>
SMART Attributes Data Structure revision number: 10<br>
Vendor Specific SMART Attributes with Thresholds:<br>
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE<br>
UPDATED WHEN_FAILED RAW_VALUE<br>
1 Raw_Read_Error_Rate 0x000f 109 099 006 Pre-fail<br>
Always - 25116628<br>
3 Spin_Up_Time 0x0003 093 093 000 Pre-fail<br>
Always - 0<br>
4 Start_Stop_Count 0x0032 100 100 020 Old_age<br>
Always - 21<br>
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail<br>
Always - 71<br>
7 Seek_Error_Rate 0x000f 075 060 030 Pre-fail<br>
Always - 34616371547<br>
9 Power_On_Hours 0x0032 072 072 000 Old_age<br>
Always - 24831<br>
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail<br>
Always - 0<br>
12 Power_Cycle_Count 0x0032 100 037 020 Old_age<br>
Always - 19<br>
184 Unknown_Attribute 0x0032 100 100 099 Old_age<br>
Always - 0<br>
187 Reported_Uncorrect 0x0032 001 001 000 Old_age<br>
Always - 618<br>
188 Unknown_Attribute 0x0032 100 094 000 Old_age<br>
Always - 176<br>
189 High_Fly_Writes 0x003a 001 001 000 Old_age<br>
Always - 536<br>
190 Airflow_Temperature_Cel 0x0022 059 052 045 Old_age<br>
Always - 41 (Lifetime Min/Max 39/48)<br>
194 Temperature_Celsius 0x0022 041 048 000 Old_age<br>
Always - 41 (0 19 0 0)<br>
195 Hardware_ECC_Recovered 0x001a 026 017 000 Old_age<br>
Always - 25116628<br>
197 Current_Pending_Sector 0x0012 100 100 000 Old_age<br>
Always - 0<br>
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age<br>
Offline - 0<br>
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age<br>
Always - 0<br>
<br>
<br>
John<br>
<br></blockquote><div><br>Does anyone have any cacti graph templates and scripts to monitor these values?<br><br>I've only found ones to monitor the temp of each drive<br><br>Anthony <br></div></div><br>