Re: kernel: swap_pager: indefinite wait buffer - on 5.3-RELEASE-p5

From: Zoltan Frombach (tssajo_at_hotmail.com)
Date: 04/29/05

    To: <freebsd-stable@freebsd.org>, "Mike Tancsa" <mike@sentex.net>
    Date: Fri, 29 Apr 2005 14:02:37 -0700
    
    

    Mike,

    Thank you so much for the information you have provided. I installed
    smartmontools from ports, what a great tool!

    However, I see 0 (zero) for Reallocated_Sector_Ct, so the hard drive seems to
    be okay? I have attached the full output below. Is there someone knowledgeable
    about HD SMART information who can possibly tell me whether this drive looks
    like it is failing?

    # smartctl -a ad0
    smartctl version 5.32 Copyright (C) 2002-4 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF INFORMATION SECTION ===
    Device Model: Maxtor 6Y080L0
    Serial Number: Y3H1TJKE
    Firmware Version: YAR41VW0
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: 7
    ATA Standard is: ATA/ATAPI-7 T13 1532D revision 0
    Local Time is: Fri Apr 29 13:52:56 2005 PDT
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    The SMART RETURN STATUS return value (smartmontools -H option/Directive)
    can not be retrieved with this version of ATAng, please do not rely on this value
    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x80) Offline data collection activity
                                            was never started.
                                            Auto Offline Data Collection: Enabled.
    Self-test execution status: ( 0) The previous self-test routine completed
                                            without error or no self-test has ever
                                            been run.
    Total time to complete Offline
    data collection: ( 242) seconds.
    Offline data collection
    capabilities: (0x5b) SMART execute Offline immediate.
                                            Auto Offline data collection on/off support.
                                            Suspend Offline collection upon new
                                            command.
                                            Offline surface scan supported.
                                            Self-test supported.
                                            No Conveyance Self-test supported.
                                            Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
                                            power-saving mode.
                                            Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
                                            No General Purpose Logging support.
    Short self-test routine
    recommended polling time: ( 2) minutes.
    Extended self-test routine
    recommended polling time: ( 36) minutes.

    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      3 Spin_Up_Time            0x0027   203   203   063    Pre-fail  Always       -       12682
      4 Start_Stop_Count        0x0032   253   253   000    Old_age   Always       -       61
      5 Reallocated_Sector_Ct   0x0033   253   253   063    Pre-fail  Always       -       0
      6 Read_Channel_Margin     0x0001   253   253   100    Pre-fail  Offline      -       0
      7 Seek_Error_Rate         0x000a   253   252   000    Old_age   Always       -       0
      8 Seek_Time_Performance   0x0027   253   248   187    Pre-fail  Always       -       36017
      9 Power_On_Minutes        0x0032   219   219   000    Old_age   Always       -       1023h+58m
     10 Spin_Retry_Count        0x002b   253   252   157    Pre-fail  Always       -       0
     11 Calibration_Retry_Count 0x002b   253   252   223    Pre-fail  Always       -       0
     12 Power_Cycle_Count       0x0032   253   253   000    Old_age   Always       -       177
    192 Power-Off_Retract_Count 0x0032   253   253   000    Old_age   Always       -       0
    193 Load_Cycle_Count        0x0032   253   253   000    Old_age   Always       -       0
    194 Temperature_Celsius     0x0032   253   253   000    Old_age   Always       -       31
    195 Hardware_ECC_Recovered  0x000a   253   252   000    Old_age   Always       -       2976
    196 Reallocated_Event_Count 0x0008   253   253   000    Old_age   Offline      -       0
    197 Current_Pending_Sector  0x0008   253   253   000    Old_age   Offline      -       0
    198 Offline_Uncorrectable   0x0008   253   253   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0008   199   199   000    Old_age   Offline      -       0
    200 Multi_Zone_Error_Rate   0x000a   253   252   000    Old_age   Always       -       0
    201 Soft_Read_Error_Rate    0x000a   253   252   000    Old_age   Always       -       12
    202 TA_Increase_Count       0x000a   253   252   000    Old_age   Always       -       0
    203 Run_Out_Cancel          0x000b   253   252   180    Pre-fail  Always       -       1
    204 Shock_Count_Write_Opern 0x000a   253   252   000    Old_age   Always       -       0
    205 Shock_Rate_Write_Opern  0x000a   253   252   000    Old_age   Always       -       0
    207 Spin_High_Current       0x002a   253   252   000    Old_age   Always       -       0
    208 Spin_Buzz               0x002a   253   252   000    Old_age   Always       -       0
    209 Offline_Seek_Performnce 0x0024   200   200   000    Old_age   Offline      -       0
     99 Unknown_Attribute       0x0004   253   253   000    Old_age   Offline      -       0
    100 Unknown_Attribute       0x0004   253   253   000    Old_age   Offline      -       0
    101 Unknown_Attribute       0x0004   253   253   000    Old_age   Offline      -       0

    SMART Error Log Version: 1
    ATA Error Count: 1
            CR = Command Register [HEX]
            FR = Features Register [HEX]
            SC = Sector Count Register [HEX]
            SN = Sector Number Register [HEX]
            CL = Cylinder Low Register [HEX]
            CH = Cylinder High Register [HEX]
            DH = Device/Head Register [HEX]
            DC = Device Command Register [HEX]
            ER = Error register [HEX]
            ST = Status register [HEX]
    Powered_Up_Time is measured from power on, and printed as
    DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
    SS=sec, and sss=millisec. It "wraps" after 49.710 days.

    Error 1 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
      When the command that caused the error occurred, the device was in an unknown state.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      04 51 50 40 97 03 10 Error: ABRT

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
      -- -- -- -- -- -- -- -- ---------------- --------------------
      ef fe 00 00 00 00 10 00 00:05:42.512 SET FEATURES [Reserved for CFA]
      c3 3d 00 00 00 00 10 00 00:05:42.464 [VENDOR SPECIFIC]
      c3 e4 00 00 00 00 10 00 00:05:42.432 [VENDOR SPECIFIC]
      c3 3d 00 00 00 00 10 00 00:05:42.432 [VENDOR SPECIFIC]
      70 00 00 00 5e 20 10 00 00:05:42.368 SEEK [OBS-7]

    SMART Self-test log structure revision number 1
    No self-tests have been logged. [To run self-tests, use: smartctl -t]

    SMART Selective self-test log data structure revision number 1
     SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
        1 0 0 Not_testing
        2 0 0 Not_testing
        3 0 0 Not_testing
        4 0 0 Not_testing
        5 0 0 Not_testing
    Selective self-test flags (0x0):
      After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
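
    For anyone who wants to check the attribute table above mechanically
    rather than by eye, here is a rough Python sketch. It assumes each
    attribute row has been unwrapped onto a single line with the UPDATED
    column written out in full (Always or Offline); the sample rows are
    copied from the output above in that form. The function names and
    the choice of which raw counters to watch are my own assumptions,
    not anything smartctl itself provides.

```python
import re

# Sample rows taken from the smartctl output above, one attribute per line.
# Assumption: the mail-wrapped rows have been rejoined by hand.
SAMPLE = """\
  3 Spin_Up_Time            0x0027 203 203 063 Pre-fail Always  - 12682
  5 Reallocated_Sector_Ct   0x0033 253 253 063 Pre-fail Always  - 0
  9 Power_On_Minutes        0x0032 219 219 000 Old_age  Always  - 1023h+58m
197 Current_Pending_Sector  0x0008 253 253 000 Old_age  Offline - 0
198 Offline_Uncorrectable   0x0008 253 253 000 Old_age  Offline - 0
"""

ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flag>0x[0-9a-fA-F]+)\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<type>\S+)\s+(?P<updated>\S+)\s+(?P<when_failed>\S+)\s+(?P<raw>\S+)\s*$"
)

# Hypothetical watch list: raw sector counters that are often read as an
# early warning when they are non-zero. This is a heuristic, not a rule.
SECTOR_COUNTERS = {
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
}


def parse_attributes(text):
    """Return one dict per attribute row; lines that do not match are skipped."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m is None:
            continue
        row = m.groupdict()
        for key in ("id", "value", "worst", "thresh"):
            row[key] = int(row[key])
        rows.append(row)
    return rows


def below_threshold(rows):
    """Names of attributes whose normalized VALUE is at or below THRESH.

    Rows with a threshold of 0 are ignored, since they can never trip.
    """
    return [r["name"] for r in rows if r["thresh"] > 0 and r["value"] <= r["thresh"]]


def nonzero_sector_counters(rows):
    """Names of watched sector counters whose RAW_VALUE is a non-zero integer."""
    return [
        r["name"]
        for r in rows
        if r["name"] in SECTOR_COUNTERS and r["raw"].isdigit() and int(r["raw"]) > 0
    ]


if __name__ == "__main__":
    rows = parse_attributes(SAMPLE)
    print("at or below threshold:", below_threshold(rows))
    print("non-zero sector counters:", nonzero_sector_counters(rows))
```

    For the sample rows both lists come back empty, which matches what
    the table shows: no attribute is at its threshold and the sector
    counters are all 0. That by itself does not prove the drive is
    healthy; it only says SMART is not reporting a problem.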

    Thank you for looking at it!! Please respond if you have any comments about
    the above output. Thanks a lot!!

    Zoltan

    ----- Original Message -----

    > At 06:23 AM 29/04/2005, Zoltan Frombach wrote:
    >>Apr 29 02:10:24 www kernel: swap_pager: indefinite wait buffer: device:
    >>ad0s1e, blkno: 329842, size: 4096
    >
    > As others have said, it looks to be a hard drive about to die. You can
    > get more info with /usr/ports/sysutils/smartmontools/. It can give some
    > simple diagnostics that might confirm this for you.
    >
    > ---Mike
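
    To see how often the swap_pager warning quoted above shows up, and
    against which device, a small Python sketch like the following could
    be run over the system log. It assumes the kernel message sits on a
    single line in the log (it is only wrapped in the mail above) and
    that the wording is exactly as quoted; the function names are made
    up for illustration.

```python
import re
from collections import Counter

# The message quoted above, rejoined onto one line.
LINE = (
    "Apr 29 02:10:24 www kernel: swap_pager: indefinite wait buffer: "
    "device: ad0s1e, blkno: 329842, size: 4096"
)

PATTERN = re.compile(
    r"swap_pager: indefinite wait buffer: device: (?P<device>\S+?), "
    r"blkno: (?P<blkno>\d+), size: (?P<size>\d+)"
)


def parse_swap_pager(line):
    """Return device, block number and size from one log line, or None."""
    m = PATTERN.search(line)
    if m is None:
        return None
    return {
        "device": m.group("device"),
        "blkno": int(m.group("blkno")),
        "size": int(m.group("size")),
    }


def count_by_device(lines):
    """Count swap_pager warnings per device across an iterable of log lines."""
    counts = Counter()
    for line in lines:
        record = parse_swap_pager(line)
        if record is not None:
            counts[record["device"]] += 1
    return counts


if __name__ == "__main__":
    print(parse_swap_pager(LINE))
    print(count_by_device([LINE, "unrelated log line", LINE]))
```

    A growing count on one device, or warnings clustered around the same
    blkno values, would be worth comparing against the SMART data.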
    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"

