Re: kernel: swap_pager: indefinite wait buffer - on 5.3-RELEASE-p5

From: Uwe Doering (gemini_at_geminix.org)
Date: 04/30/05

    Date: Sat, 30 Apr 2005 08:28:07 +0200
    To: freebsd-stable@FreeBSD.ORG, tssajo@hotmail.com
    
    

    Oliver Fromme wrote:
    > Zoltan Frombach <tssajo@hotmail.com> wrote:
    > > Apr 29 02:10:14 www kernel: swap_pager: indefinite wait buffer: device:
    > > ad0s1a, blkno: 328636, size: 8192
    > > Apr 29 02:10:24 www kernel: swap_pager: indefinite wait buffer: device:
    > > ad0s1e, blkno: 329842, size: 4096
    > > [...]
    >
    > The error message indicates that there was an I/O error
    > accessing the swap area on your disk. Usually that's an
    > indication for a hardware failure, e.g. a dying disk.
    >
    > > I happen to have an identical hard drive around here, unused. If I hook it
    > > up as a slave (IDE) drive, is there a way I can mirror the dying drive to
    > > the spare one (with all partitions, etc, intact)?
    >
    > If they're really identical (i.e. the same size and same
    > geometry), then you can use dd(1) for duplication, like
    > this:
    >
    > # dd if=/dev/ad0 of=/dev/ad1 bs=64k conv=noerror,sync
    >
    > The "noerror,sync" part is important so the dd command will
    > not stop when it hits any bad spots on the source drive and
    > instead will fill the blocks with zeroes on the destination
    > drive. Since it's only the swap partition, you shouldn't
    > lose any data.

    I would like to point out that the conclusion you're drawing in the last
    sentence is invalid IMHO. "indefinite wait buffer" messages at
    apparently random block numbers just indicate that the pager was unable
    to access the swap area (in its entirety!) when it wanted to. It means
    that the disk drive was either dead at that point in time or busy trying
    to deal with a bad sector.

    This sector could have been anywhere on the disk. It just kept the disk
    drive busy for long enough that the pager started to complain. Since
    the swap area is usually only a small portion of the disk, it is much
    more likely that the bad sector is located in a filesystem. So if you
    copy the disk and ignore I/O errors in this situation you _do_ run a
    very real risk of losing data! Unfortunately, you can't do much about
    it, but you should at least be aware of it.
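
One way to at least be aware of where data was lost is to record which blocks could not be read during the copy, then check which area of the disk they fall into. The following is only a rough sketch of that idea in Python, not a replacement for dd(1). It mimics the spirit of "conv=noerror,sync" (unreadable blocks are written as zeroes) while keeping a list of the failed offsets. The function names `rescue_copy` and `locate`, and the layout table passed to `locate`, are invented for illustration; the caller is assumed to know the total size and the real partition boundaries.

```python
from typing import List, Optional, Tuple

BLOCK_SIZE = 64 * 1024  # same block size as the dd example above (bs=64k)


def rescue_copy(src, dst, total_size: int,
                block_size: int = BLOCK_SIZE) -> List[int]:
    """Copy total_size bytes from src to dst, one block at a time.

    src must provide seek() and read(); dst must provide write().
    A block whose read raises OSError is written as zeroes and its
    starting byte offset is recorded. Returns the list of such offsets.
    """
    bad_offsets: List[int] = []
    offset = 0
    while offset < total_size:
        want = min(block_size, total_size - offset)
        try:
            src.seek(offset)
            data = src.read(want)
        except OSError:
            # Unreadable block: remember where it was and copy nothing.
            data = b""
            bad_offsets.append(offset)
        # Pad failed (or short) reads with zeroes so that every later
        # block lands at the same offset on the destination.
        dst.write(data.ljust(want, b"\0"))
        offset += want
    return bad_offsets


def locate(offset: int,
           layout: List[Tuple[str, int, int]]) -> Optional[str]:
    """Return the name of the area containing the given byte offset.

    layout is a caller-supplied list of (name, start_byte, end_byte)
    tuples with end_byte exclusive. It is a hypothetical description
    of the disk; nothing here reads a real disk label.
    """
    for name, start, end in layout:
        if start <= offset < end:
            return name
    return None
```

If every offset returned by `rescue_copy` maps to the swap area, the copy lost nothing of value. Any offset that maps to a filesystem marks a block that is now zero-filled on the new drive. Note that, as with dd, a whole block is given up even if only one sector inside it is bad, so a smaller block size narrows down the damage at the cost of a slower copy.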

        Uwe

    -- 
    Uwe Doering         |  EscapeBox - Managed On-Demand UNIX Servers
    gemini@geminix.org  |  http://www.escapebox.net
    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"
    
