ad WRITE_DMA timing out frequently

From: Reid Linnemann (lreid_at_cs.okstate.edu)
Date: 02/18/05

  • Next message: Eric Anderson: "Re: newfs limits? 10TB filesystem max?"
    To: freebsd-current@freebsd.org
    Date: Fri, 18 Feb 2005 09:03:35 -0600 (CST)
    
    

    I've recently brought a machine up from 5.3-STABLE to 6-CURRENT. It
    usually just sits in the corner and runs services, but lately I've come
    home form work or woken up to find that it is completely unresponsive,
    and I have to hard reset the machine. It happens at least once a day,
    and it's becoming more and more frequent. When I look at the console, I
    always have the same 4 messages before the failure:

    ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=2085599
    ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=2085599
     kernel: ad0: FAILURE - WRITE_DMA timed out
    kernel: g_vfs_done():ad0s1d[WRITE(offset=52772864, length=16384)]error = 5

    It seems to me that a sector on the disk might be dead in the ad0s1d
    slice (/var), but I want to be certain before I take further steps that
    the behavior I'm experiencing is positively unrelated to the migration
    to 6-CURRENT.

    I started poking around /var to see if anything was amiss, and I found
    that mail messages are being stacked up in /var/spool/clientmqueue, even
    though nothing should be using the msp queue (I've redirected periodic
    outputs to logfiles). In the last daily run mailed to root in January,
    I found records in the submit queue that looked like this:

    j0EDINHh049826 2489 Fri Jan 14 07:18 MAILER-DAEMON
                     (Deferred: Permission denied)

    There were nearly 500 of them.

    Even after redirecting periodic output to logs and clearing out the
    client mail queue, this continues to happen, and I have a hunch that it
    may be related to the WRITE_DMA timeouts, as it's the only weird
    behavior I can see on /var. If anyone can help me shed some light on
    this, I'd appreciate it. I've had 2 IDE drives die in this machine
    already, I'm going to be severely depressed if I've killed a third.

    -Reid
    _______________________________________________
    freebsd-current@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-current
    To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"


  • Next message: Eric Anderson: "Re: newfs limits? 10TB filesystem max?"

    Relevant Pages

    • [PATCH] p54: Fix for Bugzilla #13725 - tx refused but queue active
      ... In the mainline kernel, p54usb will fail because the TX queue length can ... This problem has been recently observed in the wireless-testing tree, ... but the simple change supplied here will prevent the failure. ...
      (Linux-Kernel)
    • Bulk Mail Sending in ASP
      ... For Bulk mails, Mail messages are processed by SMTP- ... Now we are going to replace the 3rd party control with MS-API. ... how send a message to SMTP queue in VB rather sending directly with ...
      (microsoft.public.exchange.applications)
    • Problems Mixing MSMQ API calls and System.Messaging in Pocket PC
      ... In order to place the message in the local queue of the Pocket PC I ... This dll, so that it can be invoked from the desktop should be a VC++ ... // close queue before returning send failure ... Managed code of the device App ...
      (microsoft.public.dotnet.framework.compactframework)
    • RE: Small Business SMTP Connector queue growing ...
      ... In the Queue are the mail messages in retry or are they in a ready state? ... Small Business SMTP Connector queue growing ... ... email notification before you are able to view my reply in the newsgroup. ...
      (microsoft.public.windows.server.sbs)
    • Re: Chankels memory lives on
      ... >an alleged points failure (alleged because I am not sure there ... >are any points) at Teddington led to our train being stopped in a queue ... >at Richmond. ...
      (uk.media.radio.archers)

    Loading