Re: READ_DMA interrupt was seen but timeout fired LBA=####

From: Ranjit (r4nj1t_at_xs4all.nl)
Date: 08/31/04

  • Next message: Marc G. Fournier: "free/flushing vnodes ..."
    To: "David E. Cross" <crossd@cs.rpi.edu>
    Date: Tue, 31 Aug 2004 22:02:24 +0200
    
    
    

    On Tue, 2004-08-31 at 20:35, David E. Cross wrote:
    > I searched the mailing list and didn't see anything listed for August, so I
    > thought I would write.
    >
    > Upon switching to 5.3-BETA2 I see these a LOT on my IBM TP T30. In fact
    > BETA2 panics shortly after boot, and once in DDB I can't actually do
    > anything ('panic' doesn't work, 'call boot(0)' doesn't work, eventually
    > DDB itself just locks). Upgrading to RELENG_5 makes it a bit better, I
    > still see oodles of those messages and WRITE_DMA messages also, but it
    > doesn't panic. Eventually everything will lock behind "ufs" in a wait
    > channel. Booting to single-user seems to make it better (I can run
    > commands, etc and not get stuck in [ufs]). ACPI doesn't seem to make a
    > difference. Also, the GENERIC on the CD loader doesn't seem to exhibit
    > any of these problems. Below is a copy of my DMESG; ideas? Any
    > additional information required?

    >

    I have a similar crash on 5.3-BETA2. It only happens on custom kernels;
    GENERIC runs fine.

    I have a crashdump with debugging kernel which I can make available if
    it would be useful. In any case I am including the backtrace and my
    system information.

    My system sources are all up to date except for sys/vm/uma_core.c, which
    is at version 1.104 rather than 1.105. I will try again with the newer
    version later tonight.

    My machine is a Sony VAIO PCG-NV105 or alternatively a PCG-9D7M
    depending on which label you look at.

    --------------------------------------------------------------------------

    FreeBSD ceylonica 5.3-BETA2 FreeBSD 5.3-BETA2 #0: Sun Aug 29 19:19:56
    CEST 2004 root@ceylonica:/usr/obj/usr/src/sys/GENERIC i386

    --------------------------------------------------------------------------

    _initialize_thread_db: can not load libthread_db.so.
    GNU gdb 6.1.1 [FreeBSD]
    Copyright 2004 Free Software Foundation, Inc.
    GDB is free software, covered by the GNU General Public License, and you
    are
    welcome to change it and/or distribute copies of it under certain
    conditions.
    Type "show copying" to see the conditions.
    There is absolutely no warranty for GDB. Type "show warranty" for
    details.
    This GDB was configured as "i386-marcel-freebsd".
    doadump () at pcpu.h:159
    (kgdb) where
    #0 doadump () at pcpu.h:159
    #1 0xc056a022 in boot (howto=260) at
    /usr/src/sys/kern/kern_shutdown.c:396
    #2 0xc056a410 in panic (fmt=0xc07029d5 "from debugger")
        at /usr/src/sys/kern/kern_shutdown.c:558
    #3 0xc0452472 in db_panic (addr=-1067945888, have_addr=0, count=-1,
        modif=0xcbcf09b0 "") at /usr/src/sys/ddb/db_command.c:435
    #4 0xc04523e2 in db_command (last_cmdp=0xc0777c84, cmd_table=0x0,
        aux_cmd_tablep=0xc073ce64, aux_cmd_tablep_end=0xc073ce68)
        at /usr/src/sys/ddb/db_command.c:349
    #5 0xc04524f5 in db_command_loop () at
    /usr/src/sys/ddb/db_command.c:455
    #6 0xc0454605 in db_trap (type=3, code=0) at
    /usr/src/sys/ddb/db_main.c:221
    #7 0xc058738e in kdb_trap (type=0, code=0, tf=0x1) at
    /usr/src/sys/kern/subr_kdb.c:418
    #8 0xc06d3d58 in trap (frame=
          {tf_fs = 24, tf_es = 16, tf_ds = -875626480, tf_edi = 256, tf_esi
    = 1, tf_ebp = -875623592, tf_isp = -875623620, tf_ebx = -875623532,
    tf_edx = 1, tf_ecx = -1056882688, tf_eax = 18, tf_trapno = 3, tf_err =
    0, tf_eip = -1067945888, tf_cs = 8, tf_eflags = 646, tf_esp =
    -1066268849, tf_ss = -1066277072}) at /usr/src/sys/i386/i386/trap.c:576
    #9 0xc06bf80a in calltrap () at /usr/src/sys/i386/i386/exception.s:140
    #10 0x00000018 in ?? ()
    #11 0x00000010 in ?? ()
    #12 0xcbcf0010 in ?? ()
    #13 0x00000100 in ?? ()
    #14 0x00000001 in ?? ()
    #15 0xcbcf0b58 in ?? ()
    #16 0xcbcf0b3c in ?? ()
    #17 0xcbcf0b94 in ?? ()
    #18 0x00000001 in ?? ()
    #19 0xc1014000 in ?? ()
    #20 0x00000012 in ?? ()
    #21 0x00000003 in ?? ()
    #22 0x00000000 in ?? ()
    #23 0xc0587060 in kdb_enter (msg=0x0) at cpufunc.h:56
    #24 0xc056a3aa in panic (fmt=0xc07327b3 "Duplicate free of item %p from
    zone %p(%s)\n")
        at /usr/src/sys/kern/kern_shutdown.c:542
    #25 0xc0695e34 in uma_dbg_free (zone=0xc152e000, slab=0xc17fff70,
    item=0xc17ff084)
        at /usr/src/sys/vm/uma_dbg.c:276
    #26 0xc06946e1 in uma_zfree_arg (zone=0xc152e000, item=0xc17ff084,
    udata=0x0)
        at /usr/src/sys/vm/uma_core.c:2222
    #27 0xc052f6b2 in g_destroy_bio (bp=0x0) at uma.h:302
    #28 0xc052d282 in g_disk_done (bp=0xc17ff084) at
    /usr/src/sys/geom/geom_disk.c:203
    #29 0xc05bf510 in biodone (bp=0xc17ff084) at
    /usr/src/sys/kern/vfs_bio.c:3002
    #30 0xc046f002 in ad_done (request=0xc17fc168) at
    /usr/src/sys/dev/ata/ata-disk.c:322
    #31 0xc045e4ac in ata_completed (context=0xc17fc168, dummy=0)
        at /usr/src/sys/dev/ata/ata-queue.c:404
    #32 0xc045e5fc in ata_timeout (request=0xc17fc168)
        at /usr/src/sys/dev/ata/ata-queue.c:442
    #33 0xc05788b8 in softclock (dummy=0x0) at
    /usr/src/sys/kern/kern_timeout.c:259
    #34 0xc0554132 in ithread_loop (arg=0xc14e4800) at
    /usr/src/sys/kern/kern_intr.c:546
    #35 0xc0553176 in fork_exit (callout=0xc0553fc0 <ithread_loop>, arg=0x0,
    frame=0x0)
        at /usr/src/sys/kern/kern_fork.c:820
    #36 0xc06bf86c in fork_trampoline () at
    /usr/src/sys/i386/i386/exception.s:209

    --------------------------------------------------------------------------

    Output from dmesg

    Copyright (c) 1992-2004 The FreeBSD Project.
    Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
            The Regents of the University of California. All rights
    reserved.
    FreeBSD 5.3-BETA2 #0: Tue Aug 31 16:48:01 CEST 2004
        root@ceylonica:/usr/obj/usr/src/sys/CEYTEST
    WARNING: WITNESS option enabled, expect reduced performance.
    Timecounter "i8254" frequency 1193182 Hz quality 0
    CPU: Intel(R) Pentium(R) 4 Mobile CPU 1.60GHz (1590.82-MHz 686-class
    CPU)
      Origin = "GenuineIntel" Id = 0xf24 Stepping = 4
     
    Features=0x3febf9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM>
    real memory = 267911168 (255 MB)
    avail memory = 252452864 (240 MB)
    npx0: [FAST]
    npx0: <math processor> on motherboard
    npx0: INT 16 interface
    acpi0: <SONY> on motherboard
    acpi_ec0: <Embedded Controller: GPE 0x1c> port 0x66,0x62 on acpi0
    Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
    acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
    cpu0: <ACPI CPU (2 Cx states)> on acpi0
    acpi_tz0: <Thermal Zone> on acpi0
    acpi_lid0: <Control Method Lid Switch> on acpi0
    acpi_button0: <Power Button> on acpi0
    pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
    acpi link get: empty IRQ resource
    acpi link get: empty IRQ resource
    acpi link get: empty IRQ resource
    pci0: <ACPI PCI bus> on pcib0
    agp0: <Intel 82845 host to AGP bridge> mem 0xec000000-0xefffffff at
    device 0.0 on pci0
    pcib1: <ACPI PCI-PCI bridge> at device 1.0 on pci0
    pci1: <ACPI PCI bus> on pcib1
    drm0: <ATI Radeon LW Mobility 7500 M7> port 0x3000-0x30ff mem
    0xe8100000-0xe810ffff,0xf0000000-0xf7ffffff irq 9 at device 0.0 on pci1
    info: [drm] AGP at 0xec000000 64MB
    info: [drm] Initialized radeon 1.11.0 20020828 on minor 0
    uhci0: <Intel 82801CA/CAM (ICH3) USB controller USB-A> port
    0x1800-0x181f irq 9 at device 29.0 on pci0
    uhci0: [GIANT-LOCKED]
    usb0: <Intel 82801CA/CAM (ICH3) USB controller USB-A> on uhci0
    usb0: USB revision 1.0
    uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
    uhub0: 2 ports with 2 removable, self powered
    uhci1: <Intel 82801CA/CAM (ICH3) USB controller USB-B> port
    0x1820-0x183f irq 9 at device 29.1 on pci0
    uhci1: [GIANT-LOCKED]
    usb1: <Intel 82801CA/CAM (ICH3) USB controller USB-B> on uhci1
    usb1: USB revision 1.0
    uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
    uhub1: 2 ports with 2 removable, self powered
    uhci2: <Intel 82801CA/CAM (ICH3) USB controller USB-C> port
    0x1840-0x185f at device 29.2 on pci0
    uhci2: [GIANT-LOCKED]
    usb2: <Intel 82801CA/CAM (ICH3) USB controller USB-C> on uhci2
    usb2: USB revision 1.0
    uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
    uhub2: 2 ports with 2 removable, self powered
    umass0: Sony USB Memory Stick Slot, rev 1.10/1.09, addr 2
    pcib2: <ACPI PCI-PCI bridge> at device 30.0 on pci0
    acpi link get: empty IRQ resource
    acpi link get: empty IRQ resource
    pci2: <ACPI PCI bus> on pcib2
    cbb0: <RF5C476 PCI-CardBus Bridge> irq 9 at device 5.0 on pci2
    cardbus0: <CardBus bus> on cbb0
    pccard0: <16-bit PCCard bus> on cbb0
    cbb1: <RF5C476 PCI-CardBus Bridge> at device 5.1 on pci2
    cardbus1: <CardBus bus> on cbb1
    pccard1: <16-bit PCCard bus> on cbb1
    fwohci0: <Ricoh R5C552> mem 0xe8201000-0xe82017ff at device 5.2 on pci2
    fwohci0: [GIANT-LOCKED]
    fwohci0: OHCI version 1.0 (ROM=1)
    fwohci0: No. of Isochronous channels is 4.
    fwohci0: EUI64 08:00:46:03:01:16:9e:94
    fwohci0: Phy 1394a available S400, 2 ports.
    fwohci0: Link S400, max_rec 2048 bytes.
    firewire0: <IEEE1394(FireWire) bus> on fwohci0
    fwe0: <Ethernet over FireWire> on firewire0
    if_fwe0: Fake Ethernet address: 0a:00:46:16:9e:94
    fwe0: Ethernet address: 0a:00:46:16:9e:94
    sbp0: <SBP-2/SCSI over FireWire> on firewire0
    fwohci0: Initiate bus reset
    fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
    firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
    firewire0: bus manager 0 (me)
    fxp0: <Intel 82801CAM (ICH3) Pro/100 VE Ethernet> port 0x4000-0x403f mem
    0xe8200000-0xe8200fff irq 9 at device 8.0 on pci2
    miibus0: <MII bus> on fxp0
    inphy0: <i82562ET 10/100 media interface> on miibus0
    inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
    fxp0: Ethernet address: 08:00:46:5c:6d:fc
    fxp0: [GIANT-LOCKED]
    isab0: <PCI-ISA bridge> at device 31.0 on pci0
    isa0: <ISA bus> on isab0
    atapci0: <Intel ICH3 UDMA100 controller> port
    0x1860-0x186f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
    ata0: channel #0 on atapci0
    ata1: channel #1 on atapci0
    pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
    pci0: <multimedia, audio> at device 31.5 (no driver attached)
    pci0: <simple comms> at device 31.6 (no driver attached)
    atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
    atkbd0: <AT Keyboard> irq 1 on atkbdc0
    kbd0 at atkbd0
    atkbd0: [GIANT-LOCKED]
    psm0: <PS/2 Mouse> irq 12 on atkbdc0
    psm0: [GIANT-LOCKED]
    psm0: model GlidePoint, device ID 0
    sio0 port 0x3f8-0x3ff irq 4 on acpi0
    sio0: type 16550A
    ppc0 port 0x778-0x77f,0x378-0x37f irq 7 drq 3 on acpi0
    ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
    ppc0: FIFO with 16/16/15 bytes threshold
    ppbus0: <Parallel port bus> on ppc0
    lpt0: <Printer> on ppbus0
    lpt0: Interrupt-driven port
    ppi0: <Parallel I/O> on ppbus0
    fdc0: <floppy drive controller (FDE)> port 0x3f7,0x3f0-0x3f5 irq 6 drq 2
    on acpi0
    acpi_cmbat0: <Control Method Battery> on acpi0
    acpi_acad0: <AC Adapter> on acpi0
    orm0: <ISA Option ROMs> at iomem
    0xdc000-0xdffff,0xd8000-0xdbfff,0xc0000-0xcffff on isa0
    pmtimer0 on isa0
    sc0: <System console> at flags 0x100 on isa0
    sc0: VGA <16 virtual consoles, flags=0x300>
    sio1: configured irq 3 not in bitmap of probed irqs 0
    sio1: port may not be enabled
    vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on
    isa0
    Timecounter "TSC" frequency 1590816140 Hz quality 800
    Timecounters tick every 10.000 msec
    acpi_cpu: throttling enabled, 8 steps (100% to 12.5%), currently 100.0%
    ad0: 28615MB <IC25N030ATCS04-0/CA3OA71A> [58140/16/63] at ata0-master
    UDMA100
    ATAPI_RESET time = 230us
    ata1-slave: FAILURE - ATAPI_IDENTIFY timed out
    ATAPI_RESET time = 240us
    acd0: CDRW <UJDA730 DVD/CDRW/1.00> at ata1-master UDMA33
    (probe0:umass-sim0:0:0:0): INQUIRY. CDB: 12 0 0 0 24 0
    (probe0:umass-sim0:0:0:0): CAM Status: SCSI Status Error
    (probe0:umass-sim0:0:0:0): SCSI Status: Check Condition
    (probe0:umass-sim0:0:0:0): NOT READY asc:3a,0
    (probe0:umass-sim0:0:0:0): Medium not present
    (probe0:umass-sim0:0:0:0): Unretryable error
    Mounting root from ufs:/dev/ad0s1a
    ad0: WARNING - READ_DMA interrupt was seen but timeout fired
    LBA=49945023
    ad0: WARNING - READ_DMA interrupt was seen but timeout fired
    LBA=10029087
    ad0: WARNING - READ_DMA interrupt was seen but timeout fired
    LBA=10029087
    ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired
    LBA=1659195
    ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired
    LBA=1659195
    ad0: WARNING - WRITE_DMA interrupt was seen but taskqueue stalled
    LBA=1659195
    Slab at 0xc17fff70, freei 1 = 0.
    panic: Duplicate free of item 0xc17ff084 from zone 0xc152e000(g_bio)

    cpuid = 0;
    KDB: enter: panic
    panic: from debugger
    cpuid = 0;
    Uptime: 36s
    Dumping 255 MB
     16 32 48 64 80 96 112 128 144 160 176 192 208 224 240Copyright (c)
    1992-2004 The FreeBSD Project.
    Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
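
The repeated ad0 warnings in the dmesg output above can be tallied per command and LBA, which makes it easier to see that the same LBAs recur before the panic. The following is a minimal Python sketch, not part of the original report. The function name `summarize_ata_warnings` and the `SAMPLE` text (copied from the warnings above) are illustrative assumptions, and the pattern only covers the two message variants that appear in this log.

```python
import re
from collections import Counter

# Matches the ata(4) warnings quoted in the dmesg output above.
# \s+ before "LBA=" tolerates the line wrap introduced by the mail client.
WARNING_RE = re.compile(
    r"(?P<dev>ad\d+): WARNING - (?P<cmd>\w+) interrupt was seen but "
    r"(?P<reason>timeout fired|taskqueue stalled)\s+LBA=(?P<lba>\d+)"
)


def summarize_ata_warnings(dmesg_text):
    """Count warnings per (device, command, LBA) tuple."""
    counts = Counter()
    for match in WARNING_RE.finditer(dmesg_text):
        key = (match.group("dev"), match.group("cmd"), int(match.group("lba")))
        counts[key] += 1
    return counts


# Warning lines copied from the dmesg output above, wrapping included.
SAMPLE = """\
ad0: WARNING - READ_DMA interrupt was seen but timeout fired
LBA=49945023
ad0: WARNING - READ_DMA interrupt was seen but timeout fired
LBA=10029087
ad0: WARNING - READ_DMA interrupt was seen but timeout fired
LBA=10029087
ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired
LBA=1659195
ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired
LBA=1659195
ad0: WARNING - WRITE_DMA interrupt was seen but taskqueue stalled
LBA=1659195
"""

if __name__ == "__main__":
    for (dev, cmd, lba), n in sorted(summarize_ata_warnings(SAMPLE).items()):
        print(f"{dev} {cmd} LBA={lba}: {n} warning(s)")
```

In this log the WRITE_DMA request at LBA 1659195 is reported three times, the last time as "taskqueue stalled", immediately before the duplicate-free panic.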

    --------------------------------------------------------------------------

    The differences between my kernel config & GENERIC are :

    Removed:
     #cpu I486_CPU
     #cpu I586_CPU
     #options INET6
     #options NFSSERVER
     #options NFS_ROOT
     #options KTRACE
     #options AHC_REG_PRETTY_PRINT
     #options AHD_REG_PRETTY_PRINT
     #device eisa
     #device ahb
     #device ahc
     #device ahd
     #device amd
     #device isp
     #device mpt
     #device sym
     #device trm
     #device adv
     #device adw
     #device bt
     #device amr
     #device asr
     #device ciss
     #device dpt
     #device iir
     #device ips
     #device mly
     #device twa
     #device aac
     #device aacp
     #device ida
     #device mlx
     #device pst
     #device twe
     #device plip
     #device de
     #device em
     #device txp
     #device vx
     #device bfe
     #device bge
     #device pcn
     #device rl
     #device sf
     #device sis
     #device sk
     #device ste
     #device ti
     #device tl
     #device fe
     #device wl
     #device an
     #device awi
     #device wi
     #device sl
     #device faith
     #device aue
     #device axe
     #device cue
     #device kue
     #device rue

    Added:

    device radeondrm
    device sound
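
A list like the one above can be produced mechanically by comparing the active `cpu`, `options` and `device` lines of the two kernel config files. The following is a minimal Python sketch, not the method used in the original report. The helper names `config_entries` and `config_diff` are hypothetical, and the two sample texts are short illustrative excerpts rather than the real GENERIC or CEYTEST files.

```python
def config_entries(text):
    """Return the set of (keyword, name) pairs for active config lines."""
    entries = set()
    for line in text.splitlines():
        # Drop comments; a commented-out entry such as "#device em" becomes empty.
        line = line.split("#", 1)[0].strip()
        parts = line.split()
        if len(parts) >= 2 and parts[0] in ("cpu", "options", "device"):
            entries.add((parts[0], parts[1]))
    return entries


def config_diff(generic_text, custom_text):
    """Return (removed, added) entries of the custom config relative to GENERIC."""
    generic = config_entries(generic_text)
    custom = config_entries(custom_text)
    return sorted(generic - custom), sorted(custom - generic)


# Illustrative excerpts only (assumed for this example).
GENERIC_SAMPLE = """\
cpu I486_CPU
cpu I586_CPU
cpu I686_CPU
options INET
options INET6
device ata
device em
"""

CUSTOM_SAMPLE = """\
#cpu I486_CPU
#cpu I586_CPU
cpu I686_CPU
options INET
#options INET6
device ata
#device em
device radeondrm
device sound
"""

if __name__ == "__main__":
    removed, added = config_diff(GENERIC_SAMPLE, CUSTOM_SAMPLE)
    print("Removed:")
    for keyword, name in removed:
        print(f" #{keyword} {name}")
    print("Added:")
    for keyword, name in added:
        print(f" {keyword} {name}")
```

Comparing parsed entries rather than raw lines means that whitespace and trailing comments in either file do not show up as differences.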

    ---------------------------------------------------------------------

    I hope this is enough information to be useful. If someone wants to look
    at this and needs more information then I would be pleased to provide
    it.

    
    


