Re: Page fault in FreeBSD 5.3 on IBM e325, Dual AMD64 2.2GHz, 4GB RAM, ServeRAID 6M - debug logs

From: Ganbold (ganbold_at_micom.mng.net)
Date: 11/22/04

  • Next message: Peter Jeremy: "Re: [REVIEW/TEST] nanodelay() vs DELAY()"
    Date: Mon, 22 Nov 2004 15:27:56 +0800
    To: Scott Long <scottl@freebsd.org>
    
    

    Scott,

    I didn't put debug.mpsafenet=0 to /boot/loader.conf. I compiled kernel with:

            options DDB
            options KDB

    Here is crash time debug output and dmesg:

    --------------------------------------------------------------------------------------------------------
    Console: serial port
    BIOS drive C: is disk0
    BIOS 614kB/4127168kB available memory

    FreeBSD/i386 bootstrap loader, Revision 1.1
    (root@publica.ub.mng.net, Sun Nov 21 18:24:58 ULAT 2004)
    Loading /boot/defaults/loader.conf
    /boot/kernel/kernel text=0x2fa370 data=0x7a920+0x3b9d0
    -syms=[0x8+0x58278-+0x8+0x4b956|
    KDB: debugger backends: ddb
    KDB: current backend: ddb
    Copyright (c) 1992-2004 The FreeBSD Project.
    Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
            The Regents of the University of California. All rights reserved.
    FreeBSD 5.3-STABLE #12: Mon Nov 22 12:04:57 ULAT 2004
         tsgan@publicc.ub.mng.net:/usr/obj/usr/src/sys/AMD
    Timecounter "i8254" frequency 1193182 Hz quality 0
    CPU: AMD Opteron(tm) Processor 248 (2193.17-MHz K8-class CPU)
       Origin = "AuthenticAMD" Id = 0xf58 Stepping = 8
       Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
       AMD Features=0xe0500800<SYSCALL,NX,MMX+,LM,3DNow+,3DNow>
    real memory = 4227268608 (4031 MB)
    avail memory = 4083208192 (3894 MB)
    ACPI APIC Table: <PTLTD APIC >
    FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
      cpu0 (BSP): APIC ID: 0
      cpu1 (AP): APIC ID: 1
    MADT: Forcing active-low polarity and level trigger for SCI
    ioapic0 <Version 1.1> irqs 0-23 on motherboard
    ioapic1 <Version 1.1> irqs 24-27 on motherboard
    ioapic2 <Version 1.1> irqs 28-31 on motherboard
    acpi0: <PTLTD XSDT> on motherboard
    acpi0: Power Button (fixed)
    unknown: I/O range not supported
    unknown: I/O range not supported
    Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
    acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0
    cpu0: <ACPI CPU> on acpi0
    cpu1: <ACPI CPU> on acpi0
    acpi_button0: <Power Button> on acpi0
    pcib0: <ACPI Host-PCI bridge> port 0x8080-0x80ff,0x8000-0x807f,0xcf8-0xcff
    iomem 0xd8000-0xdbfff on acpi0
    pci0: <ACPI PCI bus> on pcib0
    pcib1: <ACPI PCI-PCI bridge> at device 6.0 on pci0
    pci1: <ACPI PCI bus> on pcib1
    ohci0: <OHCI (generic) USB controller> mem 0xfc100000-0xfc100fff irq 19 at
    device 0.0 on pci1
    ohci0: [GIANT-LOCKED]
    usb0: OHCI version 1.0, legacy support
    usb0: SMM does not respond, resetting
    usb0: <OHCI (generic) USB controller> on ohci0
    usb0: USB revision 1.0
    uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
    uhub0: 3 ports with 3 removable, self powered
    ohci1: <OHCI (generic) USB controller> mem 0xfc101000-0xfc101fff irq 19 at
    device 0.1 on pci1
    ohci1: [GIANT-LOCKED]
    usb1: OHCI version 1.0, legacy support
    usb1: SMM does not respond, resetting
    usb1: <OHCI (generic) USB controller> on ohci1
    usb1: USB revision 1.0
    uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
    uhub1: 3 ports with 3 removable, self powered
    ukbd0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1
    kbd1 at ukbd0
    uhid0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1
    pci1: <display, VGA> at device 5.0 (no driver attached)
    isab0: <PCI-ISA bridge> at device 7.0 on pci0
    isa0: <ISA bus> on isab0
    pci0: <mass storage, ATA> at device 7.1 (no driver attached)
    pci0: <bridge, PCI-unknown> at device 7.3 (no driver attached)
    pcib2: <ACPI PCI-PCI bridge> at device 10.0 on pci0
    pci2: <ACPI PCI bus> on pcib2
    bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem
    0xfe000000-0xfe00ffff,0xfe010000-0xfe01ffff irq 24 at device 1.0 on pci2
    miibus0: <MII bus> on bge0
    brgphy0: <BCM5704 10/100/1000baseTX PHY> on miibus0
    brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,
    1000baseTX-FDX, auto
    bge0: Ethernet address: 00:0d:60:14:cc:9e
    bge1: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem
    0xfe020000-0xfe02ffff,0xfe030000-0xfe03ffff irq 25 at device 1.1 on pci2
    miibus1: <MII bus> on bge1
    brgphy1: <BCM5704 10/100/1000baseTX PHY> on miibus1
    brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,
    1000baseTX-FDX, auto
    bge1: Ethernet address: 00:0d:60:14:cc:9f
    pci2: <mass storage, SCSI> at device 2.0 (no driver attached)
    pci0: <base peripheral, interrupt controller> at device 10.1 (no driver
    attached)
    pcib3: <ACPI PCI-PCI bridge> at device 11.0 on pci0
    pci3: <ACPI PCI bus> on pcib3
    pcib4: <PCI-PCI bridge> at device 3.0 on pci3
    pci4: <PCI bus> on pcib4
    ips0: <Adaptec ServeRAID Adapter> mem 0xfe100000-0xfe100fff irq 28 at
    device 8.0 on pci4
    ips0: [GIANT-LOCKED]
    pci0: <base peripheral, interrupt controller> at device 11.1 (no driver
    attached)
    sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
    sio0: type 16550A, console
    orm0: <ISA Option ROMs> at iomem
    0xcb000-0xcb7ff,0xc9800-0xcafff,0xc8000-0xc97ff,0xc0000-0xc7fff on isa0
    atkbdc0: <Keyboard controller (i8042)> at port 0x64,0x60 on isa0
    atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
    device_attach: atkbd0 attach returned 6
    sc0: <System console> at flags 0x100 on isa0
    sc0: VGA <16 virtual consoles, flags=0x100>
    sio1: configured irq 3 not in bitmap of probed irqs 0
    sio1: port may not be enabled
    vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
    Timecounters tick every 0.976 msec
    ips0: adapter type: ServeRAID 6M (marco)
    ips0: logical drives: 1
    ips0: Logical Drive 0: RAID5 sectors: 355481600, state OK
    ipsd0: <Logical Drive> on ips0
    ipsd0: Logical Drive (173575MB)
    SMP: AP CPU #1 Launched!
    Mounting root from ufs:/dev/ipsd0s1a
    Pre-seeding PRNG: kickstart.
    Loading configuration files.
    Entropy harvesting: interrupts ethernet point_to_point kickstart.
    swapon: adding /dev/ipsd0s1b as swap device
    Starting file system checks:
    /dev/ipsd0s1a: FILE SYSTEM CLEAN; SKIPPING CHECKS
    /dev/ipsd0s1a: clean, 507157 free (7565 frags, 62449 blocks, 0.7%
    fragmentation)
    /dev/ipsd0s1f: FILE SYSTEM CLEAN; SKIPPING CHECKS
    /dev/ipsd0s1f: clean, 1012200 free (40 frags, 126520 blocks, 0.0%
    fragmentation)
    /dev/ipsd0s1d: FILE SYSTEM CLEAN; SKIPPING CHECKS
    /dev/ipsd0s1d: clean, 4876181 free (45669 frags, 603814 blocks, 0.4%
    fragmentation)
    /dev/ipsd0s1e: FILE SYSTEM CLEAN; SKIPPING CHECKS
    /dev/ipsd0s1e: clean, 4965575 free (1087 frags, 620561 blocks, 0.0%
    fragmentation)
    /dev/ipsd0s1g: FILE SYSTEM CLEAN; SKIPPING CHECKS
    /dev/ipsd0s1g: clean, 53622407 free (671 frags, 6702717 blocks, 0.0%
    fragmentation)
    Setting hostname: publica.ub.mng.net.
    bge0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
            options=1a<TXCSUM,VLAN_MTU,VLAN_HWTAGGING>
            inet 202.179.0.80 netmask 0xffffffe0 broadcast 202.179.0.95
            ether 00:0d:60:14:cc:9e
            media: Ethernet 100baseTX <full-duplex>
            status: active
    lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
            inet 127.0.0.1 netmask 0xff000000
    add net default: gateway 202.179.0.65
    Additional routing options:.
    Starting devd.
    kbdcontrol: cannot open /dev/kbd1: Device busy
    Mounting NFS file systems:.
    Starting syslogd.
    Nov 22 13:49:36 publica syslogd: kernel boot file is /boot/kernel/kernel

    ELF ldconfig path: /lib /usr/lib /usr/lib/compat /usr/X11R6/lib /usr/local/lib
    Starting usbd.
    Starting local daemons:.
    Updating motd.
    Configuring syscons: blanktime.
    Starting sshd.
    Initial amd64 initialization:.
    Additional ABI support:.
    Starting cron.
    Local package initialization:Starting apache.
    Starting clamav_clamd.
    Starting clamav_freshclam.
    Starting exim.
    Starting mysql.
    Starting spamd.
    .
    Additional TCP options:.
    Starting inetd.
    Starting background file system checks in 60 seconds.

    Mon Nov 22 13:49:41 ULAT 2004
    ààààààNààààààNàNov ààNààààNoàNàNov 22 14:08:àNàààààààààààààààààààààààààààà

    Fatal trap 12: page fault while in kernel mode
    cpuid = 1; apic id = 01
    fault virtual address = 0x18
    fault code = supervisor read, page not present
    instruction pointer = 0x8:0xffffffff80277fc0
    stack pointer = 0x10:0xffffffffb36ab830
    frame pointer = 0x10:0xffffffffb36ab890
    code segment = base 0x0, limit 0xfffff, type 0x1b
                            = DPL 0, pres 1, long 1, def32 0, gran 1
    processor eflags = interrupt enabled, resume, IOPL = 0
    current process = 44 (swi1: net)
    [thread 100044]
    Stopped at m_copym+0x190: incl %ecx
    db> bt
    No such command
    db> help
    print p examine x search set write
    w delete d break dwatch watch dhwatch
    hwatch step s continue c until next
    match trace where call show ps gdb
    reset kill watchdog thread panic
    db> trace
    m_copym() at m_copym+0x190
    tcp_output() at tcp_output+0xe91
    tcp_input() at tcp_input+0x2f80
    ip_input() at ip_input+0xff
    netisr_processqueue() at netisr_processqueue+0x79
    swi_net() at swi_net+0x14d
    ithread_loop() at ithread_loop+0xde
    fork_exit() at fork_exit+0x8f
    fork_trampoline() at fork_trampoline+0xe
    --- trap 0, rip = 0, rsp = 0xffffffffb36abd00, rbp = 0 ---
    db> trace
    m_copym() at m_copym+0x190
    tcp_output() at tcp_output+0xe91
    tcp_input() at tcp_input+0x2f80
    ip_input() at ip_input+0xff
    netisr_processqueue() at netisr_processqueue+0x79
    swi_net() at swi_net+0x14d
    ithread_loop() at ithread_loop+0xde
    fork_exit() at fork_exit+0x8f
    fork_trampoline() at fork_trampoline+0xe
    --- trap 0, rip = 0, rsp = 0xffffffffb36abd00, rbp = 0 ---
    db>
    --------------------------------------------------------------------------------------------------------

    It seems to me the problem is related to network stack and threading.
    Am I right? How to solve this problem?

    thanks in advance,

    Ganbold

    _______________________________________________
    freebsd-current@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-current
    To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"


  • Next message: Peter Jeremy: "Re: [REVIEW/TEST] nanodelay() vs DELAY()"