e 450 server not booting up

From: Rajeev Andrews (mailtorajiv_at_yahoo.com)
Date: 12/23/04

  • Next message: Kanellopoulos, Angelos: "restricted shell"
    Date: Thu, 23 Dec 2004 00:08:09 -0800 (PST)
    To: sunmanagers@sunmanagers.org
    
    

    Greetings All,

    We have one E 450 server with two CPU and one Board,2
    GB of Memory.The problem is that it went off suddenly
    and not coming up.Even it's not passing the POST and
    OpenBoot diganostics.We have made a hard reboot of the
    machine and sometimes it gave some messages on the
    HyperTerminal sometime not.But after some time it
    stops.

    Only I am getting a Green LED on Power On Icon and no
    blinkg Green LED on Activity Icon or any LED on
    General Fault Icon.

    I am attaching the output which i got in Hperterminal.

    Offline: CPU0 (MD'ed)
    Online: CPU1 Ultra-II (v2.0) 3:1 2048KB 2-2 ECache
    MCap 13
    Offline: CPU2 (MD'ed)
    Online: CPU3 Ultra-II (v2.0) 3:1 0KB 2-2 ECache MCap
    13
    Probing keyboard for L1/L1-D...
    Executing Power On SelfTest w/%o0 =
    0000.0000.0010.1002

    1>
    1>@(#) Sun UltraSPARC-II 4-way UPA/PCI POST 6.0.9
    02/12/99: 07:52
    1>INFO: Processor 1 is master CPU.
    1>INFO: Motherboard rev FCS
    1>
    1> <00> Init System BSS
    1> <00> NVRAM Battery Detect Test
    1> <00> NVRAM Scratch Addr Test
    1> <00> DMMU TLB Tag Access Test
    1> <00> DMMU TLB RAM Access Test
    1> <00> Probe Ecache
    1>INFO: 2048KB Ecache
    1> <00> Ecache RAM Addr Test
    1> <00> Ecache Tag Addr Test
    1> <00> Invalidate Ecache Tags
    1> <00> SC Dtag Probe
    1>INFO: Dtag supports up to 8MB Ecache
    1>INFO: Processor 0 is missing or disabled.
    1>INFO: Processor 2 is missing or disabled.
    1>INFO: Processor 3 - UltraSPARC-II.
    1> <00> Init SC Regs
    1> <00> SC Address Reg Test
    1> <00> SC Reg Index Test
    1> <00> SC Regs Test
    1> <00> SC Dtag RAM Addr Test
    1> <00> SC Dtag Init
    1> <00> Init SC Regs
    1> <00> SC Cache Size Init
    3> <00> Probe Ecache
    3>INFO: 2048KB Ecache
    3> <00> Ecache RAM Addr Test
    3>STATUS=FAILED
    3>TEST =Ecache RAM Addr
    SUSPECT =CPU Module, ECache Data
    3>MSG =RAM compare error
            index 00000000
            exp ffffffff.ffffffff
            obs 00000000.00000000
            xor ffffffff.ffffffff
    1> <00> Synch up Processor Ecache Sizes
    1> <00> Probe Memory
    1>INFO: 1024MB Bank 0
    1>INFO: 1024MB Bank 1
    1>INFO: No memory detected in Bank 2
    1>INFO: No memory detected in Bank 3
    1> <00> Test Memory Data Lines
    1> <00> Test Memory Address Lines
    1> <00> Malloc Post Memory
    1> <00> Init Post Memory
    1> <00> Map PROM/STACK/NVRAM in DMMU
    1> <00> Memory Stack Test
    1> <00> Init Memory
    1> <00> ECC Memory Addr Test
    1> <00> V9 Instruction Test
    1> <00> Memory Addr w/ Ecache Test
    1> <00> Block Memory Addr Test
    1> <00> Copy Post to Memory
    1> <00> Map/Exec POST from Memory
    1> <00> Map alternate CPUs

    SC Control: 0000.0000.0200.0000
    SC CP0 CFG: 0000.0000.8100.4000 STS:
    0000.0000.01e0.0000
    SC CP1 CFG: 0000.0000.0100.4000 STS:
    0000.0000.41e0.0000
    SC CP2 CFG: 0000.0000.8100.4000 STS:
    0000.0000.01e0.0000
    SC CP3 CFG: 0000.0000.0100.4000 STS:
    0000.0000.41e0.0000
    SC P1F CFG: 0000.0000.0110.4000 STS:
    0000.0000.0100.0000
    SC P06 CFG: 0000.0000.0110.4000 STS:
    0000.0000.0100.0000
    SC P04 CFG: 0000.0000.0110.4000 STS:
    0000.0000.0100.0000
    SC SIFault: 0000.0000.0000.0006
    CPU AFSR: 0000.0001.8800.0000 AFAR:
    0000.01ff.f150.0000
    CPU UDBH: 0000.0000.0000.0000 UDBL:
    7fff.f3b4.0100.0000
    P@1F Ctrl: 04f8.0000.0000.0000
      UE AFSR: 0000.4008.8400.0000 AFAR:
    0000.0082.8001.0011
      CE AFSR: 004c.0020.9400.0000 AFAR:
    0000.0030.4029.c012
    P@06 Ctrl: 04f8.0000.0000.0000
      UE AFSR: 0000.0985.8000.0000 AFAR:
    0000.0000.7070.4000
      CE AFSR: 0005.0414.5080.0000 AFAR:
    0000.01ac.0400.8408
    P@04 Ctrl: 04f8.0000.0000.0000
      UE AFSR: 0000.ffc9.7f00.0000 AFAR:
    0000.00db.7beb.ef14
      CE AFSR: 0073.3011.e180.0000 AFAR:
    0000.0128.3cc8.2da2

    Is this is the problem of any Hardware
    Failure(memory,board etc )

    Please advice on this.

    Thanks & Regards
    Rajeev Andrews

                    
    __________________________________
    Do you Yahoo!?
    Send a seasonal email greeting and help others. Do good.
    http://celebrity.mail.yahoo.com
    _______________________________________________
    sunmanagers mailing list
    sunmanagers@sunmanagers.org
    http://www.sunmanagers.org/mailman/listinfo/sunmanagers


  • Next message: Kanellopoulos, Angelos: "restricted shell"

    Relevant Pages

    • Re: Faulty CPU?
      ... > Seems to be a CPU problem (or memory, ... Instead the PSYND shows that a single byte in ecache had a parity ...
      (comp.unix.solaris)
    • Next July 27: boot failure(hang) on x86_64 box.
      ... Freeing unused kernel memory: 1360k freed ... ACPI: PM-Timer IO Port: 0x488 ... CPU: L2 Cache: 1024K ... # AX.25 network device drivers ...
      (Linux-Kernel)
    • [PATCH] Document Linuxs memory barriers [try #3]
      ... The attached patch documents the Linux kernel's memory barriers. ... I've tried to get rid of the concept of memory accesses appearing on the bus; ... barring implicit enforcement by the CPU. ...
      (Linux-Kernel)
    • Oops in 2.6.28-rc9 and -rc8 -- mtrr issues / e1000e
      ... Bios 1.04beta did show correct memory sizing in dmidecode, ... I hope this is as simple as me doing something glaringly wrong in the kernel ... DMI present. ... CPU: L2 cache: 6144K ...
      (Linux-Kernel)
    • Re: read vs. mmap (or io vs. page faults)
      ... not fit in main memory, and there are overheads related to the heuristics ... But because the CPU is underutilized, ... reasonably sized user buffer). ... You have to measure the actual overhead to see what the actual cost is. ...
      (freebsd-questions)