SUMMARY: Ultra5 Sunvts gives disturbing error without apparent problems

From: Fergus Wilde (fwilde@chethams.org.uk)
Date: 04/09/03

    From: Fergus Wilde <fwilde@chethams.org.uk>
    To: sunmanagers@sunmanagers.org
    Date: Wed, 9 Apr 2003 10:50:18 +0100
    

    Thanks to all those who wrote with helpful suggestions. Justin Stringfellow at
    Sun cracked it by pointing out a known issue with SunVTS reporting memory
    failures on Solaris 8 at certain patch levels:

    "When running SunVTS 4.1, 5.1 with Solaris 8 and kernel update patch
    108528-16, you will see error messages of:

    FATAL mem: "Unable to read pointer to memlist structure."

     Problem Solution

    Problem is due to the fact that the symbol "max_ce_err" was deleted from the
    kernel with the 108528-16 patch.

    Workaround:

          Install patch 113689 from http://sunsolve.sun.com
          Run with kernel patch 108528-15
          Do not run pmemtest with 108528-16 or later."

    I solved the problem by:
      1) updating to SUNWvts 5.1 and its associated packages
      2) installing patch 113689-10

    This summary was a little delayed while I registered with SunSolve to get this
    non-public patch. I suppose that without a support contract you would be
    obliged either not to run pmemtest (which still leaves another memory test
    available) or to back out to a kernel patch level below 108528-16.

    Thanks also to Joe Fletcher for reminding me that you need to run cat
    /etc/release to get the Solaris release version, e.g.:

    $ cat /etc/release
    Solaris 8 10/01 s28s_u6wos_08a SPARC
               Copyright 2001 Sun Microsystems, Inc. All Rights Reserved.
                               Assembled 12 September 2001

    and to David M. Dennis for helpful input about RAM. Please forgive me if I've
    missed anyone out; I've been collecting emails on a succession of different
    machines. My original query:

    In the spirit of hope triumphing over experience, I just wanted to ask if
    anyone can see a reason not to call the Sun Doctor (the machine is on Silver
    Service) after a stop a few seconds into a run of SunVTS with this error:

    04/03/03 15:36:05 server SunVTS4.5: VTSID 8004 pmemtest.
      FATAL mem: "Unable to read pointer to memlist structure."

    # uname -a
    SunOS server 5.8 Generic_108528-19 sun4u sparc SUNW,Ultra-5_10
    SunVTS version 4.5.

    Sun's own docs do seem to tell you to call them out, but I was just hoping
    someone knows a reason why the error might be in ... er ... error.

    The machine has up-to-date OBP and is giving no signs of trouble; the logs
    are void of doom-laden messages. However, since it's done very little work
    recently except fit its own recommended patch set, am I inviting trouble by
    doing nothing if I should need it soon?

    Fergus

    -- 
    Fergus Wilde
    Chetham's Library
    Long Millgate
    Manchester
    M3  1SB
    Tel: +44 161 834 7961
    Fax: +44 161 839 5797
    http://www.chethams.org.uk
    
