Re: AIX Performance issue

From: Scott Richardson (CheetahFTL_at_attbi.com)
Date: 01/29/04


Date: Thu, 29 Jan 2004 00:36:16 GMT


"Khurram Khan" <khurram.khan@qict.net> wrote in message
news:a23090a4.0401280522.78307f1e@posting.google.com...
> Hi all,
>
> We are using F50 along with AIX 4.3.2 server and Oracle database 7.3.4
> is installed on that server we have approx 90 users on the network 100
> baseT network; however approx 30 users normally access AIX server
> through oracle application, every thing was running fine during last 3
> years but since a month occasionally (once in a week) we encounter
> problem when system doesn't allow users to commit any thing in oracle
> application, we have checked all possibilities in oracle and didn't
> found any error.
>
> Lastly during the time when we have encountered that error I have run
> sar and lpstat command and their output was as given below.
>
> / >sar 1 10
>
> AIX aix1 3 4 0041B35A4C00 01/28/04
>
> 02:29:56 %usr %sys %wio %idle
> 02:29:57 87 13 0 0
> 02:29:58 87 13 0 0
> 02:29:59 89 11 0 0
> 02:30:00 89 11 0 0
> 02:30:01 79 21 0 0
> 02:30:02 86 14 0 0
> 02:30:03 79 21 0 0
> 02:30:04 84 16 0 0
> 02:30:05 85 15 0 0
> 02:30:06 85 15 0 0
>
> Average 85 15 0 0
> / >iostat 10 2
>
> tty: tin tout avg-cpu: % user % sys % idle %
> iowait
> 5.1 6.8 38.0 11.2 42.4
> 8.5
>
> Disks: % tm_act Kbps tps Kb_read Kb_wrtn
> hdisk0 5.0 27.5 5.9 38467844 37423398
> hdisk1 19.7 362.0 39.2 927759935 71088344
> cd0 0.0 0.0 0.0 0 0
>
> tty: tin tout avg-cpu: % user % sys % idle %
> iowait
> 5.3 72.0 80.0 20.0 0.0
> 0.0
>
> Disks: % tm_act Kbps tps Kb_read Kb_wrtn
> hdisk0 10.5 60.7 13.9 464 152
> hdisk1 16.0 433.1 52.3 4296 96
> cd0 0.0 0.0 0.0 0 0
> / >
>
>
> Mostly when error occurred our database daily backup was in process
> however database runs daily but error occurs occationally and since
> long time we are taking database backup with same procedure.
>
> ---------------------------------------
> database backup script
> -----------------------------------------
> DATE=`date +"%d%m%H%M"`
> export_file=express${DATE}.dmp
> su - ora732 -c 'mknod /livedb/backup/'${export_file}' p'
> su - ora732 -c 'cat /livedb/backup/'${export_file}' | compress >
> /livedb/backup/
> '${export_file}'.Z &'
> su - ora732 -c 'exp system/manager
> file=/livedb/backup/'${export_file}' full=y l
> og=/livedb/backup/express-backup.log > /dev/null 2>&1'
> su - ora732 -c 'rm -f /livedb/backup/'${export_file}''
> ----------------------------------------
> during peak working hours I have again run sar command and got given
> output
> / >sar 1 10
>
> AIX aix1 3 4 0041B35A4C00 01/28/04
>
> 16:04:36 %usr %sys %wio %idle
> 16:04:37 75 25 0 0
> 16:04:38 76 24 0 0
> 16:04:39 79 21 0 0
> 16:04:40 68 32 0 0
> 16:04:41 78 22 0 0
> 16:04:42 83 17 0 0
> 16:04:43 73 25 2 0
> 16:04:44 78 22 0 0
> 16:04:45 77 23 0 0
> 16:04:46 73 27 0 0
>
> Average 76 24 0 0
> / >
>
> after 4 minutes I again run same command and got mentioned output
>
> / >sar 1 10
>
> AIX aix1 3 4 0041B35A4C00 01/28/04
>
> 16:08:59 %usr %sys %wio %idle
> 16:09:00 46 23 32 0
> 16:09:01 30 55 15 0
> 16:09:02 30 49 21 0
> 16:09:03 42 17 41 0
> 16:09:04 41 21 38 0
> 16:09:05 35 27 38 0
> 16:09:06 37 26 37 0
> 16:09:07 32 22 46 0
> 16:09:08 52 16 32 0
> 16:09:09 48 40 12 0
>
> Average 39 30 31 0
> /
>
>
> after 2 hour sar output
> / >sar 1 10
>
> AIX aix1 3 4 0041B35A4C00 01/28/04
>
> 18:08:31 %usr %sys %wio %idle
> 18:08:32 25 12 4 59
> 18:08:33 27 12 2 59
> 18:08:34 21 11 0 68
> 18:08:35 36 6 6 52
> 18:08:36 45 5 0 50
> 18:08:37 29 9 1 61
> 18:08:38 26 3 5 66
> 18:08:39 42 7 1 50
> 18:08:40 22 9 1 68
> 18:08:41 22 8 8 62
>
> Average 30 8 3 60
> / >
>
>
> I will highly appreciate if any of you can advice any thing to rectify
> that issue or to identify root cause of the problem
>
> Thanks,
>
> Khurram

Hello Khurram,
I have read your post on comp.unix.aix with interest.

You do not mention how much memory, how much swap
space is configured, and what other processes may be running
on the system, or what/how backups are accomplished. You
also do not mentioned what else may have changed recently
on the system since this problem has surfaced: Perhaps more
users, even if only minimal increase? A new user application
or process? Perhaps the scripts and commands you're running
to try to discover the problem may be contributing to the
problem? Who knows.

AIX 4.3.2 ? Is it possible the AIX OS needs to be updated,
to address possible OS bugs? Same for Oracle 7.3.4?
These are both extremely old versions of the Operating
System and RDBMS. Is this a home grown application?
Is there a VAR involved who developed/sold and supports
this application, in this database, on this Operating System?

It is many times difficult to gain a comprehensive picture of
everything that may be going on within a system of this size,
over time, by running some OS commands, or scripts, to try to
gain inisght as to what the problem or problems could be. It
could very well be that these scripts and commands, and the
overhead they require to gather such information, may
contribute to, or exasperate the problem or issue casuing the
performance problem(s).

May I suggest you consider running an extremely low overhead
process that tracks ALL system parameters and metrics, not only
at the AIX OS level, but also at the ORACLE RDBMS level.
This low-overhead process is a DPMonitor Performance Agent.

It runs on your AIX/Oracle Server, and keeps track of all that
goes on with extremely low-overhead, low-level kernel calls,
over time, 24 hours a day, 7 days a week, and it saves all
data in a very small footprint sized,compressed format file.

This compressed format data file generated by the DPMonitor
Performance Agent is sent to a DPMonitor Performance Explorer
Console process, that reads the compressed format file, and
generates easy to read, colorful, dynamically scaling graphs that
clearly show exactly what is going on within your system, around
the clock, over time. Information such as this is extremely critical
in helping identify, and the address/resolve the issues which affect
your application server platform's operational dynamics, at both
the AIX OS level, and at the Oracle RDBMS level.

May you go to the www.deltek.us website and check out the
DPMonitor.

I have used this Performance Monitor product at several sites,
across numerous applications, databases, and operating systems.
It helps easily point out problem areas on it's graphs that can be
easily understood even by non-technical management types, and
furthermore, after you take action to correct and resolve the
identified problem, you can help prove that the action you took to
address/resolve the problem actually did solve the problem. If it
per chance does not, then you have a graph like map to show you
what other issue has now popped up, which you can attack or
address and resolve.

Performance tuning, as you may know, is often like peeling an
onion; cutting through the layers to find the real root cause of a
problem, and addressing it at it's source. Often times, application
platform performance bottleneck issues often mask themselves
as other problems.

I wish you much luck in finding the root cause of your platform's
performance issue, and hope you consider the DPMonitor to help
take a lot of the mystery out of finding and resolving these problems.

Regards,
Scott Richardson
Sr Systems Engineering Consultant
Marlborough, MA USA



Relevant Pages

  • Libraries needed for Pro*C on AIX
    ... I am porting a program from Ms Windows to AIX 5. ... updating BLOB to the database. ... the only oracle library that is used is oraSQL.lib. ... on the AIX when I link the program to libsql9.a. ...
    (comp.databases.oracle.misc)
  • Re: Sysadmin trying to learn Oracle...help!
    ... > systems with various versions of Oracle running on Solaris. ... > coworker set up a database for me that I believe is fairly simple, ... Out of these two books I have, they cover a lot of high end ... > the "describe" command. ...
    (comp.databases.oracle.server)
  • Re: Oracle database not starting up after reboot on RedHat ES4 Linux !
    ... I have Oracle database enterprise edition 9.2.0.4 running on RedHat ... Although the installation went fine and after the ... I am using dbstart command via a /etc/init.d/oracle file. ...
    (comp.databases.oracle.server)
  • Re: optimal method for purging time based data
    ... databases (oracle obviously included). ... and periodically we have a routine in our code that "purges" the ... This is one of those things where using a database blind command can ...
    (comp.databases.oracle.server)
  • AIX 4.3.3 , Oracle 7.3.4 coredumps at startup
    ... I need to move an Oracle 7.3.4 database and application from an old ... the apps that are now reserved words in Oracle 8 and later. ... Will Oracle 7.3.4 compiled for AIX 4.1/4.2 just simply not run on this ...
    (comp.unix.aix)