Re: Clustering



Bill Gunshannon wrote:

In article <FD827B33AB0D9C4E92EACEEFEE2BA2FB773691@xxxxxxxxxxxxxxxxxxxxxxxxxxxxx>,
	"Main, Kerry" <Kerry.Main@xxxxxx> writes:

                       As I recall there was a university that ran
something like 120 WS's/servers in a cluster.



This brings up an interesting question (at least for me!)

You mentioned WS's above and I assume that means WorkStations.  What,
if anything, would be the advantage of building a cluster of, let's say,
2 multi-processor Vaxen like I currently have in the department and
a dozen or so VS3100's?  Could all the VS3100's run diskless getting
all their support from the HSJ served disks on the big boxes?  Assuming
the cluster traffic was all on a private ethernet and access to the
world was only through the two big boxes would performance be good
enough?  Is there something important I missed because I really have
no idea how a VMS Cluster works, never having built one but considering
it now. (Especially if it can make the whole system more visible locally!)

Then, of course, would come the biggest question.  Does HP have a bunch
of fully loaded VS3100's with big monitors that they want to truck up here
as a donation so I can build a this dream VAX lab.  :-)

I set up something like this once. Once of the questions you need to ask your self is: How important is reboot time for the cluster?


I once saw a cluster of 80-90 some VS3100s all being served by a single 8800 as the boot/disk server. From power up to first login on all workstations took over four hours. (A 10MB ethernet can only do so much.)

The cluster I setup up had close to 40 VS3100s.  The plan I used was:

1) Every machine has a page/swap disk. (In this case, they were 52MB RZ22s.)

2) Every fifth machine is a boot server. These machines had enough disk space for all software (VMS and layered apps), but no user data. I think we used 200MB RZ24s.

3) Each boot server serves four other machines. All machines in this "sub-cluster" are on the same ethernet segment.

4) The boot servers were kept synchronized with RSM. (Is that product still supported?) Except for SCS IDs and such, all boot servers were identical.

5) Boot servers were used as user workstations, just like the "diskless" machines. In fact, the users were not aware of which machines were boot servers and which were not.

6) All user data was stored on a "disk server" which was not used as a user workstation. This would be your HSJ machines. All shared cluster files (SYSUAF, queue files, etc.) were stored on the disk server.

7) The disk server was the only machine with votes. (If I redid this, I would consider spreading some votes around to help spread lock mastering, but this didn't seem to cause any performance problems for us.) The disk server did nothing but file serving, print queue execution, and lock mastering.

This configuration worked very well. Time from all machines off to login at all machines was about 15 minutes. This was with VS3100-30s.

All machines except the disk server were used as user workstations. User's could use any workstation interchangably.

User machines had no local data, so they did not need to be backed up.

Boot machines were identical to each other, so they could be restored from another boot machine's disk, so they did not need to be backed up.

The only files that needed to be backed up were the user files on the disk server.

--
-----------------------------------------------------------------------
Chris Scheers, Applied Synergy, Inc.

Voice: 817-237-3360            Internet: chris@xxxxxxxxxxxxxxxxxxx
  Fax: 817-237-3074
.



Relevant Pages

  • Re: Q329873, problems with DCs machine account, need help!
    ... Your best path is to disjoin and then rejoin the workstations to the domain. ... The NETDOM command would do most of the same functions and you may as well ... > couple Mac OSX 10.3 machines to participate in AD. ... > server, so that's obviously not good enough. ...
    (microsoft.public.win2000.active_directory)
  • RE: Windows Universal Storage Server 2003 quits working
    ... day) stops communicating with the domain controllers. ... but no other machines are affected (either servers or ... Local workstations can access files, ... The Server is a Dell NX1950 with a MD3000 Raid Enclosure. ...
    (microsoft.public.windows.server.general)
  • Re: Access 97 placing info in wrong field when record is edited or upd
    ... If all machines claim to be SR-2, ... > of the workstations contained the dap350.dll, nor did the server. ... >> The table (called workorder table) that this form originates ...
    (microsoft.public.access.forms)
  • SMS Advanced Clients
    ... 2nd server has SQL, SMS and is the sole MP. ... I am able to locate my network machines that I wish to make clients and even ... Viewing the workstations ... they appear to have installed the client software. ...
    (microsoft.public.sms.setup)
  • Re: SBS 2003 Misconfigured?
    ... up one of the workstations via remote web connection, ... but why are you looking at the server rather than the workstation? ... (this will show you the DHCP lease info). ... The Netgear, or whatever you use as your gateway to get out to the Internet. ...
    (microsoft.public.windows.server.sbs)