Full system recover: Solaris 8 x86 and Legato NetWorker 6.1

Hi all,

We have only two systems left to migrate off FreeBSD, and these require
an older backup client, and therefore our older backup server, which runs
Solaris 8 on an x86 clone with Legato NetWorker 6.1.

There was a problem with the tape unit one night, and the next morning
we shut the system down and rebooted it. When it came up, there were
severe file system errors on all partitions of our disks. Repair attempts
from a Solaris 8 environment booted from CD-ROM failed outright on some
partitions, and on others fsck reported losing a huge number of files.

Solaris was reinstalled, with patches for NetWorker 6.1 and the st driver.
The indexes were rebuilt by running the scanner command against the last
full backup tape of the backup server. Then a recover was run with all
partitions added (add /, add /var, add /usr, etc.). I elected to overwrite
the files in place on the running system. I was outside the server
room when an email arrived for root, which meant that sendmail was
somehow now configured to route mail. I found this a little surprising,
as the service had not been restarted during the recover. The message
stated that there had been twenty-something errors on /dev/rmt/0ubn
and that the device was now disabled.

I was expecting to wait until the recover was complete, fix up the
vfstab to match the current system's partition assignments, and then
reboot. Instead, an operator with good knowledge of the backup software
told me the screen had gone blue. Somehow, the system had
spontaneously rebooted before the recover was complete.
The screen showed "Solaris Primary Boot" or something similar.
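
For the vfstab fix-up step mentioned above, here is a minimal sketch of
how the device columns could be remapped after a recover. It assumes the
standard seven-column vfstab layout and that Python happens to be
available on the box; the function name remap_vfstab and the slice names
in the usage note are hypothetical, not taken from this system.

```python
def remap_vfstab(text, device_map):
    """Return vfstab text with slice names replaced per device_map.

    Only the first two columns (device to mount, device to fsck) are
    touched. Comments, blank lines, and lines that do not have the
    usual seven fields pass through unchanged.
    """
    out = []
    for line in text.splitlines():
        stripped = line.strip()
        fields = line.split()
        if not stripped or stripped.startswith("#") or len(fields) != 7:
            out.append(line)
            continue
        for i in (0, 1):
            # Split /dev/dsk/c0t0d0s0 into directory and slice name.
            prefix, sep, name = fields[i].rpartition("/")
            if name in device_map:
                fields[i] = prefix + sep + device_map[name]
        out.append("\t".join(fields))
    return "\n".join(out) + "\n"
```

It might be used by reading the recovered /etc/vfstab, calling
remap_vfstab(text, {"c0t0d0s0": "c0t1d0s0"}) with whatever old-to-new
slice mapping applies, and reviewing the result by hand before writing
it back. Entries such as "-" in the fsck column are left alone because
they never match a key in the mapping.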

I'm looking for opinions on the method of recovering
over a live system. I've done a full recover over a live system
before, with Solaris 7 on a SPARC server, and did not have
any problem like this.

Given the initial corruption of the disks, the trouble getting
the tape unit to respond, the email about device
errors, and this second instance of the system going kaflooey,
I'm thinking we may have an intermittent hardware fault,
maybe in one of the SCSI controllers.

Should I be recovering to a second system disk and then doing
installboot, etc., or can I exclude certain files from the recover
(e.g. /etc/mnttab) in case overwriting them triggered the reboot?

I'd like to hear some opinions on this...
sunmanagers mailing list
