RE: A little story of failed raid5 (3ware 8000 series)




A day ago at 11 am i have turn off the server,
pull out the old driver, installed a new one, turned of the server
and started rebuild in an hour from remote location via web interface.
After about 5 minuted the machine became unresponsive. Tried rebooting
- nothing. I went to the machine and fingure out, that rebuild failed (0%)
and some data cannot be read because of bad sectors.

Why would you power cycle a RAID 5 array with a failed drive? That's like
the biggest no-no that there is. When you lose a drive on a RAID 5 array,
you are vulnerable until a replacement drive is configured and the array is
rebuilt. Any high risk operations during that time would be foolhardy.

So, no raid5 or even raid 6 for me any more. Never!

Since RAID6 would have saved you from what presumably was a drive failure
before a rebuild could be done, it's hard to understand why you would say
this is a reason to avoid RAID 6. Perhaps you would do better to understand
your failure and avoid the causes of the failure rather than avoiding the
things you happened to be using at the time of the failure.

If you get food poisoning while wearing a blue shirt, the solution is not to
avoid blue shirts in the future.

DS


_______________________________________________
freebsd-stable@xxxxxxxxxxx mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@xxxxxxxxxxx"



Relevant Pages

  • Re: trouble HP SmartArray 6400
    ... A "Predictive Failure", but I don't know what is this. ... (google punctured strip). ... You did not say if you replaced pd4 or not. ... raid, rebuild the raid, restore the data. ...
    (Debian-User)
  • Re: trouble HP SmartArray 6400
    ... A "Predictive Failure", but I don't know what is this. ... (google punctured strip). ... You did not say if you replaced pd4 or not. ... raid, rebuild the raid, restore the data. ...
    (Debian-User)
  • Re: Why does the Cheetah hate me so?
    ... the array can only rebuild to a hot spare if the failure occurs while the system is up. ...
    (comp.sys.ibm.ps2.hardware)
  • Re: Intel abandons USEnet news
    ... This suggests that the probability of encountering a second whole-disk failure during a 5-hour rebuild of a failed disk in a 13-disk RAID-5 array would be 1/20,000 (the reciprocal of (single-disk ... PB storage system you'd need about 1440 750 GB drives in the configuration you specify, so you'd expect about 21 of them to fail over a 2-year period, with a probability of about 1/952 that one of those failures would encounter a second disk failure during the 5-hour rebuild: just about 3 nines, though you did say a fractional PB system rather than a full PB and for 1 or 2 years rather than a full 2 years. ...
    (comp.arch)
  • Re: A little story of failed raid5 (3ware 8000 series)
    ... that rebuild failed and some data cannot be read because of bad ... ports and i needed to check every driver basket to understand which port ... you would say this is a reason to avoid RAID 6. ... better to understand your failure and avoid the causes of the failure ...
    (freebsd-stable)