mismatch_cnt != 0, member content mismatch, but md says the mirror is good

Tom Buskey tom at buskey.name
Tue Feb 23 09:40:26 EST 2010


On Mon, Feb 22, 2010 at 3:14 PM, Benjamin Scott <dragonhawk at gmail.com>wrote:

> On Mon, Feb 22, 2010 at 1:39 PM, Michael ODonnell
> <michael.odonnell at comcast.net> wrote:
> > So far, then, it's looking like every Sunday at 4:22 all the RAIDs
> > (all types or just RAID1?) in standard x86_64 CentOS5.4 (and RHAT?)
> > boxes are broken and then resync'd.
>
>
FWIW, I don't see this in my logs (back to 1/24/2010, not far) on Fedora 12.


>   All types (as I interpret the script source).
>
>  If the documentation is to be believed, they are not being broken;
> they are being checked for consistency.  Not the same thing.  Breaking
> and rebuilding leaves the array vulnerable during the rebuild, as you
> note.  A consistency check just compares the supposedly identical
> members to confirm they really *are* identically, and warns you if
> they are not.
>
>



>  With a good RAID implementation, I/O for patrol reads is done when
> the array is idle.  (Kind of like "nice 19" for I/O.)  I don't know if
> Linux does this or not.
>
>
The correct terminology is a scrub.  I think most RAID systems can do it.
It's like a fsck - something that checks the RAID structures and data to
find inconsistancies so they can be dealt with.

Scrubs can be done live and are a good thing do.  They take IO and time.
ZFS doesn't have fsck, but it does do scrubs.  Hardware RAID can do scrubs
as well.

>From man zpool on Solaris:

Scrubbing and resilvering are very  similar  operations.
         The  difference  is  that resilvering only examines data
         that ZFS knows to be out  of  date  (for  example,  when
         attaching  a  new  device  to  a  mirror or replacing an
         existing device), whereas scrubbing examines all data to
         discover  silent  errors  due to hardware faults or disk
         operations, ZFS only allows one at a time. If a scrub is
         already in progress,  the  "zpool  scrub"  command  ter-
         minates  it  and starts a new scrub. If a resilver is in
         progress, ZFS does not allow a scrub to be started until
         the resilver completes.


How often to do them gets debated in the forums.  Times vary with activity,
hardware, size.  My consumer grade RAIDZ of 4 500GB SATA II takes an hour.

Your RAID system will be different.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.gnhlug.org/mailman/private/gnhlug-discuss/attachments/20100223/f83a55eb/attachment.html 


More information about the gnhlug-discuss mailing list