b5a8 [Solved] Unstoppable resilver - The FreeBSD Forums
The FreeBSD Forums  

Go Back   The FreeBSD Forums > Base System > Storage

Storage Place to ask questions about partitioning, labelling, filesystems, encryption or anything else related to storage area.

Reply
 
Thread Tools Display Modes
  #1  
Old April 30th, 2012, 09:54
simplex simplex is offline
Junior Member
 
Join Date: Nov 2010
Posts: 9
Thanks: 0
Thanked 0 Times in 0 Posts
Default Unstoppable resilver

Hi, I've a problem with my ZFS pool on FreeBSD 8.3-RELEASE. The pool is version 15, composed by four disks, two mirror. This was the situation: I had a faulty disk in the second mirror and before I was able to replace it the other one started having problems ("Already active DMA on this device"). I've fixed it disabling DMA (but now I've other errors in dmesg, but that's another problem..).

After booting without DMA the system was able to mount the ZFS pool and the data looked ok. I've replaced the originally dead disk and I've started the resilver. This is the situation now:
Code:
  pool: pr0nserv
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
	continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Mon Apr 30 10:29:09 2012
        284G scanned out of 1.18T at 277M/s, 0h56m to go
        16.6G resilvered, 23.51% done
config:

	NAME                       STATE     READ WRITE CKSUM
	pr0nserv                   DEGRADED     0     0   108
	  mirror-0                 ONLINE       0     0     0
	    ad4                    ONLINE       0     0     0
	    ad6                    ONLINE       0     0     0
	  mirror-1                 DEGRADED     0     0   648
	    replacing-0            DEGRADED   648     0     0
	      6530854401941125969  OFFLINE      0     0     0  was /dev/ad8/old
	      ad8                  ONLINE       0     0   648  (resilvering)
	    ad10                   ONLINE       0     0   648

errors: Permanent errors have been detected in the following files:

        <metadata>:<0x87>
The problem:
I have an error on metadata, and I can't get rid of it. The resilver keeps restarting again and again. If I reboot it, resilver again, if I do a # zpool clear pr0nserv or a # zpool clear pr0nserv mirror-1 the resilver restarts again. I've removed two files that were corrupted but I can't "fix" the metadata error. I think that a scrub could fix it but I can't scrub because it will resilver again

If someone knows how to fix it, please tell me.

I think that a brutal way to fix this could be copy all the files that are in the second mirror, remove it, re-create it and copy the files back but I would avoid this, if possible.

Thanks.

Last edited by DutchDaemon; May 1st, 2012 at 01:25. Reason: Mind your writing style: http://forums.freebsd.org/showthread.php?t=18043
Reply With Quote
  #2  
Old April 30th, 2012, 14:11
simplex simplex is offline
Junior Member
 
Join Date: Nov 2010
Posts: 9
Thanks: 0
Thanked 0 Times in 0 Posts
Default

Looks like I've solved with a # zpool detach pr0nserv 6530854401941125969
Now I'm scrubbing to see if that fixes the metadata error.

Last edited by DutchDaemon; May 1st, 2012 at 01:25.
Reply With Quote
  #3  
Old May 2nd, 2012, 07:04
simplex simplex is offline
Junior Member
 
Join Date: Nov 2010
Posts: 9
Thanks: 0
Thanked 0 Times in 0 Posts
Default

Looks like it's not finished...
I've upgraded the pool to version 28, scrubbed again and cleared the errors but the metadata error is still here:
Code:
[root@pr0nserv ~]# zpool status -v
  pool: pr0nserv
 state: ONLINE
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: http://www.sun.com/msg/ZFS-8000-8A
  scan: scrub repaired 389K in 16h37m with 1 errors on Wed May  2 04:15:07 2012
config:

        NAME        STATE     READ WRITE CKSUM
        pr0nserv    ONLINE       0     0     0
          mirror-0  ONLINE       0     0     0
            ad4     ONLINE       0     0     0
            ad6     ONLINE       0     0     0
          mirror-1  ONLINE       0     0     0
            ad8     ONLINE       0     0     0
            ad10    ONLINE       0     0     0

errors: Permanent errors have been detected in the following files:

        <metadata>:<0x87>
Does someone know how to fix it without destroying and re-creating the pool?
Thanks.

Last edited by DutchDaemon; May 2nd, 2012 at 16:24.
Reply With Quote
  #4  
Old May 2nd, 2012, 11:58
simplex simplex is offline
Junior Member
 
Join Date: Nov 2010
Posts: 9
Thanks: 0
Thanked 0 Times in 0 Posts
Default

I've rebooted the machine and the resilver started again
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
zfs resilver restarts cjyar Storage 4 October 9th, 2012 06:37
9.0 ZFS resilver - 64K/s - 100K/s fbettag Storage 8 January 21st, 2012 20:24
ZFS scrub/resilver confusion mefizto General 5 March 27th, 2011 20:40


All times are GMT +1. The time now is 07:44.


Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2013, vBulletin Solutions, Inc.
The mark FreeBSD is a registered trademark of The FreeBSD Foundation and is used by The FreeBSD Project with the permission of The FreeBSD Foundation.
Web protection and acceleration provided by CloudFlare
0