I recently purchased a Rosewill RC-218 four-port SATA card. It uses the Marvell 88SX7042 chipset, support for which I saw mav commit in late 2010. I have four Samsung HD154UI 1.5TB drives connected to it in a Supermicro CSE-M35T1 backplane.
I backed up my existing three-drive raidz and then created a new raidz2 pool using all seven drives. All seemed to be working fine, so I began copying my backup data onto the new pool last night. Upon waking up this morning I saw that all of the drives on the card had timed out and been removed by mvs.
Code:
mvsch0: Timeout on slot 0
mvsch0: iec ffffffff sstat ffffffff serr ffffffff edma_s ffffffff dma_c ffffffff dma_s ffffffff rs 00000001 status ff
mvsch0: stopping EDMA engine failed
mvsch2: (ada0:Timeout on slot 0mvsch0:0:
mvsch2: 0:iec ffffffff sstat ffffffff serr ffffffff edma_s ffffffff dma_c ffffffff dma_s ffffffff rs 00000001 status ff0):
lost device
(ada0:mvsch0:0:0:0): Invalidating pack
mvsch2: stopping EDMA engine failed
(ada2:mvsch2:0:0:0): lost device
(ada2:mvsch2:0:0:0): Invalidating pack
(ada0:mvsch0:0:0:0): Synchronize cache failed
(ada0:mvsch0:0:0:0): removing device entry
(ada2:mvsch2:0:0:0): Synchronize cache failed
(ada2:mvsch2:0:0:0): removing device entry
mvsch1: Timeout on slot 0
mvsch1: iec ffffffff sstat ffffffff serr ffffffff edma_s ffffffff dma_c ffffffff dma_s ffffffff rs 00000001 status ff
mvsch1: stopping EDMA engine failed
mvsch3: (ada1:Timeout on slot 0mvsch1:0:
mvsch3: 0:iec ffffffff sstat ffffffff serr ffffffff edma_s ffffffff dma_c ffffffff dma_s ffffffff rs 00000001 status ff0):
lost device
(ada1:mvsch1:0:0:0): Invalidating pack
mvsch3: stopping EDMA engine failed
(ada3:mvsch3:0:0:0): lost device
(ada3:mvsch3:0:0:0): Invalidating pack
(ada1:mvsch1:0:0:0): Synchronize cache failed
(ada1:mvsch1:0:0:0): removing device entry
(ada3:mvsch3:0:0:0): Synchronize cache failed
(ada3:mvsch3:0:0:0): removing device entry
zpool status showed some read and write errors, and I was instructed to run zpool clear once the devices had been reconnected. I had to reboot to get the drives recognized and online again; simply removing and re-inserting them in the hot-swap bay did nothing. After a reboot zpool status shows the pool is healthy and happy, but this definitely has me nervous.
Code:
  pool: storage
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        storage     ONLINE       0     0     0
          raidz2    ONLINE       0     0     0
            ad0     ONLINE       0     0     0
            ad1     ONLINE       0     0     0
            ad12    ONLINE       0     0     0
            ada0    ONLINE       0     0     0
            ada1    ONLINE       0     0     0
            ada2    ONLINE       0     0     0
            ada3    ONLINE       0     0     0

errors: No known data errors
Thoughts on what could be going on?