[sheepdog-users] vdi corrupted after recovery

Valerio Pachera sirio81 at gmail.com
Thu Jun 26 20:46:17 CEST 2014


2014-06-26 17:01 GMT+02:00 Valerio Pachera <sirio81 at gmail.com>:

> I disconnected a node by dog node kill.
> After recovery I stopped and restarted the cluster because I had to move
> some cables.
> After that, I run one of the two guest but It was comlaining about
> filesystem errors.
>

After a second thought, I didn't check the guest status before
disconnecting the node (before recovery).
The node I disconnected in past 2 days was working slowly because of the
raid hardware controller cache not enabled.
I'm almost sure the corruption happened because of that.
The guest receives data from another server.
In the server's backup logs of the last night I read many of these errors:

rsync: recv_generator: mkdir
"/opt/zimbra/data/amavisd/tmp/amavis-20140625T092245-03629" failed:
Read-only file system.

Andrew J.H. said sheep daemon v0.7 was crashing in similar circumstances.
It seems 0.82 doesn't crash but doesn't handle it well.

Did any of you experienced that?

This may be replicated forcing the link of a nic at 10M/s
ethtool -s eth0 speed 10 duplex full

Maybe even running sheep without '-n' option (only on one of the nodes).
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wpkg.org/pipermail/sheepdog-users/attachments/20140626/c6c80d45/attachment-0005.html>


More information about the sheepdog-users mailing list