[Sheepdog] partition recovery algorithm

Chris Webb chris at arachsys.com
Wed Nov 4 11:59:06 CET 2009


MORITA Kazutaka <morita.kazutaka at lab.ntt.co.jp> writes:

> Are you referring to the network partition problem (split-brain)?
> The current implementation of Sheepdog does not tolerate network
> partitions.
> We are thinking of using a majority voting algorithm to deal with this
> problem, but the details have not been discussed enough.

I think this is one of the areas where porting from JGroups to a cluster
engine like Corosync may help you, quite apart from getting rid of the JVM
dependency. Corosync already has a well-tested quorum service, which can be
used to protect against network partition (split-brain) issues.
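
As a rough illustration of the majority rule behind a quorum service, here
is a minimal sketch. It is not Corosync's API or Sheepdog code: the function
name has_quorum, the node names, and the fixed expected vote count are all
assumptions made up for the example. The idea is that a partition may carry
on only if it can see strictly more than half of the expected votes, so at
most one side of a split can stay active.

```python
def has_quorum(reachable_nodes, expected_votes):
    """Return True if this partition holds a strict majority of the votes.

    Hypothetical helper: one vote per node, duplicates ignored.
    """
    return len(set(reachable_nodes)) > expected_votes // 2


# Hypothetical five-node cluster that splits into two partitions.
cluster = ["node1", "node2", "node3", "node4", "node5"]
side_a = ["node1", "node2", "node3"]
side_b = ["node4", "node5"]

if __name__ == "__main__":
    for name, side in (("side_a", side_a), ("side_b", side_b)):
        state = "quorate" if has_quorum(side, len(cluster)) else "not quorate"
        print(name, state)
```

With an even number of nodes, an exact half-and-half split leaves neither
side with a strict majority, so both sides stop rather than risk diverging.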

Cheers,

Chris.


