[sheepdog] [PATCH v7 3/7] sheep: rejoin cluster after a zookeeper session timeout
Kai Zhang
kyle at zelin.io
Wed Jun 26 03:12:06 CEST 2013
On Jun 25, 2013, at 11:06 PM, Hitoshi Mitake <mitake.hitoshi at gmail.com> wrote:
> As you say, the rejoin would be an only way to handle session timeout
> correctly. But the current zookeeper driver produces serious problems
> when network failures happen (e.g. inconsistent epochs).
>
> So I believe the panic() or exit() would be better than doing
> nothing. If sheeps with zookeeper driver exits immediately in the
> above case, we can restart sheeps manually.
> # I understand this solution goes against the policy of sheepdog... :(
>
I see. Do you mean a separate patch based on upstream? or based on
PATCH 1/7 and 2/7?
Because these patches have been reviewed by Kazutaka and Yuan,
I think they will be merged soon after some minor modifications.
Would you mind that we merge the whole series to the stable branch later?
> And our internal team needs the solution until this Thursday (we have
> a local change for this problem but it is a temporal and dirty
> thing). If you can help us, I'm very happy :)
Our team is also waiting for this patch for a long time :)
Thanks,
Kyle
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wpkg.org/pipermail/sheepdog/attachments/20130626/f0c2cad1/attachment-0004.html>
More information about the sheepdog
mailing list