[sheepdog] [PATCH v3] sheep: remove master node
Kai Zhang
kyle at zelin.io
Wed Jul 24 11:13:31 CEST 2013
On Jul 24, 2013, at 2:53 PM, MORITA Kazutaka <morita.kazutaka at lab.ntt.co.jp> wrote:
> At Tue, 23 Jul 2013 17:30:03 +0800,
> Kai Zhang wrote:
>>
>> On Jul 23, 2013, at 4:44 PM, MORITA Kazutaka <morita.kazutaka at lab.ntt.co.jp> wrote:
>>
>>> Ah, sorry. The node A doesn't start until the nodes B, C, and D come
>>> back. It is because the latest epoch in the node A includes B, C, and
>>> D.
>>
>> Well, it seems I didn't fully understand the current implementation of cluster driver.
>>
>> A very silly question: if B, C come back but D does not, what is the status of
>> the cluster? It can work or just wait for D?
>
> The cluster status will be SD_STATUS_WAIT. It will wait for the node
> D to join Sheepdog if you don't run "collie cluster recover force".
>
Does this mean that sheepdog is not self-healing?
Any persistent failure of sheep will be handled by administrator?
If so, I think there is no need for auto-recover.
Recover should happen when administrator call "collie cluster recover force".
Thanks,
Kyle
More information about the sheepdog
mailing list