[sheepdog] [PATCH] sheep: add a kill node operation
Liu Yuan
namei.unix at gmail.com
Fri Jul 20 13:25:20 CEST 2012
On 07/20/2012 07:09 PM, Dietmar Maurer wrote:
>>> Maybe we can delay the start of recovery for some time (1h)? That way
>>> a normal server reboot does not harm.
>>>
>>>
>>
>> Then how do you handle IOs routed to the down node if you don't recover
>> the membership state?
>
> like 'recovery in process'? It simply delays the start of data copying by some time.
>
> For example you could set 'recovery_delay' to:
>
> 0 => start immediately (current behaviour)
> X => start copying data after X seconds
> MAX_INT => never start (manual)
>
> When set to a moderate value (5 minutes), you can simply reboot a server without problems.
>
> Recovery takes quite a long time (depending on the amount of data and network speed), so adding
> a short delay should do no harm?
>
> - Dietmar
>
>
I meant IOs from VMs; you can't simply delay the recovery process. For example, object OBJ has three
copies, on nodes A, B, and C. Suppose B goes down. Before it starts up again, how do you handle requests
on OBJ? And when B is back, how do you hand off the updates for OBJ back to B?
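
To make the scenario concrete, here is a toy Python sketch. It is not sheepdog
code: Node, write, stale_replicas, and the per-object version counter are all
made up for illustration. It only shows that a write issued while B is down
leaves B with an old copy of OBJ once it rejoins.

```python
# Hypothetical model, not sheepdog code.
# Each node maps an object id to a (version, data) pair.
class Node:
    def __init__(self, name):
        self.name = name
        self.up = True
        self.store = {}

    def version_of(self, oid):
        # Version 0 means "this node has never seen the object".
        return self.store.get(oid, (0, None))[0]


def write(nodes, oid, data):
    """Write to every replica that is up; return names of replicas that missed it."""
    version = max(n.version_of(oid) for n in nodes) + 1
    missed = []
    for n in nodes:
        if n.up:
            n.store[oid] = (version, data)
        else:
            missed.append(n.name)
    return missed


def stale_replicas(nodes, oid):
    """Return names of nodes whose copy is older than the newest copy."""
    latest = max(n.version_of(oid) for n in nodes)
    return [n.name for n in nodes if n.version_of(oid) < latest]


a, b, c = Node("A"), Node("B"), Node("C")
nodes = [a, b, c]

write(nodes, "OBJ", "v1")           # all three replicas are up
b.up = False                        # B reboots; recovery is being delayed
missed = write(nodes, "OBJ", "v2")  # a VM keeps writing in the meantime
b.up = True                         # B is back, still holding the old copy
stale = stale_replicas(nodes, "OBJ")
```

After this runs, both missed and stale name B. Someone has to track those
missed updates and push them to B, which is the part a plain delay does not
answer.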
--
thanks,
Yuan