[sheepdog] [PATCH] sheep: add a kill node operation

Liu Yuan namei.unix at gmail.com
Fri Jul 20 13:25:20 CEST 2012


On 07/20/2012 07:09 PM, Dietmar Maurer wrote:
>>> Maybe we can delay the start of recovery for some time (1h)? That way
>>> a normal server reboot does not harm.
>>>
>>>
>>
>> Then how do you handle IOs routed to the down node if you don't recover
>> the membership state? 
> 
> like 'recovery in process'? It simply delays starting copy data by some time.
> 
> For example you could set 'recovery_delay' to:
> 
> 0 => start immediately (current behaviour)
> X => start copying data after X seconds
> MAX_INT => never start (manual)
> 
> When set to a moderate value (5 minutes), you can simple reboot a server without problems.
> 
> Recovery takes quite long (depends on amount of data and network speed), so adding 
> a short delay should not harm?
> 
> - Dietmar
> 
> 

I meant IOs from VMs, you can't simple delay the recovery process. For e.g, object OBJ has three
copies in node A, B, C. Suppose that B is down, before it start up again, how do you handle requests
on OBJ? and when B is back, how do you handle off the updates back to B for OBJ?

-- 
thanks,
Yuan



More information about the sheepdog mailing list