[Sheepdog] PATCH S003: Handle master crashing before sending JOIN request

Liu Yuan namei.unix at gmail.com
Sat Apr 28 11:42:12 CEST 2012


On 04/27/2012 03:12 PM, Shevek wrote:

> Our patch is required because if the master never sends a join_response,
> the secondary sheep will have a blocked JOIN event for itself in its
> queue, and has never set join_finished, so has never built cpg_nodes,
> and cannot set .gone. The c4e3 patch only works if the master has
> unblocked the JOIN event by sending a join_response.


After a second thought, c4e3 works at least for following scenario:


cluster(master A, other nodes)
					X joins
cluster(master A crashed, other nodes)
  now mastership is transferred to one
  in 'other nodes', say B
cluster(master B, other nodes)
  B is responsible to send join_response
					X join finished.

Thanks,
Yuan



More information about the sheepdog mailing list