On Wed, Apr 25, 2012 at 03:51:30PM +0800, Liu Yuan wrote:
> I am more interested in how do you plan to deal with block_cb()? We
> already meet some subtle problem that cluster gets hung at block state
> for ever running a 1000 sheep daemon on dozen of machines, but not yet
> come to any conclusion useful. We can only say that the block mechanism
> would leave some holes to hang the whole cluster by only several minor
> failed nodes (be it whether EIO-exiting or down).

I haven't looked into a better scheme yet - I just identified that the
area needs way more work than a simple cleanup, that's why I didn't
touch it for now.