[sheepdog-users] Cluster hung...

David Douard david.douard at logilab.fr
Thu Jul 26 16:00:20 CEST 2012


On 26/07/2012 15:33, Bastian Scholz wrote:
> Hi List,
> 
> I have a small cluster, 3 nodes, with 1 gateway each and on one
> node only one working sheep, and three working sheeps on the
> other two nodes...

Hi,

Just a question:

Why have a gateway on each node? Is that a recommended
configuration?

David


> 
> When a node fails, the recovery process starts as expected, but
> when the failed node joins again, the cluster hangs for a long
> time without responding to a lot of collie commands...
> collie node info and collie node recovery don't give an answer
> for at least 20 minutes.
> 
> The connected KVM guest can't access the VDIs during this time, and
> the Windows guests don't survive it...
> 
> I am using sheepdog from sheepdog_0.4.0-0+tek2b-7_amd64.deb...
> 
> Could someone briefly explain what happens here and whether I
> can avoid these hangs?
> 
> Thanks
> 
> Bastian


-- 
David DOUARD		LOGILAB
+33 1 45 32 03 12	david.douard at logilab.fr
+33 1 83 64 25 26	http://www.logilab.fr/id/david.douard

Training - http://www.logilab.fr/formations
Development - http://www.logilab.fr/services
Knowledge management - http://www.cubicweb.org/