[sheepdog-users] Cluster hung...

Alexandre DERUMIER aderumier at odiso.com
Thu Jul 26 15:36:23 CEST 2012

same here.

I think a small tutorial on the wiki could help.

If we do 3x replication, do we need 5 nodes to avoid hang ?

----- Mail original ----- 

De: "Bastian Scholz" <nimrodxx at gmx.de> 
À: sheepdog-users at lists.wpkg.org 
Envoyé: Jeudi 26 Juillet 2012 15:33:15 
Objet: [sheepdog-users] Cluster hung... 

Hi List, 

I have a small cluster, 3 nodes, with 1 gateway each and on one 
node only one working sheep, and three working sheeps on the 
other two nodes... 

When a node fails, the recovery process starts as expected, but 
when the failed node joins again, the cluster hangs for a long 
time without responding to a lot of collie commands... 
collie node info and collie node recovery dont give an answer 
for at least 20 minutes. 

The connected kvm guest cant access the VDIs in this time and 
the windows guests dont survive this time... 

I am using sheepdog from sheepdog_0.4.0-0+tek2b-7_amd64.deb... 

Could someone explain me briefly what happens here and if I 
can avoid these hung? 


sheepdog-users mailing lists 
sheepdog-users at lists.wpkg.org 




Alexandre D e rumier 

Ingénieur Systèmes et Réseaux 

Fixe : 03 20 68 88 85 

Fax : 03 20 68 90 88 

45 Bvd du Général Leclerc 59100 Roubaix 
12 rue Marivaux 75002 Paris 

More information about the sheepdog-users mailing list