[Sheepdog] qemu-img convert slowness and high availability status

MORITA Kazutaka morita.kazutaka at lab.ntt.co.jp
Wed Jun 15 09:49:58 CEST 2011


At Tue, 14 Jun 2011 22:06:59 +0200,
krimson wrote:
> 
> I need to give a little more info on the cluster join fail problem:
> 
> node 1:
> # sheep -f /data/sheep
> sheep: jrnl_recover(2221) Openning the directory 
> /data/sheep/journal/00000003/.
> sheep: set_addr(1595) addr = 172.16.1.1, port = 7000
> sheep: main(144) Sheepdog daemon (version 0.2.3) started
> sheep: get_cluster_status(408) sheepdog is waiting with newer epoch, 1 3 
> 172.16.1.2:7000 (when I start sheep on node 2)
> 
> node 2:
> # sheep -f /data/sheep
> sheep: jrnl_recover(2221) Openning the directory 
> /data/sheep/journal/00000001/.
> sheep: jrnl_recover(2226) start jrnl_recovery.
> sheep: jrnl_recover(2267) end jrnl_recovery.
> sheep: set_addr(1595) addr = 172.16.1.2, port = 7000
> sheep: main(144) Sheepdog daemon (version 0.2.3) started
> sheep: send_join_request(1048) 33624236 17428
> sheep: update_cluster_info(568) failed to join sheepdog, 66

Could you apply the patch I just sent minutes ago and give me the
output of the following commands?  It would really help my debugging.

 $ collie cluster info -a 172.16.1.1
 $ collie cluster info -a 172.16.1.2

The patch is also in my development repository:
  git://github.com/kazum/sheepdog.git


Thanks,

Kazutaka



More information about the sheepdog mailing list