[sheepdog-users] Simultaneous startup of sheep daemon may fail

Valerio Pachera sirio81 at gmail.com
Wed Nov 13 15:39:20 CET 2013


On my testing cluster I noticed that starting all sheeps at the "same
time", may lead to failure in joining the cluster.

parallel-ssh -H 'test004 test005 test006 test007' /root/script/run_sheep.sh

root at test004:~# dog node list
  Id   Host:Port         V-Nodes       Zone
   0   192.168.2.44:7000        128  738371776

root at test005:~# dog node list
  Id   Host:Port         V-Nodes       Zone
   0   192.168.2.45:7000        119  755148992
   1   192.168.2.47:7000        137  788703424

root at test006:~# dog node list
  Id   Host:Port         V-Nodes       Zone
   0   192.168.2.46:7000        128  771926208

root at test007:~# dog node list
  Id   Host:Port         V-Nodes       Zone
   0   192.168.2.45:7000        119  755148992
   1   192.168.2.47:7000        137  788703424

It's not repeatable tough.
I tried to shutdown the cluster and re-run parallel-ssh and all nodes were
showing the right 'node list' (4 nodes total).

It's not a problem for me but I was wondering if anybody else noticed the
same behavior.
I also wonder if may depend on corosync or sheepdog.

I'm running sheep -v
and corosync 1.4.6.

I don't see anything useful in sheep.log

Nov 13 13:01:51   INFO [main] main(845) shutdown
Nov 13 15:11:19   INFO [main] md_add_disk(310) /mnt/sheep/dsk01, vdisk nr
217, total disk 1
Nov 13 15:11:19   INFO [main] md_add_disk(310) /mnt/sheep/dsk02, vdisk nr
233, total disk 2
Nov 13 15:11:19   INFO [main] send_join_request(777) IPv4 ip:192.168.2.44
port:7000
Nov 13 15:11:19   INFO [main] check_host_env(424) Allowed open files
1024000, suggested 6144000
Nov 13 15:11:19   INFO [main] main(838) sheepdog daemon (version
0.7.0_197_g9f718d2) started
Nov 13 15:13:59   INFO [main] md_add_disk(310) /mnt/sheep/dsk01, vdisk nr
217, total disk 1
Nov 13 15:13:59   INFO [main] md_add_disk(310) /mnt/sheep/dsk02, vdisk nr
233, total disk 2
Nov 13 15:13:59   INFO [main] send_join_request(777) IPv4 ip:192.168.2.44
port:7000
Nov 13 15:13:59   INFO [main] check_host_env(424) Allowed open files
1024000, suggested 6144000
Nov 13 15:13:59   INFO [main] main(838) sheepdog daemon (version
0.7.0_197_g9f718d2) started
Nov 13 15:14:41   INFO [main] main(845) shutdown
Nov 13 15:14:53   INFO [main] md_add_disk(310) /mnt/sheep/dsk01, vdisk nr
217, total disk 1
Nov 13 15:14:53   INFO [main] md_add_disk(310) /mnt/sheep/dsk02, vdisk nr
233, total disk 2
Nov 13 15:14:53   INFO [main] send_join_request(777) IPv4 ip:192.168.2.44
port:7000
Nov 13 15:14:53   INFO [main] check_host_env(424) Allowed open files
1024000, suggested 61440
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wpkg.org/pipermail/sheepdog-users/attachments/20131113/fe8d56d1/attachment-0004.html>


More information about the sheepdog-users mailing list