[sheepdog-users] Sheep failing re-joing

Valerio Pachera sirio81 at gmail.com
Fri Nov 15 16:19:38 CET 2013


I just made a "crash" test unplugging the switch power supply.
Sheep deamons died but corosync is still active.

I try the to restart the cluster sunning sheep on all nodes (one after the
other).

And happens that strange thing:

root at test005:~# dog node list
There are no active sheep daemons

root at test005:~# pgrep -lf sheep
10884 sheep -n /var/sheep /mnt/sheep/dsk01,/mnt/sheep/dsk02 -i
host=192.168.10.5 port=3333
10885 sheep -n /var/sheep /mnt/sheep/dsk01,/mnt/sheep/dsk02 -i
host=192.168.10.5 port=3333

root at test005:~# dog node list
  Id   Host:Port         V-Nodes       Zone
   0   192.168.2.45:7000        128  755148992

sheep.log
Nov 15 16:06:02   INFO [main] md_add_disk(310) /mnt/sheep/dsk01, vdisk nr
220, total disk 1
Nov 15 16:06:02   INFO [main] md_add_disk(310) /mnt/sheep/dsk02, vdisk nr
233, total disk 2
Nov 15 16:06:02   INFO [main] send_join_request(777) IPv4 ip:192.168.2.45
port:7000
Nov 15 16:06:02   INFO [main] check_host_env(424) Allowed open files
1024000, suggested 6144000
Nov 15 16:06:02   INFO [main] main(838) sheepdog daemon (version
0.7.0_197_g9f718d2) started

It seems corosync problem related.

(PS: Yes, I'l try also zookeeper but I need more time).
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wpkg.org/pipermail/sheepdog-users/attachments/20131115/3dbed2d4/attachment-0004.html>


More information about the sheepdog-users mailing list