[sheepdog-users] restart cluster after update

Valerio Pachera sirio81 at gmail.com
Fri May 3 11:37:37 CEST 2013


Hi, finally I have some time for testing.

I've upgraded my hosts to 0.5.5_318_gc17f5d7.
My nodes currently use a single disk each, so I'm trying to start the
cluster as usual.

sheep /mnt/sheepdog

Running 'collie node list', I got
  "There are no active sheep daemons"

So I killed all the sheep daemons and restarted corosync.
After that, 3 nodes show

collie node list
M   Id   Host:Port         V-Nodes       Zone
-    0   192.168.2.44:7000      64  738371776
-    1   192.168.2.45:7000      64  755148992
-    2   192.168.2.46:7000      64  771926208

but the last node shows only itself

collie node list
M   Id   Host:Port         V-Nodes       Zone
-    0   192.168.2.47:7000      64  788703424

I checked the time and the corosync configuration on all nodes, and they
are correct.

There's almost nothing on this cluster, but I would like to be able to start
it.
It's important to know that the cluster will start again after an update.

-------
More info

collie cluster info
Cluster status: Waiting for other nodes to join cluster

Cluster created at Tue Apr  9 18:36:23 2013

(Any node)
Epoch Time           Version
2013-04-09 18:36:23      1 [192.168.2.44:7000, 192.168.2.45:7000,
192.168.2.46:7000, 192.168.2.47:7000]
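
To make the mismatch explicit, here is a minimal sketch that compares the
epoch 1 membership above with what each side reports in 'collie node list'.
The addresses are copied from the output in this mail; the names
EPOCH_NODES, VIEWS, missing_nodes and is_partitioned are just illustrative
helpers, not part of sheepdog or collie.

```python
# Membership recorded at epoch 1, copied from "collie cluster info".
EPOCH_NODES = {
    "192.168.2.44:7000",
    "192.168.2.45:7000",
    "192.168.2.46:7000",
    "192.168.2.47:7000",
}

# What each side currently reports in "collie node list".
VIEWS = {
    "nodes 44-46": {
        "192.168.2.44:7000",
        "192.168.2.45:7000",
        "192.168.2.46:7000",
    },
    "node 47": {"192.168.2.47:7000"},
}


def missing_nodes(epoch_nodes, view):
    """Return the epoch members that a given view does not see, sorted."""
    return sorted(epoch_nodes - view)


def is_partitioned(views):
    """True if the views disagree with each other about membership."""
    return len({frozenset(v) for v in views.values()}) > 1


if __name__ == "__main__":
    for name, view in VIEWS.items():
        print(name, "is missing", missing_nodes(EPOCH_NODES, view))
    print("split membership:", is_partitioned(VIEWS))
```

Neither side sees all four members of epoch 1, which matches the
"Waiting for other nodes to join cluster" status on every node.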

(nodes 192.168.2.44, .45, .46)
collie node info
Id      Size    Used    Use%
Response's result: Waiting for other nodes to join cluster
Response's result: Waiting for other nodes to join cluster
Response's result: Waiting for other nodes to join cluster
Cannot get information from any node

(node 192.168.2.47)
collie node info
Id      Size    Used    Use%
Response's result: Waiting for other nodes to join cluster
Cannot get information from any nodes

(node 192.168.2.47)
sheep.log
Apr 12 16:59:32 [deletion] objlist_deletion_work(186) VDI (9533ed) is still in use, can not be deleted
Apr 12 16:59:36 [deletion] objlist_deletion_work(186) VDI (a34c67) is still in use, can not be deleted
Apr 12 18:16:10 [main] main(691) shutdown
May 03 11:04:42 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:04:42 [main] send_join_request(1082) IPv4 ip:192.168.2.47 port:7000
May 03 11:04:42 [main] check_host_env(381) WARN: Allowed open files 1024 too small, suggested 1024000
May 03 11:04:42 [main] check_host_env(390) Allowed core file size 0, suggested unlimited
May 03 11:04:42 [main] main(745) sheepdog daemon (version 0.5.5_318_gc17f5d7) started
May 03 11:15:30 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:15:32 [main] corosync_init(774) failed to initialize cpg (6) - is corosync running?
May 03 11:15:32 [main] main(689) failed to create sheepdog cluster
May 03 11:15:32 [main] crash_handler(487) sheep pid 2895 exited unexpectedly.
May 03 11:16:19 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:16:21 [main] corosync_init(774) failed to initialize cpg (6) - is corosync running?
May 03 11:16:21 [main] main(689) failed to create sheepdog cluster
May 03 11:16:21 [main] crash_handler(487) sheep pid 2909 exited unexpectedly.
May 03 11:17:06 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:17:06 [main] send_join_request(1082) IPv4 ip:192.168.2.47 port:7000
May 03 11:17:06 [main] check_host_env(381) WARN: Allowed open files 1024 too small, suggested 1024000
May 03 11:17:06 [main] check_host_env(390) Allowed core file size 0, suggested unlimited
May 03 11:17:06 [main] main(745) sheepdog daemon (version 0.5.5_318_gc17f5d7) started
May 03 11:17:06 [main] update_cluster_info(862) status = 4, epoch = 1, finished: 0
May 03 11:18:54 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:18:54 [main] send_join_request(1082) IPv4 ip:192.168.2.47 port:7000
May 03 11:18:54 [main] check_host_env(381) WARN: Allowed open files 1024 too small, suggested 1024000
May 03 11:18:54 [main] check_host_env(390) Allowed core file size 0, suggested unlimited
May 03 11:18:54 [main] main(745) sheepdog daemon (version 0.5.5_318_gc17f5d7) started
May 03 11:18:54 [main] update_cluster_info(862) status = 4, epoch = 1, finished: 0
May 03 11:21:23 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:21:23 [main] send_join_request(1082) IPv4 ip:192.168.2.47 port:7000
May 03 11:21:23 [main] check_host_env(381) WARN: Allowed open files 1024 too small, suggested 1024000
May 03 11:21:23 [main] check_host_env(390) Allowed core file size 0, suggested unlimited
May 03 11:21:23 [main] main(745) sheepdog daemon (version 0.5.5_318_gc17f5d7) started
May 03 11:21:23 [main] update_cluster_info(862) status = 4, epoch = 1, finished: 0
May 03 11:32:56 [main] md_add_disk(164) /mnt/sheepdog/obj, nr 1
May 03 11:32:56 [main] send_join_request(1082) IPv4 ip:192.168.2.47 port:7000
May 03 11:32:56 [main] check_host_env(381) WARN: Allowed open files 1024 too small, suggested 1024000
May 03 11:32:56 [main] check_host_env(390) Allowed core file size 0, suggested unlimited
May 03 11:32:56 [main] main(745) sheepdog daemon (version 0.5.5_318_gc17f5d7) started
May 03 11:32:56 [main] update_cluster_info(862) status = 4, epoch = 1, finished: 0

-------------

Thank you.