<div dir="ltr"><div><div>Hi, this is a problem I had long time ago when I was using corosync.<br></div>I see it after long time using zookeeper:<br><br>dog node list<br><br>[1] 12:13:57 [SUCCESS] test004<br> Id Host:Port V-Nodes Zone<br>
0 <a href="http://192.168.10.4:7000">192.168.10.4:7000</a> 127 67807424<br> 1 <a href="http://192.168.10.6:7000">192.168.10.6:7000</a> 129 101361856<br> 2 <a href="http://192.168.10.7:7000">192.168.10.7:7000</a> 129 118139072<br>
[2] 12:13:57 [SUCCESS] test005<br> Id Host:Port V-Nodes Zone<br> 0 <a href="http://192.168.10.4:7000">192.168.10.4:7000</a> 127 67807424<br> 1 <a href="http://192.168.10.5:7000">192.168.10.5:7000</a> 128 84584640<br>
2 <a href="http://192.168.10.6:7000">192.168.10.6:7000</a> 128 101361856<br> 3 <a href="http://192.168.10.7:7000">192.168.10.7:7000</a> 128 118139072<br>[3] 12:13:57 [SUCCESS] test006<br> Id Host:Port V-Nodes Zone<br>
0 <a href="http://192.168.10.4:7000">192.168.10.4:7000</a> 127 67807424<br> 1 <a href="http://192.168.10.6:7000">192.168.10.6:7000</a> 129 101361856<br> 2 <a href="http://192.168.10.7:7000">192.168.10.7:7000</a> 129 118139072<br>
[4] 12:13:57 [SUCCESS] test007<br> Id Host:Port V-Nodes Zone<br> 0 <a href="http://192.168.10.4:7000">192.168.10.4:7000</a> 127 67807424<br> 1 <a href="http://192.168.10.5:7000">192.168.10.5:7000</a> 128 84584640<br>
2 <a href="http://192.168.10.6:7000">192.168.10.6:7000</a> 128 101361856<br> 3 <a href="http://192.168.10.7:7000">192.168.10.7:7000</a> 128 118139072<br><br></div>These are the last 7 raws of sheep.log.<br>
As you can see, they are different in the nodes showing all 4 nodes (test005 and test007).<br><div><br>[1] 12:24:45 [SUCCESS] test004<br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 93% <br>
Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 94% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 95% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 96% <br>
Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 97% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 98% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 99% <br>
[2] 12:24:45 [SUCCESS] test005<br>Jul 04 18:19:08 INFO [main] zk_leave(985) leaving from cluster<br>Jul 07 12:10:28 INFO [main] md_add_disk(343) /mnt/sheep/0, vdisk nr 220, total disk 1<br>Jul 07 12:10:28 NOTICE [main] get_local_addr(519) found IPv4 address<br>
Jul 07 12:10:28 INFO [main] send_join_request(828) IPv4 ip:192.168.10.5 port:7000<br>Jul 07 12:10:28 NOTICE [main] nfs_init(600) nfs server service is not compiled<br>Jul 07 12:10:28 INFO [main] check_host_env(493) Allowed open files 1024000, suggested 6144000<br>
Jul 07 12:10:28 INFO [main] main(942) sheepdog daemon (version 0.8.0_223_ge4735ba) started<br>[3] 12:24:45 [SUCCESS] test006<br>Jul 07 12:10:40 INFO [main] recover_object_main(906) object recovery progress 94% <br>Jul 07 12:10:40 INFO [main] recover_object_main(906) object recovery progress 95% <br>
Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 96% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 97% <br>Jul 07 12:10:41 INFO [main] recover_object_main(906) object recovery progress 98% <br>
Jul 07 12:10:42 INFO [main] recover_object_main(906) object recovery progress 99% <br>Jul 07 12:10:42 INFO [main] recover_object_main(906) object recovery progress 100% <br>[4] 12:24:45 [SUCCESS] test007<br>Jul 04 18:19:08 INFO [main] zk_leave(985) leaving from cluster<br>
Jul 07 12:10:34 INFO [main] md_add_disk(343) /mnt/sheep/0, vdisk nr 220, total disk 1<br>Jul 07 12:10:34 NOTICE [main] get_local_addr(519) found IPv4 address<br>Jul 07 12:10:34 INFO [main] send_join_request(828) IPv4 ip:192.168.10.7 port:7000<br>
Jul 07 12:10:35 NOTICE [main] nfs_init(600) nfs server service is not compiled<br>Jul 07 12:10:35 INFO [main] check_host_env(493) Allowed open files 1024000, suggested 6144000<br>Jul 07 12:10:35 INFO [main] main(942) sheepdog daemon (version 0.8.0_223_ge4735ba) started<br>
<br></div><div>This is a testing cluster with 4 nodes, Sheepdog daemon version 0.8.0_223_ge4735ba, and zookeeper.<br><br></div><div>What may cause this?<br></div></div>