<div dir="ltr"><div><div><div><div>I'm doing a series of test on 2 nodes cluster.<br><br></div>I ended up not being able to add a 3th node and unable to run vdi check.<br><br>root@test006:~# dog node list<br> Id Host:Port V-Nodes Zone<br>
0 <a href="http://192.168.10.4:7000">192.168.10.4:7000</a> 67 67807424<br> 1 <a href="http://192.168.10.6:7000">192.168.10.6:7000</a> 61 101361856<br><br>root@test006:~# dog vdi list<br> Name Id Size Used Shared Creation time VDI id Copies Tag<br>
wheezy_template3 0 10 GB 1.8 GB 0.0 MB 2013-12-18 12:34 832ac1 3 <br><br>root@test006:~# dog vdi check wheezy_template3<br>ABORT: Not enough active nodes for consistency-check<br><br>root@test006:~# dog cluster info<br>
Cluster status: running, auto-recovery enabled<br>Cluster created at Wed Dec 18 09:51:34 2013<br>Epoch Time Version<br>2013-12-18 14:26:46 11 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
2013-12-18 14:26:13 10 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.5:7000">192.168.10.5:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>2013-12-18 14:23:22 9 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
2013-12-18 14:22:50 8 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.5:7000">192.168.10.5:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>2013-12-18 13:06:38 7 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
2013-12-18 13:06:05 6 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.5:7000">192.168.10.5:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>2013-12-18 12:51:04 5 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
2013-12-18 12:44:52 4 [<a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>2013-12-18 12:41:39 3 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
2013-12-18 12:37:34 2 [<a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>2013-12-18 09:51:34 1 [<a href="http://192.168.10.4:7000">192.168.10.4:7000</a>, <a href="http://192.168.10.6:7000">192.168.10.6:7000</a>]<br>
<br><br></div>Sheep.log of the 3th node (test005):<br><br>Dec 18 14:26:13 INFO [main] md_add_disk(141) /mnt/sheep/dsk01, nr 1<br>Dec 18 14:26:13 INFO [main] md_add_disk(141) /mnt/sheep/dsk02, nr 2<br>Dec 18 14:26:14 ERROR [main] get_local_addr(595) found IPv4 address<br>
Dec 18 14:26:14 INFO [main] send_join_request(770) IPv4 ip:192.168.10.5 port:7000<br>Dec 18 14:26:14 INFO [main] main(880) sheepdog daemon (version 0.7.6_rc1) started<br>Dec 18 14:26:14 ERROR [main] init_vdi_state(188) failed to read inode header 8083290e00000000 0<br>
Dec 18 14:26:14 EMERG [main] setup_backend_store(553) PANIC: failed to initialize store<br>Dec 18 14:26:14 EMERG [main] crash_handler(257) sheep exits unexpectedly (Aborted).<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(895) sheep.c:259: crash_handler<br>
Dec 18 14:26:14 EMERG [main] sd_backtrace(909) /lib/x86_64-linux-gnu/libpthread.so.0(+0xf02f) [0x7f3a91cf002f]<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(909) /lib/x86_64-linux-gnu/libc.so.6(gsignal+0x34) [0x7f3a910e3474]<br>
Dec 18 14:26:14 EMERG [main] sd_backtrace(909) /lib/x86_64-linux-gnu/libc.so.6(abort+0x17f) [0x7f3a910e66ef]<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(895) group.c:564: setup_backend_store<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(895) zookeeper.c:911: zk_handle_accept<br>
Dec 18 14:26:14 EMERG [main] sd_backtrace(895) zookeeper.c:1068: zk_event_handler<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(895) event.c:210: do_event_loop<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(895) sheep.c:885: main<br>
Dec 18 14:26:14 EMERG [main] sd_backtrace(909) /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xfc) [0x7f3a910cfeac]<br>Dec 18 14:26:14 EMERG [main] sd_backtrace(909) sheep() [0x405f28]<br>Dec 18 14:26:14 ERROR [main] crash_handler(524) sheep pid 32050 exited unexpectedly.<br>
<br></div>What do you think about this?<br><br>Note: I'm going to format the cluster and hopefully to find out hot to reproduce it.<br></div>You might already understand if there's some sort of bug.<br><br><br><div>
<div><br></div></div></div>