<html><head><style type="text/css"><!-- DIV {margin:0px;} --></style></head><body><div style="font-family:times new roman,new york,times,serif;font-size:12pt">I am also having similar issues making one sheep node connect to another one. <br><br>I am still testing with nodes inside VMWare. I have tried NAT mode vs bridged set up to see if it makes any diffrence, which it seams to make none. I have set up three different nodes. Some times the nodes are able to see each other, but often upon boot none of the nodes are able to see each other. Is there a specified way to make a node to connect to the rest of the cluster? Best way to shut down a node for maintenance? <br><div style="font-family:times new roman, new york, times, serif;font-size:12pt"><div style="font-family:arial, helvetica, sans-serif;font-size:13px"><br>----------------------------------------------------------------------<br><br>Message: 1<br>Date:
Fri, 29 Apr 2011 17:15:43 +0200<br>From: "S. Bonnegent" <<a ymailto="mailto:sebastien.bonnegent@gmail.com" href="mailto:sebastien.bonnegent@gmail.com">sebastien.bonnegent@gmail.com</a>><br>To: <a ymailto="mailto:sheepdog@lists.wpkg.org" href="mailto:sheepdog@lists.wpkg.org">sheepdog@lists.wpkg.org</a><br>Subject: [Sheepdog] sheepdog on ubuntu 11.04<br>Message-ID: <<a ymailto="mailto:4DBAD61F.90204@gmail.com" href="mailto:4DBAD61F.90204@gmail.com">4DBAD61F.90204@gmail.com</a>><br>Content-Type: text/plain; charset="utf-8"<br><br>Hi,<br><br>I try to test Sheepdog on Ubuntu 11.04 64 bits with 2 hosts but I have<br>some troubles to debug the situation.<br><br>I use 2 nodes (PC1 and PC2) with default sheepdog package<br>(0.2.2-0ubuntu1). I added "user_xattr" (my partition is in ext4) in<br>/etc/fstab and configured /etc/corosync/corosync.conf (file is below).<br><br>PC1 is 172.29.22.74<br>PC2 is 172.29.22.78<br><br>On PC1, I start sheepdog
with "service sheepdog start" and I have:<br><br># collie node list<br> Idx Node id (FNV-1a) - Host:Port<br>------------------------------------------------<br>* 0 52b76e70de45e6c8 - 172.29.22.74:7000<br><br># corosync-cpgtool<br>Group Name PID Node ID<br>sheepdog<br> 1070 1242963372 (172.29.22.74)<br><br><br>On PC2, I start sheepdog too and I obtain:<br># collie node list<br>The node had failed to join sheepdog<br>failed to get node list<br><br># corosync-cpgtool<br>Group Name PID Node ID<br>sheepdog<br> 1070 1242963372 (172.29.22.74)<br>
1064 1310072236 (172.29.22.78)<br><br>And now, on PC1 I have:<br># collie node list<br>The node had failed to join sheepdog<br>failed to get node list<br><br># corosync-cpgtool<br>Group Name PID Node ID<br>sheepdog<br> 1070 1242963372 (172.29.22.74)<br> 1064 1310072236 (172.29.22.78)<br><br>In PC1 logs, there are this message:<br><br>get_cluster_status(362) joining node has invalid ctime, 5960354062892656328<br><br>but PC1 and PC2 use NTP and have exactly same date and time.<br>Do you know why sheepdog can't start ?<br><br><br>Thank you.<br><br><br><br>Note: my /etc/corosync/corosync.conf<br>compatibility: whitetank<br>totem {<br> version: 2<br> secauth: off<br>
threads: 0<br> interface {<br> ringnumber: 0<br> bindnetaddr: 172.29.0.0<br> mcastaddr: 226.94.1.1<br> mcastport: 5405<br> }<br> }<br> logging {<br> fileline: off<br> to_stderr: no<br> to_logfile: yes<br> to_syslog: yes<br> logfile: /var/log/corosync.log<br> debug: off<br> timestamp: on<br> logger_subsys {<br> subsys: AMF<br> debug: off<br> }<br> }<br> amf {<br> mode: disabled<br> }<br><br>-------------- next part --------------<br>A non-text attachment was scrubbed...<br>Name: last_logs_on_pc1.log<br>Type: text/x-log<br>Size: 2108 bytes<br>Desc: not available<br><span>URL: <<a target="_blank" href="http://lists.wpkg.org/pipermail/sheepdog/attachments/20110429/4cb26be6/attachment-0001.bin">http://lists.wpkg.org/pipermail/sheepdog/attachments/20110429/4cb26be6/attachment-0001.bin</a>></span><br><br>------------------------------<br></div></div>
</div></body></html>