[Sheepdog] Small sheepdog howto

PCextreme B.V. - Wido den Hollander wido at pcextreme.nl
Tue Jan 26 11:55:37 CET 2010


Hi,

There is no "corosync-cpgtool" under Ubuntu 9.10.

I'm using the version shipped with Ubuntu 9.10, which in my case is
corosync 1.0.

After changing my "hosts" file so that it contains only a single line
with "127.0.0.1", localhost and my hostname, I am getting different
errors when starting collie:

Jan 26 11:33:56 wido-desktop collie: Target daemon logger with pid=9909 started!
Jan 26 11:33:57 wido-desktop collie: worker_routine(175) started this thread 0
Jan 26 11:33:57 wido-desktop collie: last message repeated 2 times
Jan 26 11:33:57 wido-desktop collie: create_cluster(756) Failed to join the sheepdog group, try again
Jan 26 11:33:57 wido-desktop collie: worker_routine(175) started this thread 0
Jan 26 11:33:58 wido-desktop collie: create_cluster(756) Failed to join the sheepdog group, try again

All 4 of my nodes now give this error.

After turning off encryption in corosync and changing the /etc/hosts
file to something like:

127.0.0.1	localhost
192.168.6.16	sheepdog01
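
A hosts file like the one above can be checked with a few lines of
Python. This is only a rough sketch, not part of sheepdog or corosync:
the function name names_on_loopback is made up for illustration, and the
sample strings just mirror the two hosts files described in this mail.
It lists every name that a hosts file maps to a loopback address, so the
machine's own hostname should not show up in the result.

```python
import ipaddress


def names_on_loopback(hosts_text):
    """Return the names that a hosts-file text maps to a loopback address."""
    names = set()
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue
        fields = line.split()
        address, aliases = fields[0], fields[1:]
        try:
            ip = ipaddress.ip_address(address)
        except ValueError:
            continue  # not a plain IP address, skip the line
        if ip.is_loopback:  # 127.0.0.0/8 or ::1
            names.update(aliases)
    return names


# Illustrative inputs based on the two hosts files described in this mail.
failing = "127.0.0.1\tlocalhost wido-desktop\n"
working = "127.0.0.1\tlocalhost\n192.168.6.16\tsheepdog01\n"

print(sorted(names_on_loopback(failing)))
print(sorted(names_on_loopback(working)))
```

With the first input the hostname is reported next to localhost, with
the second only localhost is.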


My sheepdog is working:

  Idx	Node id (FNV-1a)    - Host:Port
--------------------------------------------------
  0	3207b6b8f585c0b6 - 192.168.6.32:7000
* 1	4d90ac8be7d92a5c - 192.168.6.10:7000
  2	8cf20c574f9095db - 192.168.6.17:7000
  3	a9158087adfa4d46 - 192.168.6.16:7000

So the hostname of the machine must resolve to the "external"
(non-loopback) IP of the machine, not to 127.0.0.1.
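
To test this on a node before starting collie, something like the
following Python sketch could be used. It assumes (I have not checked
the source) that collie takes its address from resolving the local
hostname, which is what the node lists in this thread suggest. The
helper names resolved_addresses and is_loopback_only are made up for
this example.

```python
import ipaddress
import socket


def resolved_addresses(hostname):
    """Return the set of IP address strings the resolver gives for hostname."""
    infos = socket.getaddrinfo(hostname, None)
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the address string for both IPv4 and IPv6.
    return {info[4][0] for info in infos}


def is_loopback_only(addresses):
    """True if there is at least one address and all of them are loopback."""
    if not addresses:
        return False
    return all(
        # Strip an IPv6 zone suffix such as "%eth0" before parsing.
        ipaddress.ip_address(addr.split("%")[0]).is_loopback
        for addr in addresses
    )
```

Running is_loopback_only(resolved_addresses(socket.gethostname())) on
each node should return False. If it returns True, the hostname still
points at 127.x.x.x in /etc/hosts.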

-- 
Kind regards,

Wido den Hollander
Head of System Administration / CSO
Phone Support Netherlands: 0900 9633 (45 cpm)
Phone Support Belgium: 0900 70312 (45 cpm)
Phone Direct: (+31) (0)20 50 60 104
Fax: +31 (0)20 50 60 111
E-mail: support at pcextreme.nl
Website: http://www.pcextreme.nl
Knowledge base: http://support.pcextreme.nl/
Network status: http://nmc.pcextreme.nl


On Tue, 2010-01-26 at 10:21 +0000, Piavlo wrote:
> Hi,
> 
> Do you see something similar to this? (The output should be the same
> on all cluster nodes.)
> 
> shell> corosync-cpgtool
> Group Name             PID         Node ID
> sheepdog
>                        790        53364868 (192.168.0.1)
>                       7159        86919300 (192.168.0.2)
>                      26995        70142084 (192.168.0.3)
> shell>
> 
> Also have a look at the corosync log (the file set with logfile: in
> corosync.conf); maybe you'll find the source of your problem there.
> 
>  Alex
> 
> PCextreme B.V. - Wido den Hollander wrote:
> > Hi,
> >
> > The patch fails to apply, so I did it manually, but it didn't work:
> >
> > Jan 26 10:45:10 wido-desktop collie: Target daemon logger with pid=7773 started!
> > Jan 26 10:45:11 wido-desktop collie: worker_routine(175) started this thread 0
> > Jan 26 10:45:11 wido-desktop collie: last message repeated 3 times
> > Jan 26 10:45:11 wido-desktop collie: create_cluster(816) failed to get address info
> > Jan 26 10:45:11 wido-desktop collie: main(105) failed to create sheepdog cluster.
> >
> > I reverted to the current Git tree and removed the 127.0.1.1 entry
> > from my /etc/hosts; now shepherd shows:
> >
> >   Idx	Node id (FNV-1a)    - Host:Port
> > --------------------------------------------------
> >   0	52b76e70de45e6c8 - 127.0.0.1:7000
> >   1	52b76e70de45e6c8 - 127.0.0.1:7000
> > * 2	52b76e70de45e6c8 - 127.0.0.1:7000
> >   3	52b76e70de45e6c8 - 127.0.0.1:7000
> >   4	52b76e70de45e6c8 - 127.0.0.1:7000
> >
> >   
> 



