[Sheepdog] corosync crashes

Steven Dake sdake at redhat.com
Mon Oct 4 16:42:10 CEST 2010


On 10/03/2010 11:43 PM, Robert Terhaar wrote:
> Hi All,
>
> I'm having some problems with corosync crashes in Fedora 14.
> Occasionally after pushing a lot of traffic thru sheepdog, corosync
> crashes, and writes to the logs a fairly unhelpful message
> "corosync[23367]:   [TOTEM ] FAILED TO RECEIVE"
>
> I've attached my very basic corosync.conf below. Does this config look ok?
>
> compatibility: whitetank
>

This is caused by delayed multicast messages.  There is a patch in the
Bugzilla entry below that addresses this issue.

https://bugzilla.redhat.com/show_bug.cgi?id=619496

Another workaround, which might be helpful if you don't want to rebuild
your own package, is to raise fail_recv_const (it goes in the totem
directive) to a very large value such as 5000.
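
As a rough sketch of that workaround, assuming the totem block from the
config quoted below is otherwise left unchanged, the setting would sit
alongside the other totem options.  The value 5000 is only the example
figure mentioned above, not a tuned recommendation:

    totem {
      version: 2
      secauth: off
      threads: 0
      # workaround: tolerate more failed receives before totem gives up
      fail_recv_const: 5000
      interface {
        ringnumber: 0
        bindnetaddr: 192.168.10.185
        mcastaddr: 226.94.1.1
        mcastport: 5405
      }
    }

corosync would need to be restarted on each node for the change to take
effect.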

Regards
-steve


> totem {
>   version: 2
>   secauth: off
>   threads: 0
>   interface {
>   ringnumber: 0
>   bindnetaddr: 192.168.10.185
>   mcastaddr: 226.94.1.1
>   mcastport: 5405
>   }
> }
>
> logging {
>   fileline: off
>   to_stderr: no
>   to_logfile: yes
>   to_syslog: yes
>   logfile: /var/log/cluster/corosync.log
>   debug: on
>   timestamp: on
>   logger_subsys {
>   subsys: AMF
>   debug: off
>   }
> }
>
> amf {
>   mode: disabled
> }
>
>
> and my corosync version info:
>
> # yum info corosync
> Installed Packages
> Name        : corosync
> Arch        : x86_64
> Version     : 1.2.8
> Release     : 1.fc14
>



