[Sheepdog] corosync crashes
Steven Dake
sdake at redhat.com
Mon Oct 4 16:42:10 CEST 2010
On 10/03/2010 11:43 PM, Robert Terhaar wrote:
> Hi All,
>
> I'm having some problems with corosync crashes in Fedora 14.
> Occasionally after pushing a lot of traffic thru sheepdog, corosync
> crashes, and writes to the logs a fairly unhelpful message
> "corosync[23367]: [TOTEM ] FAILED TO RECEIVE"
>
> I've attached my very basic corosync.conf below. Does this config look ok?
>
> compatibility: whitetank
>
This is caused by delayed multicast messages. There is a patch in the
below bz to address this issue.
https://bugzilla.redhat.com/show_bug.cgi?id=619496
Another workaround that might be helpful if you don't want to rebuild
your own package is changing the fail_recv_const (goes in totem
directive) to a very large value such as 5000.
Regards
-steve
> totem {
> version: 2
> secauth: off
> threads: 0
> interface {
> ringnumber: 0
> bindnetaddr: 192.168.10.185
> mcastaddr: 226.94.1.1
> mcastport: 5405
> }
> }
>
> logging {
> fileline: off
> to_stderr: no
> to_logfile: yes
> to_syslog: yes
> logfile: /var/log/cluster/corosync.log
> debug: on
> timestamp: on
> logger_subsys {
> subsys: AMF
> debug: off
> }
> }
>
> amf {
> mode: disabled
> }
>
>
> and my corosync version info:
>
> # yum info corosync
> Installed Packages
> Name : corosync
> Arch : x86_64
> Version : 1.2.8
> Release : 1.fc14
>
More information about the sheepdog
mailing list