[Sheepdog] corosync crashes

Robert Terhaar rterhaar at atlanticdynamic.com
Mon Oct 4 19:32:02 CEST 2010


On Oct 4, 2010 10:40 AM, "Steven Dake" <sdake at redhat.com> wrote:
> On 10/03/2010 11:43 PM, Robert Terhaar wrote:
>> Hi All,
>>
>> I'm having some problems with corosync crashes in Fedora 14.
>> Occasionally after pushing a lot of traffic thru sheepdog, corosync
>> crashes, and writes to the logs a fairly unhelpful message
>> "corosync[23367]: [TOTEM ] FAILED TO RECEIVE"
>>
>> I've attached my very basic corosync.conf below. Does this config look
ok?
>>
>> compatibility: whitetank
>>
>
> This is caused by delayed multicast messages. There is a patch in the
> below bz to address this issue.
>
> https://bugzilla.redhat.com/show_bug.cgi?id=619496
>
> Another workaround that might be helpful if you don't want to rebuild
> your own package is changing the fail_recv_const (goes in totem
> directive) to a very large value such as 5000.
>
> Regards
> -steve
>
>

Thanks Steve!

Setting fail_recv_const:5000 seems to have fixed the problem. Is there any
chance that the corosync patch will make it into Fedora 14's corosync
package?

Also, I think it would be very helpful for new users if there was a sheepdog
wiki article with a recommended corosync.conf example. Is there any plans
for a sheepdog wiki?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wpkg.org/pipermail/sheepdog/attachments/20101004/e20d724f/attachment.html>


More information about the sheepdog mailing list