[Sheepdog] corosync crashes

Steven Dake sdake at redhat.com
Mon Oct 4 19:37:40 CEST 2010


On 10/04/2010 10:32 AM, Robert Terhaar wrote:
> On Oct 4, 2010 10:40 AM, "Steven Dake" <sdake at redhat.com> wrote:
>  > On 10/03/2010 11:43 PM, Robert Terhaar wrote:
>  >> Hi All,
>  >>
>  >> I'm having some problems with corosync crashes in Fedora 14.
>  >> Occasionally, after pushing a lot of traffic through sheepdog, corosync
>  >> crashes and writes a fairly unhelpful message to the logs:
>  >> "corosync[23367]: [TOTEM ] FAILED TO RECEIVE"
>  >>
>  >> I've attached my very basic corosync.conf below. Does this config
>  >> look ok?
>  >>
>  >> compatibility: whitetank
>  >>
>  >
>  > This is caused by delayed multicast messages. There is a patch in the
>  > Bugzilla entry below that addresses this issue.
>  >
>  > https://bugzilla.redhat.com/show_bug.cgi?id=619496
>  >
>  > Another workaround that might be helpful if you don't want to rebuild
>  > your own package is changing the fail_recv_const (goes in totem
>  > directive) to a very large value such as 5000.
>  >
>  > Regards
>  > -steve
>  >
>  >
>
> Thanks Steve!
>
> Setting fail_recv_const:5000 seems to have fixed the problem. Is there
> any chance that the corosync patch will make it into Fedora 14's
> corosync package?
>

Yes, it is coming.  Corosync upstream releases are blocked on an
impending svn->git source control conversion.  I hope to have that
sorted out in 1-2 weeks.

> Also, I think it would be very helpful for new users if there were a
> sheepdog wiki article with a recommended corosync.conf example. Are
> there any plans for a sheepdog wiki?
>

Not sure on this point.

Regards
-steve
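
For reference, a minimal sketch of where the fail_recv_const workaround
from this thread would sit in corosync.conf. Only "compatibility:
whitetank" and the value 5000 come from the messages above; "version: 2"
is the usual totem version setting and is assumed here, and everything
else (interface, logging and so on) is left out rather than guessed.

```
# Sketch only: merge into an existing corosync.conf, do not use as-is.
compatibility: whitetank

totem {
	# Assumed: the standard totem config version.
	version: 2

	# Workaround from this thread: tolerate many more token rotations
	# with missing messages before the FAILED TO RECEIVE state is hit.
	fail_recv_const: 5000

	# interface { ... } and other site-specific settings omitted.
}
```

corosync needs to be restarted on each node for the change to take
effect. This is a workaround for delayed multicast messages and does not
replace the patch in the bug report.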
